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t  (ns) 

PD*  ' 

DESCRIPTION 

PAGE 

AmPAL22XP10 

24 

TTL 

20,  30,  40 

90,  180,  210 

XOR  gate 

5-271 

PAL20S10 

24 

TTL 

35 

240 

Product  term  steering 

5-103 

AmPAL22P10 

24 

TTL 

15,  25 

105,  210 

24-pin  superset 

5-291 

Am/PAL20L10 

24 

TTL 

15,  20,  25,  30 

105,  165,  210 

1 0  outputs 

5-113,  5-306 

PAL20L8 

24 

TTL,  E  CMOS 

15,  25,  35,  45 

0.1,  105,  210 

Standard 

5-122 

PAL6L16 

24 

TTL 

25 

90 

Wide  output 

5-141 

PAL8L14 

24 

TTL 

25 

90 

Wide  output 

5-141 

PAL12L10 

24 

TTL 

40 

100 

Simple  combinatorial 

5-147 

PAL14L8 

24 

TTL 

40 

100 

Simple  combinatorial 

5-147 

PAL16L6 

24 

TTL 

40 

100 

Simple  combinatorial 

5-147 

PAL18L4 

24 

TTL 

40 

100 

Simple  combinatorial 

5-147 

PAL20L2 

24 

TTL 

40 

100 

Simple  combinatorial 

5-147 

PAL20C1 

24 

TTL 

40 

100 

Simple  combinatorial 

5-147 
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PAGE 

AmPA!_C29MA16 

24 

EE  CMOS 

20,  15 

120 

Advanced  Async.  Macro 

5-209 

AmPALC29M16 

24 

EE  CMOS 

20,  15 

120 

Advanced  Macrocell 

5-231 

PAL32VX1 0 

24 

TTL 

25,22 

180 

J-K,  varied  terms 

5-70 

Am/PAL22V10 

24 

TTL.  E  CMOS 

40,  33,  28.5,  20,  18 

90,  180 

Versatile 

5-79,  5-249 

PAL22RX8 

24 

TTL 

28.5 

210 

J-K  flip-flops 

5-87 

PAL20RA10 

24 

TTL 

20 

200 

Asynchronous 

5-95 

AmPAL20XRP10 

24 

TTL 

30,  22,  14 

105,  180,  210 

XOR  gate  &  polarity 

5-271 

AmPAL20XRP8 

24 

TTL 

30,  22,  14 

105,  180,  210 

XOR  gate  &  polarity 

5-271 

AmPAL20XRP6 

24 

TTL 

30,  22,  14 

105,  180,  210 

XOR  gate  &  polarity 

5-271 

AmPAL20XRP4 

24 

TTL 

30,  22,  14 

105,  180,  210 

XOR  gate  &  polarity 

5-271 

PAL20RS10 

24 

TTL 

20 

240 

Product  term  steering 

5-103 

PAL20RS8 

24 

TTL 

20 

240 

Product  term  steering 

5-103 

PAL20RS4 

24 

TTL 

20 

240 

Product  term  steering 

5-103 

PAL20X10 

24 

TTL 

22 

180 

XOR  gate 

5-113 

PAL20X8 

24 

TTL 

22 

180 

XOR  gate 

5-113 

PAL20X4 

24 

TTL 

22 

180 

XOR  gate 

5-113 

AmPAL20RP10 

24 

TTL 

37.  25 

105,  210 

Programmable  polarity 

5-291 

AmPAL20RP8 

24 

TTL 

37,25 

105,  210 

Programmable  polarity 

5-291 

AmPAL20RP6 

24 

TTL 

37,  25 

105,  210 

Programmable  polarity 

5-291 

AmPAL20RP4 

24 

TTL 

37,  25 

105,  210 

Programmable  polarity 

5-291 

PAL20R8 

24 

TTL,  E  CMOS 

37.  25,  20,  15 

0.1,  105,  210 

Standard 

5-122 

PAL20R6 

24 

TTL,  E  CMOS 

37,  25,  20,  15 

0.1,  105,  210 

Standard 

5-122 

PAL20R4 

24 

TTL,  E  CMOS 

37,  25,  20,  15 

0.1,  105,  210 

Standard 

5-122 

PAL32R16 

40 

TTL 

16 

280 

MegaPAL™  device 

5-158 
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AmPAL  18P8 

20 

TTL 

15  25 

55  QO  1 ftO 

Pfl-nin  <si  inor^ot 

L-  *J    pill  SU^JdOeL 

5-202 

PAL16P8 

20 

TTL 

25 

180 

Prnnrammshlp  rvilaritv 

i  i  uy  i  ai  i  ii  iiuuio  r^*^  / 

5-17 

Am/PAL1 6L8  ■ 

20 

TTL,  E  CMOS 

10,  15,  25,  35,  55 

0.1,  45,  90,  180 

Standard 

5-26,  5-184 

PAL10H8 

20 

TTL 

35 

90 

Simple  combinatorial 

5-56 

PAL10L8 

20 

TTL 

35 

90 

Simple  combinatorial 

5-56 

PAL12H6 

20 

TTL 

35 

90 

Simple  combinatorial 

5-56 

PAL12L6 

20 

TTL 

35 

90 

Simple  combinatorial 

5-56 

PAL14H4 

20 

TTL 

35 

90 

Simple  combinatorial 

5-56 

PAL14L4 

20 

TTL 

35 

90 

Simple  combinatorial 

5-56 

PAL16H2 

20 

TTL 

35 

90 

Simple  combinatorial 

5-56 

PAL16L2 

20 

TTL 

35 

90 

Simple  combinatorial 

5-56 

PAL16C1 

20 

TTL 

40 

0 

Simple  combinatorial 

5-56 
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PAGE 

AmPAL23S8 

20 

TTL 

33,  28.5 

200 

Buried  registers 

5-169 

PAL16RA8 

20 

TTL 

20 

170 

Asynchronous 

5-11 

PAL16RP8 

20 

TTL 

25 

180 

Programmable  polarity 

5-17 

PAL16RP6 

20 

TTL 

25 

180 

Programmable  polarity 

5-17 

PAL1 6RP4 

20 

TTL 

25 

180 

Programmable  polarity 

5-17 

Am/PAL1 6R8 

20 

TTL,  E  CMOS 

58,  40,  28.5,  18,  11 

0.1,  45,  90,  180 

Standard 

5-26,5-184 

Am/PAL16R6 

20 

TTL,  E  CMOS 

58,  40,  28.5,  18 

11 

0.1,  45,  90,  180 

Standard 

5-26,  5-1 84 

Am/PAL16R4 

20 

TTL,  E  CMOS 

58,  40,  28.5,  18 

11 

0.1,  45,  90,  180 

Standard 

5-26,  5-184 

PAL16X4 

20 

TTL 

14 

225 

Arithmetic 

5-51 
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PAGE 

PMS14R21 

24 

TTL 

30,  25 

210 

PROSE  Sequencer 

5-315 

PLS167 

24 

TTL 

33 

200 

Programmable  Logic  Seq. 

5-331 

PLS168 

24 

TTL 

33 

200 

Programmable  Logic  Seq. 

5-331 

PLS105 

24 

TTL 

37 

200 

Programmable  Logic  Seq. 

5-331 

Am29PL141 

28 

TTL 

20 

450 

Fuse  Prog.  Controller 

5-339 

Am2971 

24 

TTL 

85 

310 

Prog.  Event  Generator 

5-365 
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PAGE 

PAL10H20EV/EG8 

24 

ECL  10KH 

125  MHz 

220 

Registered/latched 

5-381 

PAL10020EV/EG8 

24 

ECL  100K 

125  MHz 

220 

Registered/latched 

5-381 

PAL10H20G8 

24 

ECL  10KH 

6  ns 

225 

Latched 

5-382 

PAL10H20P8 

24 

ECL  10KH 

6  ns 

210 

Combinatorial 

5-386 
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PAGE 

M2064 

48,68 

RAM  CMOS 

70,  50,  33 

5 

Logic  Cell  Array 

5-483 

M2018 

68,84 

RAM  CMOS 

50,  33 

5 

Logic  Cell  Array 

5-483 
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©  1988  Advanced  Micro  Devices,  Inc. 


Advanced  Micro  Devices  reserves  the  right  to  make  changes 
in  its  products  without  notice  in  order  to  improve  design  or  performance  characteristics. 
The  performance  characteristics  listed  in  this  document  are  guaranteed  by  specific  tests, 
correlated  testing,  guard  banding,  design  and  other  practices  common  to  the  industry. 

For  specific  testing  details  contact  your  local  AMD  sales  representative. 
The  company  assumes  no  responsibility  for  the  use  of  any  circuits  described  herein. 

901  Thompson  Place,  P.O.  Box  3453,  Sunnyvale,  California  94088 
(408)  732-2400  TWX:  910-339-9280  TELEX:  34-6306 


In  late  1 987,  the  two  programmable  logic  market  leaders,  Monolithic  Memories  and  Advanced  Micro 
Devices,  merged  into  one  great  company.  We  combined  our  strong  traditions  of  customer  service 
and  innovation  to  offer  you  the  best  line  of  programmable  logic  devices.  The  1988  Data  Book 
presents  the  technical  specifications  on  the  entire  product  line.  The  separate  1988  Handbook 
presents  all  of  the  necessary  support  material,  whether  you  are  accustomed  to  using  MMI  or 
AMD  products. 

AMD/MMI  has  the  products,  manufacturing  capacity  and  technology  to  perpetuate  the  leadership 
in  the  programmable  logic  field  which  we  pioneered.  In  the  1988  Data  Book you  will  notice  that  we 
not  only  have  the  broadest  programmable  logic  line,  but  we  also  have  the  industry's  most 
comprehensive  CMOS  programmable  logic  line.  We  think  that  you  will  find  the  Data  Book  and 
Handbook  informative  and  useful.  If  you  have  any  comments  or  questions  on  the  books  or  the 
product  line,  please  contact  us. 


Cyrus  Tsui 


Vice  President 

Programmable  Logic  Division 


Description 


This  1988  PAL  Device  Handbook/Data  Book  is  your  complete  guide  to  all  programmable  logic 
devices  (PLDs)  from  Monolithic  Memories  and  Advanced  Micro  Devices.  The  merger  of  the  two 
companies  provides  a  greater  wealth  of  products  and  services  for  you.  Note  that  all  PLDs  which 
were  in  production  before  the  merger  are  still  being  produced. 

The  PAL  Device  Handbook/Data  Book  is  organized  into  two  volumes  and  six  easy-to-use  sections: 


PAL  Device  Handbook 

Section  1:  Introduction 

Includes  an  overview  of  the  PLD  product  family. 

Section  2:  Applications 

Includes  detailed  application  examples.  The  first  few  chapters  provide  tutorials  in  PLD  design. 
The  application  notes  are  grouped  by  application  area. 


PAL  Device  Data  Book 

Section  3:  Programming  and  Quality 

Includes  information  on  PLD  software  programs,  programming  information,  PLD  technology  and 
quality  discussions,  and  package  information. 

Section  4:  PALASM  2  Software  User  Documentation 

Includes  complete  documentation  for  PALASM  2  software. 

Section  5:  Data  Sheets 

Includes  specifications  for  all  PLDs  from  the  combined  company.  PAL  devices  formerly  from  MMI 
are  under  "PAL  Devices,"  while  PAL  devices  formerly  from  AMD  are  under  "AmPAL  Devices." 

Section  6:  Appendices 

Includes  quick  reference  information. 

If  you  have  any  questions  or  comments  on  PLDs  or  any  other  products,  please  contact  your  most 
convenient  AMD/MMI  sales  office,  listed  at  the  end  of  each  book. 
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Introduction 


The  Programmable  Array  Logic  device,  commonly  known  as  the 
PAL®  device,  was  invented  at  Monolithic  Memories  over  1 2  years 
ago.  The  concept  for  this  revolutionary  type  of  device  sprang  forth 
as  a  simple  solution  to  the  shortcomings  of  discrete  TTL  logic. 

The  successfully  proven  PROM  technology  which  allowed  the 
end  user  to  "write  on  silicon"  provided  the  technological  basis 
which  made  this  kind  of  device  not  only  possible,  but  very  popular 
as  well. 

The  availability  of  design  software  made  it  much  easier  to  design 
with  programmable  logic.  As  designers  were  freed  from  the 
drudgery  of  low-level  implementation  issues,  new  complex  de- 
signs were  easier  to  implement,  and  could  be  completed  more 
quickly. 

This  chapter  outlines  some  basic  information  essential  to  those 
who  are  unfamiliar  with  Programmable  Logic  devices  (PLDs). 
The  information  may  also  be  useful  to  those  who  are  current  users 
of  programmable  logic.  The  specific  issues  which  need  to  be 
addressed  are: 

•  What  is  a  PLD? 

•  What  other  implementations  are  possible? 

•  What  advantages  do  PLDs  have  over  other 
implementations? 

What  is  a  PLD? 

In  general,  a  programmable  logic  device  is  a  circuit  which  can  be 
configured  by  the  user  to  perform  a  logic  function.  Most  "stan- 
dard" PLDs  consist  of  an  AND  array  followed  by  an  OR  array, 
either  (or  both)  of  which  is  programmable.  Inputs  are  fed  into  the 
AND  array,  which  performs  the  desired  AND  functions  and 
generates  product  terms.  The  product  terms  are  then  fed  into  the 
OR  array.  In  the  OR  array,  the  outputs  of  the  various  product 
terms  are  combined  to  produce  the  desired  outputs.  There  are 
three  fundamental  types  of  standard  PLD:  the  PROM,  the  PAL 
device,  and  the  PLS  device. 

PROMs 

PROMs  are  usually  thought  of  as  memory  elements.  However, 
the  PROM  has  a  fixed  AND  array  (which  decodes  the  memory 
address)  followed  by  a  programmable  OR  array  (Figure  1 ).  For 


each  of  a  given  set  of  input  combinations  (addresses),  it  gener- 
ates a  value  which  has  been  programmed  into  the  device. 
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Figure  1.  PROM  Array  Structure 
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PAL  Devices 

The  PAL  device  has  a  programmable  AND  array  followed  by  a 
fixed  OR  array  (Figure  2).  The  fact  that  the  AND  array  is 
programmable  makes  it  possible  for  the  devices  to  have  many 
inputs.  ThefactthattheOR  array  isfixed  makes  the  devices  small 
(which  means  less  expensive)  and  fast. 


PLS  Devices 


The  PLS  (Programmable  Logic-based  sequencer)  devices  are 
based  on  the  standard  PLA  architecture  (Figure  3),  where  both 
the  AND  and  the  OR  arrays  are  programmable.  This  arrange- 
ment allows  for  greater  overall  flexibility.  The  architecture  is  a  bit 
more  costly  in  terms  of  die  size  and  speed,  so  for  simple  logic 
functions,  a  PAL  device  is  usually  more  cost  effective.  However, 
this  architecture  is  very  effective  for  sequencers,  where  the 
flexibility  allows  larger  state  machines  than  might  fit  in  a  PAL 
device. 

Other  PLDs 

In  addition  to  the  basic  sum-of-products  PLDs,  some  more 
complex  PLDs  dedicated  to  sequencing  are  available,  most 
notably  the  PROSE™  device  and  the  Am29PL1 41 .  Their  archi- 
tectures are  described  elsewhere  in  the  handbook,  but  their  fun- 
damental benefits  are  the  same  as  those  of  the  more  traditional 
PLDs. 


In  practice,  the  distinctions  between  architectures  are  not  as 
significant  as  the  differences  between  the  types  of  functions  to  be 
performed.  Forthis  reason,  this  handbook  is  organized  primarily 
into  discussions  about  PAL  devices  and  discussions  about  se- 
quencers, whether  those  sequencers  be  implemented  in  PAL 
devices,  PLS  devices,  or  one  of  the  dedicated  sequencers. 

What  Other  Implementations  Are 
Possible? 

There  are  essentially  four  alternatives  to  programmable  logic: 

•  Discrete  Logic 

•  Gate  Arrays 

•  Standard  Cell  Circuits 

•  Full  Custom  Circuits 

Discrete  Logic 

Discrete  logic,  or  conventional  TTL  logic,  has  the  advantage  of 
familiarity;  hence  its  popularity.  It  is  also  quite  inexpensive  when 
only  unit  cost  is  considered.  The  drawback  is  that  the  implemen- 
tation of  even  a  simple  portion  of  a  system  may  require  many  units 
of  discrete  logic.  There  are  "hidden"  costs  associated  with  each 
unit  that  goes  into  a  system,  which  can  render  the  overall  system 
more  expensive. 
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Designing  with  discrete  chips  can  also  be  very  tedious.  Each 
design  decision  directly  affects  the  layout  of  the  board.  Changes 
are  difficult  to  make.  Thedesign  is  also  more  difficult  to  document, 
making  it  harder  to  debug  and  maintain  later.  These  items  all 
contribute  to  a  long  design  cycle  when  discrete  chips  are  used 
extensively. 

Gate  Arrays 

Gate  arrays  have  been  increasing  in  popularity.  The  attractive- 
ness of  this  solution  lies  in  the  device's  flexibility.  By  packing  the 
functions  into  the  device,  a  great  majority  of  the  available  silicon 
is  actually  used.  Since  such  a  device  is  customized  for  an 
application,  it  would  seem  to  be  the  optimum  device  for  that 
application. 


then  assembled  and  collected  to  perform  the  desired  functions. 
This  can  ideally  lead  to  reduced  turnaround  from  conception  to 
implementation,  and  a  much  more  efficient  circuit. 

The  drawback  is  that  even  though  the  individual  components  of 
the  circuit  have  been  laid  out,  a  complete  layout  must  still  be  per- 
formed to  arrange  the  cells.  Instead  of  just  customizing  the  metal 
interconnections,  as  is  done  in  a  gate  array,  the  circuit  must  be 
developed  from  the  bottom  up.  Development  costs  can  be  even 
higher  than  for  gate  arrays,  and  despite  the  standard  cell  concept, 
turnaround  time  often  tends  to  be  longer  than  planned.  Again,  the 
volume  must  be  sufficiently  high  to  warrant  the  development 
costs. 

Full  Custom  Circuits 


However,  one  also  pays  substantial  development  costs,  espe- 
cially in  the  case  of  a  design  which  needs  changes  after  silicon 
has  already  been  processed.  Even  though  the  unit  costs  are 
generally  quite  low  for  gate  arrays,  the  volumes  required  to  make 
their  use  worthwhile  excludes  them  as  a  solution  for  many 
designers.  This  fact,  added  to  the  long  design  cycle  and  high  risk 
involved,  make  this  solution  practical  for  only  a  limited  number  of 
designers. 

Standard  Cell  Circuits 


Full  custom  designs  require  that  a  specific  chip  be  designed  from 
scratch  to  perform  the  needed  functions.  The  intent  is  to  provide 
a  solution  which  gives  the  designer  exactly  what  is  needed  for  the 
application  in  question;  no  more  and  no  less.  Ideally,  not  asquare 
micron  of  silicon  is  wasted.  This  normally  results  in  the  smallest 
piece  of  silicon  possible  to  fit  the  needs  of  the  design,  which  in  turn 
reduces  the  system  cost.  Understandably,  though,  development 
costs  and  risks  for  such  a  design  are  extremely  high,  and  volumes 
must  be  commensurately  high  in  order  for  such  a  solution  to  be 
of  value. 


Standard  cell  circuits  are  quite  similar  to  gate  arrays,  their  main 
advantage  being  that  they  consist  of  a  collection  of  different  parts 
of  circuits  which  have  already  been  debugged.  These  circuits  are 
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What  Advantages  do  PLDs  Have  Over 
Other  Implementations? 

As  user-programmable  semicustom  circuits,  PLDs  provide  a 
valuable  compromise  which  combines  many  of  the  benefits  of 
discrete  logic  with  many  of  the  benefits  of  other  semicustom 
circuits.  The  overall  advantages  can  be  found  in  several  areas: 

•  Ease  of  design 

•  Performance 

•  Reliability 

•  Cost  savings 

Ease  of  Design 

The  support  tools  available  for  use  in  designing  with  PLDs  greatly 
simplify  the  design  process  by  making  the  lower-level  implemen- 
tation details  transparent.  In  a  matter  of  one  or  two  hours,  a  first 
time  PLD  user  can  learn  to  design  with  a  PAL  device,  program  it, 
and  implement  the  design  in  a  system. 

The  design  support  tools  consist  of  design  software  and  a 
programmer.  The  design  software  is  used  in  generating  the 
design;  the  programmer  is  used  to  configure  the  device.  The 
software  provides  the  link  between  the  higher-level  design  and 
the  low-level  programming  details. 

All  of  the  available  design  software  packages  (of  which  Monolithic 
Memories'  PALASM®  software  is  the  most  widely  used)  perform 
essentially  the  same  tasks.  The  design  is  specified  with  relatively 
high-level  constructs;  the  software  takes  the  design  and  converts 
it  into  a  form  which  the  programmer  uses  to  configure  the  PLD. 
Most  software  packages  provide  logic  simulation,  which  allows 
one  to  debug  the  design  before  actually  programming  a  device. 
The  high-level  design  file  also  serves  as  documentation  of  the 
design.  This  documentation  can  be  even  easier  to  understand 
than  traditional  schematics. 

Depending  on  the  capabilities  desired,  a  device  programmer  can 
cost  anywhere  from  under  $1,000.00  to  around  $15,000.00  for  a 
high-volume  production  programmer.  Many  PLD  users  do  not 
find  it  necessary  to  purchase  a  programmer;  it  is  often  quite  cost 
effective  and  convenient  to  have  either  the  manufacturer  or  an 
outside  distributor  do  the  programming  for  them.  For  design  and 
prototyping,  though,  it  is  very  helpful  to  have  a  programmer;  this 
allows  one  to  implement  designs  immediately. 

The  convenience  of  programmable  logic  lies  in  the  ability  to 
customize  a  standard,  off-the-shelf  product.  PLDs  can  be  found 
in  stock  to  suit  a  wide  range  of  speed  and  power  requirements. 
The  variety  of  architectures  available  also  allows  a  choice  of  the 
proper  functionality  for  the  application  at  hand.  Thus  a  design  can 
be  implemented  using  a  standard  device,  with  the  end  result 
essentially  being  a  custom  device.  If  a  design  change  is  needed, 
it  is  a  simple  matter  to  edit  the  original  design  and  then  program 
a  new  device,  or,  in  the  case  of  reprogrammable  CMOS  devices, 
erase  and  reprogram  the  old  device. 

Board  layout  is  vastly  simplified  with  the  use  of  programmable 
logic.  PLDs  offer  great  flexibility  in  the  location  of  inputs  and 
outputs  on  the  device.  Since  larger  functions  are  implemented 
inside  the  PLD,  board  layout  can  begin  once  the  inputs  and 


outputs  are  known.  The  details  of  what  will  actually  be  inside  the 
PLD  can  be  worked  out  independently  of  the  layout.  In  many 
cases,  any  needed  design  changes  can  be  taken  care  of  entirely 
within  the  PLD,  and  will  not  affect  the  PC  board. 

Performance 

Speed  is  one  of  the  main  reasons  that  designers  use  PALdevices. 
The  TTL  PAL  devices  presently  on  the  market  can  provide  equal 
or  better  performance  than  the  fastest  discrete  logic  available. 
ECL  PAL  devices  extend  the  benefits  of  programmable  logic  to 
the  even  higher-speed  realm  of  ECL  logic.  Today's  fastest  PAL 
devices  are  being  developed  on  the  newest  technologies  to  gain 
every  extra  nanosecond  of  performance. 

Performance  cannot  come  strictly  at  the  expense  of  power 
consumption.  Since  PLDs  can  be  used  to  replace  several 
discrete  circuits,  the  power  consumption  of  a  PLD  may  well  be 
less  than  that  of  the  combined  discrete  devices.  As  more  PLDs 
are  developed  in  CMOS  technology,  the  option  for  even  lower 
power  becomes  available,  including  zero  standby  power  devices 
for  systems  which  can  tolerate  only  minute  standby  power  con- 
sumption. 

Reliability 

Reliability  is  an  area  of  increasing  concern.  As  systems  get  larger 
and  more  complex,  the  increase  in  the  amount  of  circuitry  tends 
to  reduce  the  reliability  of  the  system;  there  are  "more  things  to  go 
wrong".  Thus  a  solution  which  inherently  reduces  the  number  of 
chips  in  the  system  will  contribute  to  higher  reliability.  A  program- 
mable logic  approach  can  provide  device  quality  levels  up  to  50 
parts  per  million  (ppm),  while  also  providing  a  more  reliable 
solution  due  to  the  smaller  number  of  devices  required. 

With  the  reduction  in  units  and  board  space,  PC  boards  can  be 
laid  out  less  densely,  which  greatly  improves  the  reliability  of  the 
board  itself.  This  also  reduces  crosstalk  and  other  potential 
sources  of  noise,  making  the  operation  of  the  system  cleaner  and 
more  reliable. 

Cost 

For  any  design  approach  to  be  practical,  it  must  be  cost  effective. 
Cost  is  almost  always  a  factor  in  considering  a  new  design  or  a 
design  change.  But  the  calculation  of  total  system  cost  can  be 
misleading  if  not  all  aspects  are  considered.  Many  of  the  costs 
can  be  elusive  or  difficult  to  measure.  For  example,  it  is  difficult 
to  quantify  the  cost  of  market  share  lost  due  to  late  product 
introduction. 

The  greatest  savings  over  a  discrete  design  are  derived  from  the 
fact  that  a  single  PLD  can  replace  several  discrete  chips.  Board 
space  requirements  can  drop  by  25%  or  more  when  PLDs  are 
used.  Figure  4  illustrates  some  of  the  costs  of  the  various 
solutions  discussed  so  far,  with  many  of  the  factors  that  may  not 
always  be  considered  included  for  comparison.  These  involve 
such  items  as  inventory  costs,  inspection  costs,  test  costs,  board 
materials  costs,  and  of  course  the  very  costly  time  spent  design- 
ing and  debugging  such  systems,  and  isolating  and  replacing 
units  which  fail.  With  each  design  change,  the  cost  of  a  custom 
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Unit 
Cost 
Range 

NREs/ 
Development  Costs 

Purchase 
Volume 
For  Best 
Economics 

Design 
Flexibility 

Gate  Count 
per  Device 

Engineering 
Design 
Time 

Design 
Turnaround 
Time 

Cost 
of  Each 
Design  Change 

PLDs:  BLANK 

Low-Med 

None 

5K-200K 

Med 

200-2K 

1/2  Week 

Short 
(1-10  Days) 

Very  Low 

PLDs:  FACTORY 
PROGRAMMED 

Low-Med 

Low 

10K-200K 

Med 

200-2K 

3-10 
Weeks 

Moderate 
(8-10  Wks.) 

Med 

DISCRETE  LOGIC 

Low-Med 

None 

1K-10K 

Low 

10-30 

1  Week 

Short 
(1-10  Days) 

Med 

GATE  ARRAYS 

Low 

Med-High 

10K-200K 

Med-High 

1K-10K 

12^10 
Weeks 

Long 
(3-9  Mos.) 

High 

STANDARD  CELLS 

Low 

High 

100K-300K 

Med-High 

1K-10K 

26-52 
Weeks 

Long 
(6-12  Mos.) 

High 

FULL  CUSTOM 

Very  Low 

High 

200K  and  Up 

High 

1K-60K 

1-2  Yrs. 

Long 
(6-12  Mos.) 

High 

Figure  4.  Cost  and  Design-time  Comparisons 
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solution  rises  dramatically,  while  that  of  a  user-customizable 
approach  is  minimal.  The  relationship  between  the  various 
alternatives  is  summarized  in  Figure  5. 

Another  economic  benefit  of  the  use  of  PLDs  is  that  when  one  PAL 
device  is  used  in  several  different  designs,  as  is  often  the  case, 
the  user  has  not  com  mitted  that  device  to  any  one  of  the  particular 
designs  until  the  device  has  been  programmed.  This  means  that 
inventory  can  be  stocked  for  several  different  designs  in  the  form 
of  one  device.  As  requirements  change,  the  parts  can  be 
programmed  to  fit  the  need.  And  in  the  case  of  reprogrammable 
CMOS  devices,  one  is  not  committed  even  after  programming. 

There  is  also  a  cost-effective  PLD  solution  for  high-volume 
production.  Just  as  a  ROM  is  a  PROM  which  has  been  hard-wired 
for  mass  production,  HAL®  (Hard  Array  Logic)  devices  can  be 
produced  in  high  volumes  for  extra  cost  savings.  This  mask- 
programmed  version  can  be  produced  in  high  volume  with 
unparalleled  quality.  In  addition,  in  the  event  that  production 
quantities  for  your  system  show  an  unexpected  increase,  the 
equivalent  PAL  or  ProPAL™  devices  (in-factory  programmed 
PAL  devices)  can  be  quickly  obtained  and  programmed  in  your 
factory,  by  Monolithic  Memories,  or  through  distribution  to  cover 
the  temporary  shortage.  More  details  on  HAL  and  ProPAL 
devices  are  provided  on  page  3-104  of  the  Data  Book. 

One  final  subtle  cost  issue  is  derived  from  the  ease  with  which  a 
competitorcan  copy  a  design.  PLDs  have  a  unique  feature  called 
a  security  fuse,  whose  purpose  is  to  protect  a  design  from  being 
copied.  By  using  secured  PLDs  extensively  in  a  system,  one  can 


safely  avoid  having  one's  system  easily  deciphered.  The  added 
design  security  provided  by  this  feature  can  buy  extra  market 
time,  forcing  competitors  to  do  their  own  original  design  work 
rather  than  copying  the  designs  of  others. 

Summary 

Programmable  logic  provides  the  means  of  creating  semi-custom 
designs  with  readily  available  standard  components.  There  is  a 
wide  variety  of  PLDs;  PAL  devices  are  most  widely  used,  and 
perform  well  for  basic  logic  and  some  sequencing  functions. 
Other  dedicated  sequencers  provide  the  circuitry  required  to 
implement  more  complex  designs. 

By  assuming  some  of  the  attributes  of  gate  arrays,  programmable 
logic  provides  the  cost  savings  of  any  other  semicustom  device, 
without  the  extra  engineering  costs,  risks,  and  design  delays. 
Reliability  is  also  enhanced  as  quality  increases  and  board 
complexity  decreases. 

The  design  tasks  are  greatly  simplified  due  to  the  design  tools 
which  are  now  available.  Design  software  and  device  program- 
mers allow  top-down  high-level  designs,  with  a  minimum  of  time 
spent  on  actual  implementation  issues.  Simulation  allows  some 
design  debug  before  a  device  is  programmed. 

For  all  of  these  reasons,  programmable  logic  has  become,  and 
will  continue  to  be,  the  design  methodology  of  choice  among 
digital  systems  designers. 


DEVELOPMENT  TIME 


Figure  5.  Development  Cost  vs  Time  for  Alternative  Logic 
Implementations 
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Introduction 

Monolithic  Memories  and  Advanced  Micro  Devices  offer  a  wide 
variety  of  PLDs,  implemented  in  a  variety  of  technologies.  In  this 
section,  we  will  briefly  discuss  the  device  families,  and  look  at  the 
various  architecture,  speed,  and  power  options.  More  specific 
device  information  can  be  found  in  the  individual  data  sheets  in 
Section  5.  Discussions  on  some  of  the  special  architectural  fea- 
tures of  many  of  the  devices  can  also  be  found  in  their  respective 
datasheets. 

There  are  three  basic  PLD  areas  addressed  by  Monolithic 
Memories'  devices: 

•  PAL  devices— general  purpose 

•  Programmable  sequencers — state  machines 

•  LCA™  devices — high  density 

The  largest  application  area  is  that  covered  by  the  PAL  devices. 
There  is  a  wide  variety  of  PAL  devices,  ranging  from  simple 
devices  that  address  general  logic  design  problems  to  more 
sophisticated  devices  that  deal  with  more  complex  problems. 

There  is  also  a  series  of  sequencers  which  are  not  PAL  devices, 
featuring  architectures  particularly  well  suited  to  sequencing 
operations.  While  there  are  PAL  devices  that  work  well  as  state 
machines  in  addition  to  their  other  applications,  these  dedicated 
sequencers  have  given  up  some  of  their  generality  to  provide 
optimal  state  machine  solutions. 

The  final  area  covered  is  that  of  high-density  design,  addressed 
by  the  LCA  devices.  The  LCA  device  takes  the  approach  of  a 
programmable  gate  array  to  provide  a  PLD  with  many  usable 
gates. 

Design  software  radically  simplifies  the  design  of  any  circuit  in  a 
PLD.  PALASM  software  is  usedforallof  the  PLDs  exceptthe  LCA 
devices.  TheLCAdevicesmakeuseofXACT™  software  for  logic 
configuration.  The  PALASM  software  documentation  is  in 
Section  4;  XACT  software  is  discussed  more  fully  on 
page  3-17  of  the  Data  Book. 

PAL  Devices 

PAL  devices  are  available  in  three  different  technologies: 

•  TTL 

•  CMOS 

•  ECL 

The  CMOS  devices  often  provide  the  same  functions  as  the  TTL 
devices,  and  can  be  used  in  the  same  sockets,  with  the  added 
benefit  of  lower  power.  Thus  the  TTL  and  CMOS  devices  will  be 
discussed  together,  followed  by  a  summary  of  specific  CMOS 
issues.  ECL  devices  require  completely  different  design  consid- 
erations, and  cannot  be  interchanged  with  TTLor  CMOS  devices. 
They  willthereforebediscussed  separately. 

PAL  devices  generally  have  a  mnemonic  naming  scheme,  which 
provides  some  basic  information  about  the  devices'  capabilities. 


Because  Monolithic  Memories  and  Advanced  Micro  Devices 
recently  merged,  there  are  presently  two  slightly  different  naming 
conventions  in  use.  Some  devices  were  originally  Monolithic 
Memories'  devices;  some  were  Advanced  Micro  Devices';  some 
devices  were  produced  by  both  companies  before  the  merger.  In 
order  to  minimize  the  confusion  after  the  merger,  the  old  device 
names  are  being  maintained.  The  two  naming  conventions  will  be 
discussed  separately.  Throughout  this  handbook,  "AmPAL" 
designates  products  originally  from  Advanced  Micro  Devices. 

Note  that  all  products  that  Monolithic  Memories  and  Advanced 
Micro  Devices  were  producing  before  the  merger  are  still  in 
production. 

Monolithic  Memories  Nomenclature 

Monolithic  Memories  originally  set  the  PAL  device  naming  stan- 
dard. An  example  of  an  older  device  name  is  shown  in  Figure  1 . 

As  devices  have  become  faster,  and  as  CMOS  has  become 
available,  it  has  become  necessary  to  change  the  way  we 
describe  the  performance  of  newer  devices.  While  the  conven- 
tion is  slightly  different,  it  does  make  it  easier  to  see  exactly  what 
the  speed  grade  and  power  level  of  a  device  are.  An  example  of 
such  a  new  device  is  shown  in  Figure  2. 

Advanced  Micro  Devices 
Nomenclature 

The  Advanced  Micro  Devices  nomenclature  follows  the  Mono- 
lithic Memories  nomenclature  with  a  few  modifications.  Atypical 
older  device  name  is  shown  in  Figure  3. 

Advanced  Micro  Devices  also  found  the  need  to  modify  the 
nomenclature  with  the  advent  of  faster  devices  and  CMOS 
devices.  An  example  of  a  newer  device  is  shown  in  Figure  4. 

The  relationship  between  a  device  and  its  name  will  become 
apparent  in  the  Tables  following. 

The  package  designators  also  differ.  A  cross  reference  for  com- 
mercial packages  is  shown  in  Table  1 . 


MONOLITHIC 

ADVANCED 

PACKAGE 

MEMORIES 

MICRO  DEVICES 

Plastic  DIP 

N 

P 

Plastic  SKINNYDIP® 

NS 

P 

Ceramic  DIP 

J 

D 

Ceramic  SKINNYDIP 

JS 

D 

PLCC-20-pin 

NL 

J 

PLCC  from  24-pin  DIP, 

FN 

J 

JEDEC 

PLCC  from  24-pin  DIP 

NL 

PLCC  from  28-pin  DIP 

FN 

J 

LCC 

L 

L 

Table  1.  Package  Designators 
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PAL 


J 


NUMBER  OF  INPUTS 


ARCHITECTURE 


ECL  devices  also  have  either  '10H'  or  '100'  following  'PAL'  to 
indicate  the  ECL  family  compatibility. 

The  number  of  inputs  includes  feedback,  if  feedback  exists. 

The  architecture  code  varies.  It  is  usually  mnemonic,  and  the 
meanings  of  each  code  will  be  listed  in  each  of  the  following 
sections. 


I 


POWER  LEVEL 
SPEED  GRADE 


NUMBER  OF  OUTPUTS 

(OR  FLIP-FLOPS)  01 

The  speed  grade  indicates  the  propagation  delay.  In  general,  the 
speeds  are  as  follows: 

blank:  35  ns  (Note  that  -2  and  -4  devices  are  slower) 
A:  25  ns 
B:  15  ns 
D:  10  ns 

The  power  level  can  be  one  of  three  designators: 

blank:  'full  power',  180-240  mA 
-2:  'half  power',  90-105  mA 
-4:  'quarter  power',  45-55  mA 


Figure  1.  Monolithic  Memories  Nomenclature  for  Older  Products 


TECHNOLOGY   

NUMBER  OF  INPUTS  . 
ARCHITECTURE   


I 


SPEED  GRADE 


POWER  LEVEL 


NUMBER  OF  OUTPUTS 
(OR  FLIP-FLOPS) 


Notes: 

The  technology  designator  can  be  one  of  four  codes: 

blank  =  TTL 
C  =  CMOS 
10H  =10KHECL 
100  =100KECL 

The  number  of  inputs  includes  feedback,  if  feedback  exists. 


The  architecture  code  varies.  It  is  usually  mnemonic,  and  the 
meanings  of  each  code  will  be  listed  in  each  of  the  following 
sections. 

The  power  level  can  be  one  of  four  designators: 

blank:  full  power',  180-240  mA 
H:  'half  power',  90-1 05  mA 
Q:  'quarter  power',  45-55  mA 
Z:  'zero  power',  <0.1  mA  standby  current 


Figure  2.  New  Monolithic  Memories  Nomenclature 
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AmPAL  • 


NUMBER  OF  INPUTS  - 


ARCHITECTURE 


L 

L 


POWER  LEVEL 


SPEED  GRADE 


-  NUMBER  OF  OUTPUTS 
(OR  FLIP-FLOPS) 


Notes: 

The  number  of  inputs  includes  feedback,  if  feedback  exists. 

The  architecture  code  varies.  It  is  usually  mnemonic,  and  the 
meanings  of  each  code  will  be  listed  in  each  of  the  following 
sections. 


The  speed  grade  indicates  the  propagation  delay.  In  general,  the 
speeds  are  as  follows: 

blank:  35  ns 
A:  25  ns 
B:  15  ns 
D:  10  ns 

The  power  level  can  be  one  of  three  designators: 

blank:  'full  power',  1 80-240  mA 
L:  'half  power',  90-105  mA 
Q:  'quarter  power',  45-55  mA 


Figure  3.  Advanced  Micro  Devices  Nomenclature  for  Older  Products 


22     XP     10    -30  L 


TECHNOLOGY 


NUMBER  OF  INPUTS  ■ 
ARCHITECTURE   


I 


POWER  LEVEL 


SPEED  GRADE 


The  technology  designator  can  be  one  of  two  codes: 

blank  =  TTL 
C  =  CMOS 

The  number  of  inputs  includes  feedback,  if  feedback  exists. 

The  architecture  code  varies.  It  is  usually  mnemonic,  and  the 
meanings  of  each  code  will  be  listed  in  each  of  the  following 
sections. 


NUMBER  OF  OUTPUTS 
(OR  FLIP-FLOPS) 


The  speed  grade  indicates  the  propagation  delay  in  nano- 
seconds. 

The  power  level  can  be  one  of  three  designators: 

blank:  lull  power',  1 80-240  mA 
L:  'half  power',  90-105  mA 


Figure  4.  New  Advanced  Micro  Devices  Nomenclature 
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AMD/MM  I  Programmable  Logic  Product 
Cross  Reference  Guide 

As  a  merged  company,  the  combined  PLD  offerings  of  AMD/MMI 
provide  the  PLD  user  the  broadest  line  of  PLDs  available  from  one 
supplier.  The  combined  product  offering  also  contains  a  number 
of  devices  for  which  each  of  the  original  separate  companies  had 
equivalent  devices. 

This  is  a  guide  to  the  devices  that  are  equivalent  and  can  be 
directly  substituted  (with  exceptions  as  noted  in  the  comments 
section). 

These  are  the  only  products  which  are  direct  substitutes  for  each 
other,  or  which  we  will  be  encouraging  customers  to  use  as 
alternate  parts. 


TTL  and  CMOS  PAL  Devices 

TTL  PAL  devices  are  used  more  widely  by  far  than  devices  made 
with  any  othertechnology.  They  provide  very  high  speed  without 
the  design  requirements  of  ECL.  CMOS  devices  provide  the 
benefits  of  reduced  power  consumption  and  erasability.  Each 
CMOS  device  is  either  TTL  compatible,  or  has  a  TTL-compatible 
version. 

The  available  devices  can  be  generally  placed  into  four  functional 
groups: 

•  Simple  combinatorial 

•  Simple  registered 

•  Sequencer 

•  Asynchronous 


AMD 

MMI 

COMMENTS 

Part  Number 

Speed/Power 

Part  Number 

Speed/Power 

AmPALI  6R8D 
AmPALI  6R6D 
AmPALI  6R4D 
AmPALI  6L8D 

10/180 
10/180 
10/180 
10/180 

PAL16R8D 
PAL16R6D 
PAL16R4D 
PAL16L8D 

10/180 
10/180 
10/180 
1 0/1  80 

AmPALI  6R8B 
AmPALI  6R6B 
AmPALI  6R4B 
AmPALI  6L8B 

15/180 
15/180 
15/180 
15/180 

PAL16R8B 
PAL16R6B 
PAL16R4B 
PAL16L8B 

15/180 
15/180 
15/180 
15/180 

AmPALI  6R8AL 
AmPALI  6R6AL 
AmPALI  6R4AL 
AmPAl  1fil  flAI 

Mil 1 1  ML  I  DLDML 

25/90 
25/90 
25/90 

PAL16R8B-2 
PAL16R6B-2 
PAL16R4B-2 

DAI  1  CI  OR  O 

25/90 
25/90 
25/90 

AmPALI  6R8A 
AmPALI  6R6A 
AmPALI  6R4A 
AmPALI  6L8A 

25/180 
25/180 
25/180 
25/155 

PAL16R8A 
PAL16R6A 
PAL16R4A 
PAL16L8A 

25/180 
25/180 
25/180 
25/180 

AKyin    on  ne  t 
mivil/ — tu-ns  isu 

AMD — 20-ns  tsu 

AMD— 20-ns  tsu 

AmPALI  6R8Q 
AmPALI  6R6Q 
AmPALI  6R4Q 
AmPALI  6L8Q 

35/45 
35/45 
35/45 
35/45 

PAL16R8B-4 
PAL16R6B-4 
PAL1 6R4B-4 
PAL16L8B-4 

35/55 
35/55 
35/55 
35/55 

AmPALI  6R8L 
AmPALI  6R6L 
AmPALI  6R4L 
AmPALI  6L8L 

35/90 
35/90 
35/90 
35/80 

PAL16R8A-2 
PAL16R6A-2 
PAL16R4A-2 
PAL16L8A-2 

35/90 
35/90 
35/90 
35/90 

AmPALI  6R8 
AmPALI  6R6 
AmPALI  6R4 
AmPALI  6L8 

35/180 
35/180 
35/180 
35/180 

PAL16R8A-2 
PAL16R6A-2 
PAL1 6R4A-2 
PAL16L8A-2 

35/90 
35/90 
35/90 
35/90 

Note  that  the  MMI  part  requires  half  the  power 
(90  mA  lcc)  of  the  AMD  part  (180  mA) 

AmPALI  8P8L 
AmPALI  8P8L 
AmPALI  8P8L 
AmPALI  8P8L 
AmPALI  8P8L 
AmPALI  8P8L 
AmPALI  8P8L 
AmPAL18P8L 

35/90 
35/90 
35/90 
35/90 
35/90 
35/90 
35/90 
35/90 

PAL10H8 
PAL12H6 
PAL14H4 
PAL16H2 
PAL10L8 
PAL12L6 
PAL14L4 
PAL16L2 

35/90 
35/90 
35/90 
35/90 
35/90 
35/90 
35/90 
35/90 

The  18P8L  can  replace  any  of  the  MMI  parts, 
but  the  reverse  is  not  necessarily  true. 

Product  Overview 


Simple  Combinatorial  Devices 

These  PAL  devices  provide  simple  combinatorial  functions 
quickly  and  efficiently. .  There  is  a  wide  selection  of  devices, 
allowing  the  user  to  choose  the  right  device  for  the  given 
application. 

There  are  five  basic  characteristics  that  differentiate  these 
devices: 

•  The  number  of  inputs 

•  The  number  of  outputs 

•  The  number  of  product  terms  per  output 

•  The  speed:  propagation  delay 

•  Power  consumption 

Several  of  the  devices  have  internal  feedback,  which  routes 
output  pins  back  as  inputs.  This  is  generally  coupled  with 
programmable  three-state  capability,  so  that  the  output  pins  can 
be  used  as  outputs,  inputs,  or  bidirectional  pins. 

Most  of  the  devices  have  fixed  output  polarity.  The  AmPALI  8P8, 
AmPAL22P10,  PAL16P8,  PAL20S10,  and  AmPAL22XP10  all 
have  programmable  output  polarity. 

The  PAL20S10  has  a  feature  called  "product  term  steering", 
which  allows  product  termsto  be  allocated  between  outputs.  This 
is  described  more  fully  on  page  5-1 03  of  the  Data  Book. 


The  AmPAL22XP10  has  built-in  exclusive-OR  gates  on  each 
output. 

In  addition  to  the  devices  that  are  purely  combinatorial,  there  are 
also  registered  devices  with  flip-flops  that  can  be  bypassed  to 
give  a  combinatorial  device.  Although  these  devices  are  not 
listed  in  Table  2,  they  can  be  used  as  combinatorial  devices.  The 
best  example  is  the  PAL22V10,  which  is  widely  used  as  a  com- 
binatorial device. 

A  summary  of  the  architecture  codes  for  these  types  of  devices 
is  shown  in  Table  2. 

The  combinatorial  devices  are  listed  in  Table  3,  with  their  logic, 
speed,  and  power  characteristics. 


CODE 

MEANING 

EXAMPLE 

H 

Active-HIGH  outputs 

PAL10H8 

L 

Active-LOW  outputs 

PAL16L8 

P 

Programmable  output  polarity 

PAL16P8 

C 

Complementary  outputs 

PAL16C1 

XP 

Exclusive-OR  gate,  prog. 

AmPAL22XP10 

polarity 

s 

Product  term  steering 

PAL20S10 

Table  2.  Combinatorial  Architectures 


STANDBY 

PRODUCT 

SPEED 

l_ 
(mi) 

DATA  SHEET 

DEVIC 

ENAME 

INPUTS 

OUTPUTS 

TERMS/OUTPUT 

(I,,  in  ns) 

PAGE  NO. 

PAL8 

L14A 

8 

14 

1 

25 

90 

5-141 

PAL6L16A 

6 

16 

1 

25 

90 

5-141 

PAL10H8 

10 

8 

2 

35 

90 

5-56 

PAL12H6 

12 

6 

2,4 

35 

90 

5-56 

PAL14H4 

14 

4 

4 

35 

90 

5-56 

PAL16H2 

16 

2 

8 

35 

90 

5-56 

PAL10L8 

10 

8 

2 

35 

90 

5-56 

PAL12L6 

12 

6 

2,4 

35 

90 

5-56 

PAL14L4 

14 

4 

4 

35 

90 

5-56 

PAL16L2 

16 

2 

8 

35 

90 

5-56 

PAL16C1 

16 

2 

16 

40 

90 

5-56 

PAL16L8D 

16* 

8 

7 

10 

180 

5-29 

AmPALI  6L8D 

10 

180 

5-183 

PAL1 6L8B 

15 

180 

5-31 

AmPALI  6L8B 

15 

180 

5-197 

PALC16L8Z-25 

25 

0.1 

5-50 

PALC 

,1 6L8Q-25 

25 

45 

5-33 

PAL1 

6L8B-2 

25 

90 

5-35 

AmPALI 

6L8AL 

25 

90 

5-197 

PAL1 

6L8A 

25 

180 

5-37 

AmPALI 

6L8A 

25 

155 

5-197 

PAL1 

6L8B-4 

35 

55 

5-39 

AmPALI 

6L8Q 

35 

45 

5-197 

PAL1 

6L8A-2 

35 

90 

5-41 

AmPALI 

6L8L 

35 

80 

5-197 

AmPALI 

6L8 

35 

155 

5-197 

PAL1 

6L8A-4 

55 

50 

5-43 

Table  3.  Simple  Combinatorial  PAL  Devices 
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STANDBY 

PRODUCT 

SPEED 

(mA) 

niTA    pUC  FT 

DATA  SHEET 

DEVICE  NAME 

INPUTS 

/-it  irni  iTr> 

OUTPUTS 

Trnniir'  i r\ i  i-r-i-n  it- 

TERMS/OUTPUT 

(tPD  in  ns) 

PAGE  NO. 

PAL16P8A 

16* 

8 

7 

25/30** 

180 

5-17 

AmPAL18P8B 

18* 

8 

8 

15 

180 

5-202 

AmPAL18P8AL 

25 

90 

5-202 

AmPAL18P8A 

25 

180 

5-202 

AmPAL18P8Q 

35 

55 

5-202 

AmrAL  i  oraL 

35 

90 

5-202 

PAL12L10 

12 

10 

2 

40 

100 

5-147 

PAL14L8 

14 

8 

2,4 

40 

100 

5-147 

PAL1 6L6 

16 

6 

2,4 

40 

100 

5-147 

PAL1 8L4 

18 

4 

4,6 

40 

100 

5-147 

PAL20L2 

20 

2 

8 

40 

100 

5-147 

PAI  OC\CA 

o 

c. 

4U 

1 00 

o- 14/ 

PAL20L8B 

20* 

8 

7 

15 

210 

5-125 

PAL20L8B-2 

25 

105 

5-126 

PAL20L8A 

25 

210 

5-128 

DA  I  r"inl  o"7  or 

lJALC2uL8Z-35 

35 

0.1 

5-133 

PAL20L8A-2 

35 

105 

5-130 

PALC20L8Z-45 

45 

0.1 

5-133 

AmPAL20L10B 

20* 

10 

3 

15 

210 

5-306 

AmPAL20L10-20 

20 

165 

5-306 

AmPAl  9nl  1  nAI 
MmrrtLtUL  I  URL 

Oc 
dX> 

1 05 

5-306 

PAL20L1  OA 

30 

165 

5-113 

PAL20S10 

20* 

10 

0-1 6ft 

35 

240 

5-103 

AmDAI  OOP1AR 
MITlrrtL^^r  IUB 

1  U 

Q 

o 

10 

01  n 
iL  1  U 

5-306 

AmPAL22P10AL 

25 

105 

5-306 

AmPAL22P10A 

25 

210 

5-306 

AmPAL22XP10-20 

22* 

10 

2/6t 

20 

210 

5-286 

AmPAL22XP10-30L 

30 

105 

5-286 

AmPAL22XP10-30 

30 

180 

5-286 

AmPAL22XP10-40L 

40 

105 

5-286 

*  Includes  feedback  t  Has  an  exclusive-OR  gate 

'"  Depending  on  polarity  ft  Product  term  steering 
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Simple 


Registered  Devices 


A  basic  registered  device  is  generally  equivalent  to  a  combinato- 
rial device  with  registers  on  the  outputs  for  signal  synchronization. 
All  such  devices  have  feedback  from  the  flip-flops  back  into  the 
array  as  inputs,  so  that  they  can  also  be  used  for  very  fast  small 
state  machines.  The  differentiating  characteristics  of  these  de- 
vices are: 

•  The  number  of  inputs 

•  The  number  of  outputs 

•  The  number  of  flip-flops 

•  The  number  of  product  terms  per  output 

•  The  speed:  maximum  clock  frequency 

•  Power  consumption 

Most  of  these  devices  have  dedicated  clock  and  output  enable 
pins.  On  devices  such  as  the  PAL16R4,  which  have  some 
dedicated  combinatorial  outputs,  only  the  registered  outputs  are 
controlled  by  the  enable  pin;  the  combinatorial  outputs  have 
programmable  three-state  functions. 

The  PAL22V10,  AmPAL22RP10  family,  and  AmPAL22XRP10 
family  have  programmable  three-state  outputs.  With  these  fami- 
lies, the  clock  pin  may  also  be  used  as  a  logic  input  if  desired. 

The  PAL16X4  has  extra  bit-pair  decoding  circuitry  which  makes 
it  especially  well  suited  to  arithmetic  operations.  This  is  discussed 
more  fully  on  page  2-53  in  the  combinatorial  logic  section. 


The  simpler  devices  all  have  active-LOW  outputs.  The 
PAL16RP8  family,  PAL22V10,  AmPAL22RP10  family, 
AmPAL22XRP10  family,  PAL20RS10  family,  and  the  PAL32R16 
have  programmable  polarity. 

The  PAL20RS1 0  family  and  PAL32R1 6  have  product  term  steer- 
ing, as  was  discussed  above  for  the  PAL20S1 0. 

The  outputs  of  the  PAL32R1 6  can  also  be  programmed  to  be 
combinatorial  in  banks  of  eight.  In  addition,  the  PAL32R1 6  has 
multiple  clock,  enable,  and  TTL-level  preload  pins. 

The  PAL22V10  has  two  programmable  initialization  product 
terms:  a  synchronous  preset  term  and  an  asynchronous  reset 
term.  These  terms  control  all  flip-flops. 

A  summary  of  the  architecture  codes  for  these  types  of  devices 
is  shown  in  Table  4.  The  devices  are  summarized  in  Table  5. 


CODE 

MEANING 

EXAMPLE 

R 

Registered  outputs 

PAL16R8 

X 

Exclusive-OR  gates 

PAL16X4 

RP 

Registered  with  prog,  polarity 

PAL16RP8 

RS 

Registered  with  term  steering 

PAL20RS1 0 

V 

Versatile 

AmPAL22V10 

Table  4.  Registered  Architectures 


STANDBY 

PRODUCT 

SPEED 

(mA) 

DATA  SHEET 

DEVICE 

■NAME 

INPUTS 

OUTPUTS 

FLIP-FLOPS 

TERMS/OUTPUT 

(f„„  in  MHz) 

PAGE  NO. 

PAL16R8D 

16' 

8 

8 

8 

55 

180 

5-29 

AmPAL16R8D 

55 

180 

5-183 

PAL16R8B 

37 

180 

5-31 

AmPAL16R8B 

40 

180 

5-197 

PALC 

6R8Z-25 

28.5 

0.1 

5-50 

PALC 

6R8Q-25 

28.5 

45 

5-33 

PAL16R8B-2 

25 

90 

5-35 

AmPAL16R8AL 

28.5 

90 

5-197 

PAL16R8A 

25 

180 

5-37 

AmPAL16R8A 

28.5 

180 

5-197 

PAL16R8B-4 

16 

55 

5-39 

AmPAL16R8Q 

18 

45 

5-197 

PAL16R8A-2 

16 

90 

5-41 

AmPAL16R8L 

18 

90 

5-197 

AmPAL16R8 

18 

180 

5-197 

PAL16R8A-4 

11 

50 

5-43 

PAL16R6D 

16* 

8 

6 

8 

55 

180 

5-29 

AmPALie 

R6D 

55 

180 

5-183 

PAL1E 

R6B 

37 

180 

5-31 

AmPALIE 

R6B 

40 

180 

5-197 

PALC 

6R6Z-25 

28.5 

0.1 

5-50 

PALC 

6R6Q-25 

28.5 

45 

5-33 

PAL16R6B-2 

25 

90 

5-35 

AmPALie 

R6AL 

28.5 

90 

5-197 

PAL16 

R6A 

25 

180 

5-37 

Table  5.  Simple  Registered  PAL  Devices 
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DEVICE  NAME 

INPUTS 

OUTPUTS 

FLIP-FLOPS 

PRODUCT 
TERMS/OUTPUT 

SPEED 

(U 

In  MHz) 

STANDBY 

'cc 
(mA) 

DATA  SHEET 
PAGE  NO. 

AmPAL16R6A 
PAL16R6B-4 

AmPALi  6R6Q 
PALI  6R6A-2 

A  . nAi  4     r~i r1 1 

AmPALi  6R6L 
AmPALi  6R6 

nA  i  4  n  r~) r-  A  a 

PALI  6R6A-4 

28.5 
16 
18 
16 
18 
18 
11 

180 
55 
45 
90 
90 
180 
50 

5-197 

5-39 

5-197 

5-41 

5-197 

5-197 

5-43 

PAL16R4D 

16* 

8 

4 

8 

55 

180 

5-29 

AmPALi  6R4D 

55 

180 

5-183 

PAL1 6R4B 

37 

180 

5-31 

AmPALi  6R4B 

40 

180 

5-197 

PALC16R4Z-25 

28.5 

0.1 

5-50 

PALC16R4Q-25 

28.5 

45 

5-33 

PAL1 6R4B-2 

25 

90 

5-35 

AmPALi  6R4AL 

28.5 

90 

5-197 

PAL16R4A 

25 

180 

5-37 

AmPALi  6R4A 

28.5 

180 

5-197 

PAL16R4B-4 

16 

55 

5-39 

AmPALi  6R4Q 

18 

45 

5-197 

PAL16R4A-2 

16 

90 

5-41 

AmPALi  6R4L 

18 

90 

5-197 

AmPALi  6R4 

18 

180 

5-197 

PAL16R4A-4 

11 

50 

5-43 

PAL 16X4 

16* 

8 

4 

8t 

14 

225 

5-51 

PAL1 6RP8A 

16* 

8 

8 

8 

25" 

180 

5-17 

PAL16RP6A 

16* 

8 

6 

8 

25" 

180 

5-17 

PAL1 6RP4A 

16* 

8 

4 

8 

25** 

180 

5-17 

PAL20R8B 

20* 

8 

8 

8 

37 

210 

5-125 

PAL20R8B-2 

25 

105 

5-126 

PAL20R8A 

25 

210 

5-128 

PALC20R8Z-35 

20 

0.1 

5-133 

PAL20R8A-2 

16 

105 

5-130 

PALC20R8Z-45 

15.3 

0.1 

5-133 

PAL20R6B 

20* 

8 

6 

8 

37 

210 

5-125 

PAL20R6B-2 

25 

105 

5-126 

PAL20R6A 

25 

210 

5-128 

PALC20R6Z-35 

20 

0.1 

5-133 

PAL20R6A-2 

16 

105 

5-130 

PALC20R6Z-45 

15.3 

0.1 

5-133 

PAL20R4B 

PAL20R4B-2 

PAL20R4A 

PALC20R4Z-35 

PAL20R4A-2 

PALC20R4Z-45 

20* 

8 

4 

8 

37 
25 
25 
20 
16 
15.3 

210 
105 
210 
0.1 
105 
0.1 

5-125 
5-126 
5-128 
5-133 
5-130 
5-133 

Table  5.  Simple  Registered  PAL  Devices  (Continued) 
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SPEED 

STANDBY 

PRODUCT 

^MAX 

"cc 

r~\  a  "x" *  euccT 

DATA  SHEET 

DEVICE  NAME 

INPUTS 

OUTPUTS 

FLIP-FLOPS 

TERMS/OUTPUT 

in  MHz) 

(mA) 

PAGE  NO. 

PAL20RS10 

20 

1 0 

1 0 

0-1 6ft 

20 

240 

5-103 

PAL20RS8 

20* 

10 

8 

0-1 6  ft 

20 

240 

5-103 

PAL20RS4 

20* 

10 

4 

0-1 6ft 

20 

240 

5-103 

AmPAl  ?9Vm-1  R 

on* 

1  U 

0-1 0§ 

8— 1 6§§ 

An 

1 80 

5-249 

PALC22V10H-25 

oo  Q 
oo.o 

90 

5-79 

AmPAL22V10A 

28.5 

180 

5-260 

PALC22V10H-35 

20 

90 

5-79 

AmPAl  99V/m 
nlTrnLttV  IU 

1  o 

1 80 

5-260 

AmPAL20RP10B 

22* 

10 

10 

8 

37 

210 

5-306 

AmPAL20RP10AL 

25 

105 

5-306 

MmrALtUnr  I UA 

25 

210 

5-306 

AmPAL20RP8B 

22' 

10 

8 

8 

37 

210 

5-306 

AmPAL20RP8AL 

25 

105 

5-306 

AmPAL20RP8A 

25 

21 0 

5-306 

AmPAL20RP6B 

22* 

10 

6 

8 

37 

210 

5-306 

AmPAL20RP6AL 

25 

105 

5-306 

AmPAL20RP6A 

25 

210 

5-306 

AmPAL20RP4B 

22* 

10 

4 

8 

37 

210 

5-306 

AmPAL20RP4AL 

25 

105 

5-306 

AmPAL20RP4A 

25 

210 

5-306 

PAL32R16 

32* 

16 

16§ 

0-1 6ft 

16 

280 

5-158 

'  Includes  feedback  ft  Product  term  steering 

*"  With  polarity  fuse  intact  §  Flip-flops  can  be  bypassed 

t  Has  an  exclusive-OR  gate  §§  Has  varied  product  term  distribution 


Table  5.  Simple  Registered  PAL  Devices  (Continued) 


PAL  Devices  as  Sequencers 

Several  registered  PAL  devices  have  added  features  that  make 
them  particularly  well  suited  to  state  machine  applications. 

The  basic  differentiating  aspects  of  these  devices  are: 

•  The  number  of  inputs 

•  The  number  of  outputs 

•  The  number  of  flip-flops 

•  The  type  of  flip-flop  possible  (D,  T,  J-K,  S-R,  latch, 
and/or  buried) 

•  The  number  of  product  terms  per  output 

•  The  speed:  maximum  clock  frequency 

•  Power  consumption 

The  PAL20X1 0  series,  AmPAL20XRP1 0  series,  PAL22RX8,  and 
PAL32VX1  Oall  have  an  exclusive-OR  gate  that  makes  it  possible 
to  design  with  D,  T,  J-K,  or  S-R  flip-flops.  Since  the  PAL20X1 0  has 
fewer  product  terms,  it  is  best  for  counter-like  applications.  The 
AmPAL20XRP1 0  series,  PAL22RX8  and  PAL32VX1 0  have  more 
product  terms,  for  much  more  sophisticated  state  machines. 


The  AmPAL23S8  has  six  extra  flip-flops  that  are  permanently 
buried  in  addition  to  eight  output  flip-flops.  The  PAL32VX1 0  and 
AmPALC29M1 6  also  have  flip-flops  that  can  be  buried  if  they  are 
not  needed  at  the  outputs.  An  output  pin  can  be  used  as  an  input 
if  its  flip-flop  is  buried.  All  of  these  devices  are  ideal  for  applica- 
tions incorporating  internal  state  machines  as  part  of  the  overall 
design. 

The  PAL22RX8,  PAL32VX10,  and  AmPAL23S8  have  global 
preset  and  reset  control,  making  system  initialization  much 


The  AmPALC29M1 6  is  an  electrically  erasable  device  that  has  an 
unusually  flexible  architecture.  Almost  all  pins  can  be  used  as 
inputs  or  outputs;  both  inputs  and  outputs  can  be  registered, 
latched,  or  combinatorial.  There  are  two  pins  that  can  be  used  as 
clocks.  Global  preset  and  reset  functions  are  also  provided. 
Several  of  the  flip-flops  can  be  buried  to  provide  internal  state 
machines.  This  device  is  HC  and  HCT  compatible. 

A  summary  of  the  architecture  codes  for  these  types  of  devices 
is  shown  in  Table  6. 

The  PAL  devices  best  suited  to  state  machine  applications  are 
listed  in  Table  7. 
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CODE 

MEANING 

EXAMPLE 

X 

Exclusive-OR  gates 

PAL20X10 

XRP 

Exclusive-OR  gates,  prog,  polarity 

AmPAL20XRP10 

RX 

Registered,  exclusive-OR  gates 

PAL22RX8 

S 

Sequencer 

AmPAL23S8 

M 

Advanced  macrocell 

AmPALC29M16 

VX 

Varied  term  distribution,  XOR  gate 

PAL32VX1 0 

Table  6.  State  Machine  Architectures 


STAND 

SPEED 

BY 

FLIP-FLOP 

PRODUCT 

'« 

DATASHEET 

DEVICE  NAME 

INPUTS 

OUTPUTS 

FLIP-FLOPS 

TYPES 

TERMS/OUTPUT 

in  MHz) 

(mA) 

PAGE  NO. 

PAI  POX  1 0A 

20* 

10 

10 

D  TJK  SR 

2/2t 

22.2 

180 

5-113 

PAI  20X8A 

20* 

1 0 

s 

D  TJK  SR 

2/2t 

22.2 

1 80 

5-113 

PAL  P0X4A 

20* 

1 0 

4 

D  TJK  SR 

2/2t 

22.2 

180 

5-113 

AmPAL20XRP10-20 

22' 

10 

10 

D,T,JK,SR 

2/6f 

30.3 

210 

5-286 

AmPAL20XRP10-30L 

22.2 

105 

5-286 

AmPAL20XRP10-30 

22.2 

180 

5-286 

AmPAL20XRP10-40L 

14  ^ 

1 05 

5-286 

AmPAL20XRP8-20 

22* 

10 

8 

D,T,JK,SR 

2/6,  8f 

30.3 

210 

5-286 

AmPAl  *?f")YRPfl  ^ftl 
MiTlrMLtUArir  o-OUL 

22  2 

1 05 

5-286 

AmPAL20XRP8-30 

22.2 

180 

5-286 

AmPAL20XRP8-40L 

14.3 

105 

5-286 

AmPAL20XRP6-20 

22* 

10 

6 

D,T,JK,SR 

2/6,  8f 

30.3 

210 

5-286 

AmPAL20XRP6-30L 

22.2 

105 

5-286 

AmPAL20XRP6-30 

22.2 

180 

5-286 

AmPAL20XRP6-40L 

14.3 

105 

5-286 

AmPAL20XRP4-20 

22* 

10 

4 

D,T,JK,SR 

2/6,  8f 

30.3 

210 

5-286 

AmPAL20XRP4-30L 

22.2 

105 

5-286 

AmPAL20XRP4-30 

22.2 

180 

5-286 

AmPAL20XRP4-40L 

14.3 

105 

5-286 

PAL22RX8A 

22* 

8 

8§ 

D,T,JK,SR 

1/8f 

28.5 

210 

5-87 

AmPAL23S8-20 
AmPAL23S8-25 

23* 

8 

14§ 

D.BO 

6-1 2§§ 

33 

200 

5-169 

28.5 

200 

5-169 

AmPALC29M16-35 

29* 

16 

16§ 

D,B,l_0 

8-1 6§§ 

20 

120 

5-231 

AmPALC29M16-45 

15 

120 

5-231 

PAL32VX1 OA 

32* 

10 

10§ 

D,T,JK,SR, 

1/8-1 6f 

25 

180 

5-70 

PAL32VX10 

BO 

22.2 

180 

5-70 

*  Includes  feedback  §§  Has  varied  product  term  distribution 

t  Has  an  exclusive-OR  gate  0  B=flip-flops  are  or  can  be  buried;  L=latched  outputs  possible 

§  Some  flip-flops  can  be  bypassed 

Table  7.  State  Machine  PAL  Devices 
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Asynchi 


ronous  PAL  Devices 


Several  PAL  devices  have  been  designed  for  applications  that 
require  asynchronous  control  of  flip-flops.  The  simplest  devices 
are  the  PAL20RA1 0  and  the  PAL1 6RA8.  These  devices  essen- 
tially consist  of  several  flip-flops  driven  by  a  programmable  AND 
array  and  a  fixed  OR  array.  Thus  the  D-type  flip-flops  have 
individually  programmable  clock,  set,  and  reset  lines.  They  also 
have  programmable  polarity,  and  the  flip-flops  can  be  bypassed 
to  give  combinatorial  outputs. 

These  devices  are  also  useful  for  replacing  random  discrete  logic. 
Each  flip-flop  is  independent;  the  clocks  can  be  gated  if  desired, 
as  can  the  set  and  reset  lines.  This  provides  for  flexibility  both 
when  replacing  random  gates  and  when  designing  logic  that  is 
asynchronous  in  nature. 

The  AmPALC29MA1 6  combines  some  of  the  advantages  of  the 
AmPALC29M1 6  with  the  advantages  of  the  PAL20RA1 0.  Most  of 
the  AmPALC29M16  macrocell  is  preserved,  except  that  each 
flip-flop  is  given  a  programmable  clock,  set,  and  reset  line.  Each 
output  also  has  individual  three-state  control.  This  device  is 
electrically  erasable,  and  is  HC  and  HCT  compatible. 

A  summary  of  the  architecture  codes  for  these  types  of  devices 
is  shown  in  Table  8.  The  asynchronous  devices  available  are 
summarized  in  Table  9. 


CODE 

MEANING 

EXAMPLE 

RA 
MA 

Registered,  asynchronous 
Asynchronous  macrocell 

PAL20RA1 0 
AmPALC29MA16 

Table  8.  Asynchronous  Architectures 

CMOS  PAL  Devices 

PAL  devices  have  recently  become  available  in  CMOS  technol- 
ogy, providing  performance  competitive  with  that  of  many  bipolar 


devices,  but  with  reduced  power  consumption.  The  functions  of 
these  devices  have  been  discussed  above,  along  with  the  TTL 
devices.  The  particular  characteristics  of  the  CMOS  versions  are 
discussed  below. 

Zero-Power  vs  Low-Power 

Two  basic  types  of  CMOS  PAL  devices  are  available:  those  that 
dissipate  essentially  no  power  when  in  a  quiescent  state,  and 
faster  devices  which  draw  a  nominal  amount  of  current  even 
when  quiescent.  Devices  are  thus  classified  as  "zero-power"  or 
"low-power".  The  low-power  devices  still  require  far  less  current 
than  their  bipolar  equivalents. 

The  basicdifference  between  zero-power  and  low-power  devices 
is  the  wake-up  circuitry  used  in  the  zero-power  devices.  Anytime 
a  signal  changes,  the  device  wakes  up  in  order  to  respond  to  the 
signal.  After  the  transition  is  complete,  the  device  goes  "back  to 
sleep",  as  long  as  no  other  signals  are  changing.  This  wake-up 
circuitry  exacts  a  slight  speed  penalty,  usually  about  5  ns,  but  in 
applications  where  minimal  power  dissipation  is  crucial,  the 
benefits  of  zero-power  operation  greatly  outweigh  the  extra  delay. 

The  PALC1 6R8Z  family  actually  has  a  "turbo  product  term".  This 
product  term  allows  the  device  to  operate  either  in  zero-power  or 
low-power  mode.  Since  this  choice  has  been  provided  by  a 
product  term,  it  is  actually  possible  to  have  a  device  idle  in  zero- 
power  mode,  and  then  switch  into  low-power  mode  for  faster 
operation.  This  is  particularly  appropriate  for  devices  that  tend  to 
operate  in  bursts,  with  idle  periods  between  bursts. 

HC  vs  HCT 

CMOS  devices  can  be  characterized  as  being  HC  and/or  HCT 
compatible.  HC-compatible  devices  have  outputs  that  swing 
"rail-to-rail",  or  from  ground  to  VCC.  They  also  have  an  input 
threshold  which  is  about  2.5  V.  These  devices  are  normally  used 
when  driving  other  CMOS  devices. 


DEVIC 

:E  NAME 

INPUTS 

OUTPUTS 

PRODUCT 
TERMS/OUTPUT 

SPEED 
(tPD  In  ns) 

STANDBY 

lcc(mA) 

DATA  SHEET 
PAGE  NO. 

PAL1 

5RA8 

16* 

8 

4 

30** 

170 

5-11 

PAL20RA10-20 
PAL20RA10 

20* 

10 

4 

20** 
30" 

200 
200 

5-95 
5-97 

AmPALC29MA1 6-35 
AmPALC29MA16-45 

29* 

16 

4-1 2ft 

35 
45 

120 
120 

5-209 
5-209 

*  Includes  feedback 
**  With  polarity  fuse  intact 
ft  Has  product  term  steering 


Table  9.  Asynchronous  PAL  Devices 
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HCT-compatible  devices  have  TTL-level  output  voltages;  that  is, 
a  VOH  of  about  3.5  V  and  a  VOL  of  about  0.5  V.  The  input 
threshold  is  usually  adjusted  to  1 .5  V,  which  is  the  threshold  of  TTL 
devices.  These  devices  are  more  suitable  for  a  mixed  TTL  and 
CMOS  system. 

Roughly  speaking,  HC  and  HCT  devices  can  be  interchanged. 
However,  since  HCT  outputs  do  not  swing  rail-to-rail,  if  they  drive 
another  CMOS  device,  that  device  will  draw  more  power.  HCT 
outputs  are  faster,  have  more  drive,  and  are  more  latch-up 
immune.  The  input  threshold  is  less  critical,  and  is  important  only 
in  very  touchy  situations. 

Note  that  some  devices  are  both  HC  and  HCT  compatible.  This 
means  that  they  have  aquick,  high  drive  output  risetoaTTL  VOH, 
and  then  a  slower  rise  to  VCC,  ultimately  providing  rail-to-rail 
levels.  This  is  illustrated  in  Table  10. 


INPUT 

vOM 

FAMILY 

THRESHOLD  (V) 

(V) 

(V) 

HC 

2.5 

0 

Vcc 

HCT 

1.5 

0.5 

3.5 

HC/HCT 

1.5 

0 

3.5  ->  Vcc 

Table  10.  CMOS/TTL  Compatibility 


UV  Erasable  vs  Electrically  Erasable 

CMOS  PAL  devices  come  in  one  of  two  basic  technologies: 

•  UV  erasable 

•  Electrically  erasable 

UV-erasable  devices  use  EPROM  technology  in  implementing 
the  array.  They  can  be  erased  by  exposure  to  ultraviolet  light  if  the 
device  is  packaged  in  a  windowed  package.  These  packages  are 
more  expensive,  so  the  devices  are  also  available  in  lower-cost 
plastic  packages.  UV-erasable  devices  in  plastic  packages  are 
not  erasable,  and  are  thus  referred  to  as  "One-Time  Program- 
mable", or  OTP  devices. 

Electrically  erasable  devices  can  be  erased  by  providing  the 
appropriate  signals  to  the  device.  No  window  is  required  in  the 
package,  and  the  erase  time  is  much  faster  than  the  erase  time 
of  UV-erasable  devices.  Erasure  is  implemented  by  the  device 
programmer  as  a  portion  of  the  overall  programming  algorithm. 

Summary  of  CMOS  PAL  Devices 

The  PAL  devices  available  in  CMOS  have  been  described  along 
with  their  bipolar  counterparts  in  the  preceding  pages.  They  are 
summarized  in  Table  1 1 ,  along  with  their  power,  compatibility,  and 
erasability  characteristics. 


SPEED 

STANDBY 

DATA  SHEET 

DEVICE  NAME 

INPUTS 

OUTPUTS 

(tro  in  ns) 

lcc(mA) 

COMPATIBILITY 

ERASABILITY 

PAGE  NO. 

PALC16L8Z-25 

16* 

8 

25 

0.1/45** 

HC/HCT 

UV 

5-50 

PALC16R8Z-25 

16" 

8 

25 

0.1/45" 

HC/HCT 

UV 

5-50 

PALC16R6Z-25 

16' 

8 

25 

0.1/45" 

HC/HCT 

UV 

5-50 

PALC16R4Z-25 

16* 

8 

25 

0.1/45" 

HC/HCT 

UV 

5-50 

PALC1 6L8Q-25 

16* 

8 

25 

45 

HCT 

UV 

5-33 

PALC16R8Q-25 

16* 

8 

25 

45 

HCT 

UV 

5-33 

PALC16R6Q-25 

16* 

8 

25 

45 

HCT 

UV 

5-33 

PALC16R4Q-25 

16* 

8 

25 

45 

HCT 

UV 

5-33 

PALC20L8Z-35 

20* 

8 

35 

0.1 

HC/HCT 

UV 

5-133 

PALC20L8Z-45 

20* 

8 

45 

0.1 

HC/HCT 

UV 

5-133 

PALC20R8Z-35 

20* 

8 

35 

0.1 

HC/HCT 

UV 

5-133 

PALC20R8Z-45 

20* 

8 

45 

0.1 

HC/HCT 

UV 

5-133 

PALC20R6Z-35 

20' 

8 

35 

0.1 

HC/HCT 

UV 

5-133 

PALC20R6Z-45 

20' 

8 

45 

0.1 

HC/HCT 

UV 

5-133 

PALC20R4Z-35 

20* 

8 

35 

0.1 

HC/HCT 

UV 

5-133 

PALC20R4Z-45 

20* 

8 

45 

0.1 

HC/HCT 

UV 

5-133 

PALC22V10H-25 

22' 

10 

25 

90 

HCT 

UV 

5-79 

PALC22V10H-35 

22* 

10 

35 

90 

HCT 

UV 

5-79 

AmPALC29M16-35 

29* 

16 

35 

120 

HC/HCT 

EE 

5-231 

AmPALC29M16-45 

29* 

16 

45 

120 

HC/HCT 

EE 

5-231 

AmPALC29MA16-35 

29" 

16 

35 

120 

HC/HCT 

EE 

5-209 

AmPALC29MA16-45 

29* 

16 

45 

120 

HC/HCT 

EE 

5-209 

'  Includes  feedback 
'  With  turbo  term 


Table  11.  Summary  of  CMOS  PAL  Devices 
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Product  Overview 


ECL  PAL  Devices 


Programmable  Sequencers 


For  a  long  period  of  time,  PLDs  were  available  only  for  in- 
compatible systems.  More  recently,  however,  PAL  devices  have 
been  introduced  which  address  the  users  of  ECL  logic. 

Two  basic  families  of  ECL  logic  are  addressed  by  the  ECL  PAL 
devices.  These  families  differ  in  their  voltage  ranges  and  the 
compensation  built  into  the  circuitry  (Table  1 2).  These  differences 
also  affect  noise  margins  and  thresholds.  Refer  to  page  3-1 50  for 
more  information  on  ECL  design. 

The  ECL  PAL  devices  presently  have  a  6-ns  propagation  delay; 
the  differences  between  the  devices  can  be  found  in  the  architec- 
tures. The  PAL1 0H20P8  and  PAL1 0H20G8  are  simpler  devices 
which  are  compatible  with  other  10KH  logic;  the  former  is  a 
combinatorial  device  with  programmable  polarity,  the  latter  has 
output  latches.  Both  devices  have  product  term  steering  for 
added  flexibility  in  allocating  logic  resources.  These  features  are 
described  in  more  detail  on  page  5-382. 

The  PAL10H20EV/EG8  and  PAL10020EV/EG8  are  10KH  and 
lOOKcompatibleoptionsofthesamedevice,  respectively.  These 
devices  have  more  product  terms  than  the  first  two  devices,  and 
have  outputs  that  can  be  programmed  as  combinatorial  or  regis- 
tered/latched. These  devices  also  have  global  reset  and  preset 
controls. 

A  summary  of  the  architecture  codes  for  these  types  of  devices 
is  shown  in  Table  13.  The  ECL  PAL  devices  and  their  basic 
characteristics  are  summarized  in  Tables  14  and  15. 


FAMILY 

v„ 

COMPENSATION 

10KH 
100K 

-5.2V  ±  5% 
-4.5V  ±  0.3V 

Voltage 
Voltage,  temperature 

Table  12.  10KH  ECL  vs  100K  ECL 


Many  PAL  devices  can  be  used  to  design  very  fast,  efficient 
sequencers.  Optimal  among  these  devices  are  the  PAL32VX1 0 
andtheAmPAL23S8.  However,  there  are  additional  devices  with 
entire  architectures  dedicated  to  the  task  of  sequencing. 

Since  these  devices  are  primarily  intended  for  use  as  state 
machines,  the  important  characteristics  to  be  evaluated  are: 

•  The  number  of  inputs 

•  The  number  of  outputs 

•  The  number  of  states  that  can  be  implemented 

•  The  number  of  branches  available  from  any  state 
•The  speed;  maximum  clock  frequency 

•  Power  consumption 

The  programmable  sequencers  available  are  shown  in  Table  1 6. 

The  PMS14R21  (also  known  as  the  PROSE™  device)  actually 
consists  of  a  PAL  device  for  branching,  followed  by  a  PROM  to 
store  the  states.  This  device  also  has  the  Diagnostics-On-Chip™ 
(DOC™)  testability  circuitry,  allowing  the  device  to  be  inserted 
into  a  serial  scan  path.  This  is  described  more  thoroughly  on  page 
3-123  of  the  testability  section. 

The  PLS105,  PLS167,  and  PLS168  are  based  on  the  standard 
PLS  architecture,  where  both  the  AND  array  and  the  OR  array  are 
programmable.  S-R  flip-flops  are  used  to  minimize  the  need  for 
specifying  'HOLD'  conditions.  There  is  also  a  "complement 
array",  which  makes  it  easier  to  define  default  state  transitions. 


CODE 

MEANING 

EXAMPLE 

P 

Combinatorial,  prog,  polarity 

PAL10H20P8 

G 

Latched  outputs 

PAL10H20G8 

EG 

Latched  or  combinatorial 

PAL10H/10020EG8 

EV 

Registered  or  combinatorial 

PAL10H/10020EV8 

Table  13.  ECL  PAL  Device  Architectures 


DEVI 

CE  NAME 

INPUTS 

OUTPUTS 

FLIP-FLOPS 

PRODUCT 
TERMS/OUTPUT 

SPEED 

(*PD°rf««) 

l„(mA) 

DATA  SHEET 
PAGE  NO. 

PAL10H20P8 
PAL10H20G8 
PAL1 0H20EV/EG8 

20* 
20* 
20* 

CO  CO  00 

0 
8§ 
8§ 

0-8tt 
0-8ft 
8-1 2§§ 

6  ns 
6  ns 
125  MHz 

210 
225 
220 

5-385 
5-382 
5-381 

"  Includes  feedback 
ft  Has  product  term  steering 


§  Flip-flops  can  be  bypassed 
§§  Has  varied  product  term  distribution 

Table  14.  10KH-Compatlble  PAL  Devices 


DEVICE  NAME 

INPUTS 

OUTPUTS 

FLIP-FLOPS 

PRODUCT 
TERMS/OUTPUT 

SPEED 

('„„) 

l„(mA) 

DATA  SHEET 
PAGE  NO. 

PAL10020EV/EG8 

20* 

8 

8§ 

8-1 2§§ 

125  MHz 

220 

5-381 

*  Includes  feedback  §  Flip-flops  can  be  bypassed 

§§  Has  varied  product  term  distribution 

Table  15.  lOOK-Compatible  PAL  Devices 
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Product  Overview 


The  PMS14R21,  PLS105,  PLS167,  and  PLS168  are  all  sup- 
ported by  PALASM  software,  using  a  convenient  entry  formatthat 
has  been  optimized  for  state  machines.  This  essentially  makes 
it  possible  to  transfer  a  state  diagram  directly  to  a  design  file 
without  the  need  for  manual  logictransformations  or  calculations. 

The  Am29PL141  Fuse  Programmable  Controller  (FPC)  is  an  in- 
struction-based sequencer.  This  is  especially  convenient  for  de- 
signers who  are  more  comfortable  using  instruction-based  de- 
vices. The  design  is  implemented  more  as  microprocessor 
program  might  be  written.  Indeed,  the  architecture  itself,  which  is 
described  in  more  detail  on  page  5-339,  resembles  that  of  a 
simple  processor.  This  device  has  its  own  assembler  which  is 
used  to  reduce  the  sequencer  "program"  into  a  file  that  can  be 
used  to  program  the  device. 

The  Am2971  is  a  Progammable  Event  Generator.  It  is  a  versatile 
timing  device  that  can  be  used  as  a  digital  substitute  for  analog 
delay  lines  or  as  a  general-purpose  user-programmable  timing/ 
waveform  generator. 

LCA  Devices 

Large-scale  designs  are  now  possible  with  the  introduction  of  the 
Logic  Cell  Array  (LCA)  devices.  These  devices  can  be  thought  of 


as  programmable  gate  arrays,  and  provide  an  efficient,  cost- 
effective  means  of  implementing  dense  designs  within  a  single 
PLD. 

Unlike  other  PLDs,  which  are  non-volatile,  LCA  devices  make  use 
of  static  RAM  technology.  This  makes  them  easy  to  develop,  easy 
to  test  and  debug,  and  reprogrammable. 

There  is  an  entire  family  of  support  software  and  hardware 
packages  available  to  aid  in  the  design  of  LCA  applications. 
These  are  described  in  more  detail  on  page  3-17. 

The  architecture  of  an  LCA  device  resembles  that  of  a  gate  array 
more  closely  than  a  traditional  PLD.  It  has  an  array  of  Configur- 
able Logic  Blocks  (CLBs)  that  can  be  connected  via  a  variety  of 
interconnect  resources.  There  are  also  Input/Output  Blocks 
(lOBs)  for  transferring  the  signals  on  and  off  the  chip. 

The  main  features  that  distinguish  one  LCA  device  from  another 
are: 

•  The  number  of  I/O  pins 

•  The  number  of  Configurable  Logic  Blocks 

•  The  speed:  internal  maximum  toggle  frequency 

•  Power  consumption 

The  available  devices  are  summarized  in  Table  17. 




DEVICE  NAME 

INPUTS 

OUTPUTS 

STATES 
(MAX) 

BRANCHES 
PER  STATE 

SPEED 
(fMAX  In  MHz) 

lee 
(mA) 

DATA  SHEET 
PAGE  NO. 

PMS14R21A 
PMS14R21 

8 

8 

128 

4 

30 
25 

210 
210 

5-315 
5-315 

PLS1 05-37 
PLS1 67-33 
PLS1 68-33 

16 
14 
12 

8 
6 
8 

<64* 
<128* 
<1028* 

* 
* 
• 

37 
33 
33 

200 
200 
200 

5-331 
5-331 
5-331 

Am29PL141 

6 

16 

64 

2 

20 

450 

5-339 

Am2971 

6 

14 

N/A 

N/A 

85 

310 

5-365 

*  Depends  highly  on  state  diagram  topology.  May  be  limited  by  number  of  product  terms  or  number  of  flip-flops. 


Table  16.  Programmable  Sequencers 


DEVICE  NAME 

I/O  PINS 

CLBs 

SPEED 
(INTERNAL  TOGGLE 

U  m  MHz) 

STANDBY 

■ccfmA) 

DATA  SHEET 
PAGE  NO. 

M2064-70 

58 

64 

70 

5 

5-483 

M2064-50 

50 

5 

5-483 

M2064-33 

33 

5 

5-483 

M2018-50 
M2018-33 

74 

100 

50 
33 

5 
5 

5-483 
5-483 

Table  17.  LCA  Devices 
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Introduction 


Installing  PALASM  Software 


This  section  is  intended  as  a  beginner's  introduction  to  PLD 
design,  although  experienced  users  may  find  it  a  good  review.  We 
will  take  a  step-by-step  approach  through  two  very  simple  de- 
signs to  demonstrate  the  basic  PLD  design  implementation 
process. 


Before  we  actually  start  implementing  a  design,  we  must  install 
PALASM  software.  If  you  already  have  the  software  installed,  you 
may  disregard  this  discussion.  The  procedure  described  here  will 
allow  you  to  install  the  software  package  onto  an  IBM  PC/XT  or 
compatible  with  a  hard  disk  and  at  least  one  floppy  drive. 


By  "beginner",  we  mean  a  logic  designer  who  is  just  beginning  to 
use  programmable  logic.  You  may  have  a  lot  of  experience  with 
discrete  digital  logic,  or  you  may  have  just  graduated  from 
college.  We  assume  a  basic  understanding  of  digital  logic, 
although  there  is  a  logic  reference  on  page  6-1  should  you 
need  a  refresher.  Some  computer  experience  is  helpful,  but  not 
essential 

Through  this  effort,  you  will  be  introduced  to  PALASM  software 
and  to  the  concept  of  device  programming.  Because  of  the 
simplicity  of  the  designs,  we  will  be  using  PAL  devices,  but  the 
implementation  concepts  will  be  applicable  to  all  of  Monolithic 
Memories'  and  Advanced  Micro  Devices'  PLDs. 

Only  enough  detail  of  the  software  will  be  presented  to  allow  you 
to  implement  the  examples.  This  will  also  give  you  enough 
information  to  understand  most  of  the  other  design  examples  in 
this  handbook.  More  software  details  can  be  found  in  the  actual 
software  documentation  in  section  4. 

To  keep  the  discussion  simple,  we  will  make  some  assumptions 
about  the  system  being  used  to  process  your  designs.  We  will 
assume  an  IBM  PC/XT™  orcompatible  system  with  one  harddisk 
and  one  floppy  drive,  running  PC-DOS™  or  MS-DOS™  version 
2.1  or  higher.  The  installation  procedure  for  the  software  will  be 
described  for  this  setup.  The  procedures  for  other  setups  and 
machines  are  not  radically  different;  details  can  be  found  in  the 
PALASM  software  documentation,  section  4. 

We  will  also  take  no  significant  shortcuts  for  these  examples, 
even  though  there  may  be  times  when  we  could.  In  this  way,  you 
can  gain  a  better  understanding  of  exactly  what  is  happening  as 
you  implement  your  design. 

We  will  talk  about  device  programming,  describing  all  of  the  steps 
that  are  necessary  to  program  a  PLD.  However,  due  to  the  wide 
variety  of  programmers  available,  we  will  not  get  down  to  the  level 
of  detail  that  tells  you  exactly  which  buttons  to  push.  Although  we 
will  get  as  close  as  we  can,  we  must  defer  the  details  to  your 
programmer  manual. 

As  we  work  the  design  examples,  we  will,  of  course,  have  to 
choose  devices  into  which  to  put  the  designs.  However,  we  will 
not  dwell  here  on  the  parameters  that  go  into  choosing  a  device. 
This  is  covered  more  thoroughly  in  the  rest  of  this  section. 


After  you  have  turned  on  the  computer  and  allowed  it  to  boot  up, 
you  can  install  the  software  by  placing  disk  #1  into  drive  A,  and 
typing: 

A:PAL2INST<CR> 

If  you  are  new  to  the  PC,  <CR>  is  the  'RETURN'  key  on  the  right 
side  of  the  keyboard. 

The  program  will  ask  you  which  disk  you  wish  to  install  the 
programs  on.  Enter  the  disk  name,  or  just  hit  <CR>  if  installing 
onto  disk  C.  The  installation  program  creates  directories  for  all  of 
the  programs.  If  you  presently  have  an  older  version  of  PALASM 
software  installed  in  a  similar  directory  structure,  the  installation 
program  will  warn  you  that  the  contents  of  the  directories  will  be 
changed.  Otherwise,  it  will  start  installing. 

Note  that  the  installation  procedure  may  modify  two  files  which 
are  used  when  the  computer  boots  up:  AUTOEXEC.BAT  and 
CONFIG.SYS.  These  changes  should  not  affect  any  other 
programs  you  run. 

The  installation  program  will  then  give  you  a  choice  of  programs 
to  install;  hit 


to  install  the  software.  The  installation  program  will  ask  you  to 
insert  various  disks  as  it  installs  the  programs.  The  disks  are 
labelled,  so  this  should  be  a  simple  procedure.  Afterthe  software 
has  been  installed,  reboot  the  system  by  holding  down 

<CTRLXALTXDEL> 

at  the  same  time.  This  only  needs  to  be  done  after  installation.  In 
the  future  when  you  turn  the  computer  on,  the  computer  will  boot 
automatically. 

You  can  now  call  up  the  menu  by  typing 

PALASM<CR> 

Now  you  need  to  install  the  editor  or  word  processing  program 
you  wish  to  use.  You  may  use  any  editor  you  wish,  as  long  as  it 
can  generate  a  clean  ASCII  text  file.  In  addition,  you  need  to 
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install  a  communication  program;  any  one  will  do.  A  simple  one 
called  PC2  is  provided  with  the  PALASM  software.  You  can  install 
these  programs  by  hitting 

<F6> 

When  the  setup  screen  comes  up,  hit  the  down-arrow  key  until  it 
gets  to  the  line  that  says  "Editor".  Enter  the  directory  path  and 
program  name  for  the  editor  you  wish  to  use.  For  example,  if  you 
use  WordStar™,  and  you  have  the  program  located  in  the 
directory  called  "WS"  on  disk  C,  then  enter 

C: \ws\ws 

Add  enough  spaces  to  delete  whatever  was  there  before. 

This  takes  you  into  the  next  field.  Here  you  need  to  enter  the 
filetype  of  your  editor,  which  will  be  either  .COM  or  .EXE. 

Once  this  is  complete,  for  the  WordStar  example,  the  line  should 
now  look  like: 

C:\ws\ws  .com 

Hit  <CP»  again  to  get  to  the  next  field.  Here  you  enter  the  name 
of  your  communication  software.  If  you  wish  to  use  PC2,  then 
type 

\PALASM\SUPL\PC2 

(adding  spaces  to  delete  what  was  there  before.) 

<CE> 

.EXE 

If  you  are  not  using  PC2,  then  replace  "PC2"  with  the  name  of  your 
program.  This  procedure  works  regardless  of  the  editor  or 
communication  software  you  wish  to  use.  When  you  are 
finished,  hit 

<ESO 

to  get  back  to  the  menu. 
You  are  now  ready  to  start. 

Constructing  a  Combinatorial  Design 
— Basic  Gates 

The  first  example  we  will  try  is  a  very  simple  combinatorial  circuit 
consisting  of  all  of  the  basic  logic  gates,  as  shown  in  Figure  1 .  This 
will  be  helpful  for  those  designs  where  you  are  integrating  random 
logic  into  a  PAL  device  to  save  space  and  money. 

As  can  be  seen  from  thef  igure,  there  will  be  six  separate  functions 
involving  atotal  of  twelve  inputs.  It  is  important  to  bear  in  mind  that 
programmable  logic  provides  a  convenient  means  of  implement- 
ing designs.  With  a  real  design,  some  work  would  be  required 
before  this  point  to  conceptualize  the  design,  but  due  to  the 
simplicity  of  these  circuits,  we  are  already  in  a  position  to  start  the 
implementation. 


 >  

=D  

=D  

=£>  

 )E>  

Figure  1.  The  Basic  Logic  Gates 


Building  The  Equations 

We  will  start  by  generating  Boolean  equations.  The  first  function 
to  be  generated  is  an  inverter.  This  is  specified  according  to 
Figure  1  as: 

B  -  /A 

Here  the  "equals"  sign  ('=')  is  used  to  assign  a  function  to  output 
B.  The  slash  (V)  is  used  to  indicate  negation,  since  it  is  impossible 
to  put  a  bar  over  a  letter  in  an  ASCII  file.  Thus  this  equation  may 
be  read: 

B  is  TRUE  if  NOT  A  is  TRUE 

The  next  function  is  a  simple  AND  gate.  As  shown  in  Figure  1,  we 
can  write 

E  =  C*D 

Here  we  use  the  "equals"  sign  again,  but  this  time  we  have 
introduced  the  asterisk  ('*')  to  indicate  the  AND  operation.  This 
equation  may  be  read: 

E  is  TRUE  if  C  AND  D  are  TRUE 

The  third  function  is  an  OR  gate,  which  may  be  written: 

H  =  F  +  G 

The  "plus"  sign  ('+')  is  used  to  specify  the  OR  operation  here. 
Because  of  the  sum-of-products  nature  of  logic  as  implemented 
in  PLDs,  it  is  often  easy  to  place  product  terms  on  separate  lines, 
which  improves  the  readability.  We  may  rewrite  this  equation  as: 

H  =  F 
+  G 

This  equation  may  be  read: 

H  is  TRUE  if  F  OR  G  is  TRUE 
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For  the  moment,  we  will  assume  that  we  have  active-HIGH 
outputs  on  our  device.  The  functions  we  have  generated  so  far 
have  essentially  been  active-HIGH  functions.  At  times  we  wish  to 
generate  active-LOW  functions;  the  next  two  functions  are 
active-LOW  functions  that  we  wish  to  implement  in  an 
active-HIGH  device. 

When  we  talk  in  terms  of  an  active-HIGH  or  an  active-LOW 
device,  the  real  question  is  whether  there  is  an  extra  inverter  at  the 
output.  An  active-HIGH  device  has  an  AND-OR  structure;  an 
active-LOW  device  has  an  AND-OR-INVERT  structure  which 
inverts  the  function  at  the  output  (see  Figure  2). 


a.  AND-OR  Structure 


and 


NAND 
active-LOWdevice 
and  OR 


NOR  gates  could  be  generated  very  simply  in  an 
because  we  would  just  have  to  generate  AND 
unctions,  and  let  the  output  inverter  generate  their 
complements.  However,  given  that  we  wish  to  implement  these 
functions  in  an  active-HIGH  device,  we  must  invoke  DeMorgan's 
theorem,  as  shown  in  Figure  3. 

/ (X*Y)  =  /x  +  n 
/(X  +  Y)  -  /X*/Y 
Figure  3.  DeMorgan's  Theorem 


PALASM 
theorem 
writing: 


to 


b.  AND-OR-INVERT  Structure 


Figure  2.  Active  HIGH  vs.  Active  LOW 


software  has  the  capability  of  applying  DeMorgan's 
functions,  so  we  may  generate  our  NAND  function  by 


L  =  /(I*J*K) 

or,  if  preferred, 

L  m  n 

+  /J 

+  /K 

Likewise  the  NOR  function  may  be  specified  as 

o  =  /  (M 
+  N) 

or 

O  =  /M*/N 


Finally,  an  exclusive-OR  (XOR)  gate  may  be  specified  either  as 
R  =  P  :+:  Q 

where  ;+:  represents  the  XOR  operation,  or  more  explicitly  as 

R  =  P*/Q 
+  /P*Q 

We  have  now  specified  all  of  the  functions  in  terms  of  their 
Boolean  equations.  The  equations  are  summarized  in  Figure  4. 

B  =  /A 

E  -  C*D 

H  -  F 
+  G 

L  =  /I 

+  IS 
+  /K 

0  =  /M*/N 

R  -  P*/Q 
+  /P*Q 

Figure  4.  Basic  Gates  Equations 


Understanding  the  Logic  Diagram 

We  will  use  a  PAL12H6  for  this  function,  since  it  has  all  of  the 
resources  needed  to  implement  all  of  these  functions.  A  portion 
of  the  logic  diagram  for  this  device  is  shown  in  Figure  5.  The  full 
logic  diagram  for  this  device  can  be  found  on  page  5-62. 

The  logic  diagram  shows  all  of  the  logic  resources  available  in  a 
particular  device.  In  each  device,  inputs  are  provided  in  true  and 
complement  versions,  as  shown  in  Figure  5.  These  drive  what  are 
often  called  "input  lines",  which  are  the  vertical  lines  in  the  logic 
diagram.  These  input  lines  can  then  be  connected  to  product 
terms.  The  name  "product  term"  is  really  just  a  fancy  name  for  an 
AND  gate.  However,  PLDs  provide  very  wide  gates,  which  can  be 
cumbersome  to  draw.  To  save  space,  the  product  terms  are 
drawn  as  horizontal  lines  with  a  small  AND  gate  symbol  at  one  end 
to  indicate  the  function  being  performed. 

So  we  see  that  in  the  PAL12H6,  there  are  twelve  inputs  and  six 
outputs.  Four  of  the  outputs,  on  pins  14  through  17,  have  two 
product  terms  connected  to  the  OR  gates.  The  outputs  on  pins 
13  and  18  have  four  product  terms  connected  to  the  OR  gates. 
Thus  pins  13  and  18  can  be  used  to  implement  more  complex 
functions  than  the  other  four  outputs. 

Although  you  really  do  not  need  to  be  concerned  with  the  actual 
implementation  of  these  functions  inside  the  PALdevice,  you  may 
be  curious.  Figure  6  shows  howthe  inverter  and  the  AND  gate  are 
implemented  in  the  PAL12H6.  An  'X'  indicates  a  connection.  A 
product  term  that  is  not  used  is  indicated  by  an  'X'  in  the  small 
AND  gate. 
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Figure  5.  A  Portion  of  a  Logic  Diagram 
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Building  the  Design  File 

Oncethe  design  has  been  conceptualized,  the  design  file  must  be 
generated.  This  can  be  done  with  any  editor  program.  The  only 
requirement  is  that  the  editor  produce  a  clean  ASCII  file.  Most 
word  processing  programs  have  this  capability,  so  you  can  use 
your  favorite  program.  Any  hidden  control  characters  for  format- 
ting may  cause  problems  when  assembling;  thus  only  "non- 
document"  or  "clean"  modes  should  be  used.  The  filename  is 
usually  given  the  extension  '.PDS'  (for  PAL  device  Design  Speci- 
fication), although  this  is  not  required. 

We  now  know  exactly  what  our  functions  are  going  to  be.  We 
have  twelve  inputs,  six  outputs,  and  the  NAND  function  requires 
three  product  terms.  Note  that  if  we  had  specified 

L  =  /(I*J*K) 

instead  of 


L  =  /I 
+  /  J 
+  /K 


forthe  NAND  gate,  it  would  not  be  as  obvious  how  many  product 
terms  would  be  needed. 


We  are  now  in  a  position  to  create  the  design  file.  First  you  need 
to  enter  the  PALASM  menu.  This  menu  will  let  you  perform  all  of 
the  tasks  you  need,  including  editing  your  design  file.  Call  up  the 
menu  by  typing 


The  first  step  is  to  provide  some  documentation  information.  This 
is  done  with  six  keywords: 

title       (The  title  of  your  design  goes  here) 

pattern    (The  pattern  name  or  number  goes  here) 

revision  (The  revision  number  or  level  goes  here) 

author     (Your  name  goes  here) 

company    (Your  company  name  goes  here) 

date        (The  date  of  creation  or  modification  goes  here) 

This  is  followed  by  the  CHIP  declaration,  which  tells  the  assem- 
bler which  device  is  being  used  and  what  the  pin  names  are  going 
to  be.  We  do  this  by  typing: 

chip      (any  name  goes  here)  (the  device  type  goes  here) 
(the  pin  list  goes  here) 

Thus,  we  can  create  the  declaration  section  for  our  design  as 
shown  in  Figure  7. 

TITLE  Basic  gates 

PATTERN  P0000 

REVISION  A 

AUTHOR  Stateyour  Namehere 

COMPANY  Name  of  your  Co . ,   Inc . 

DATE  7/22/87 

CHIP     GATES  PAL12H6 
ACDFGIJKMGND 
NPBEHROLQ  VCC 

Figure  7.  The  Declaration  Section 


When  prompted,  hit  any  key  to  bring  the  menu  onto  the  screen. 

Enter  the  file  name  of  the  design  file  you  wish  to  create  in  the  field 
called  "Input  PDS  file".  The  cursor  should  be  there  when  the 
menu  appears  on  the  screen.  Then  hit 


<CR> 

This  will 
".PDS". 
files.  Hit 

<CR> 


We 


take 


you  to  the  filetype  field,  which  should  already  say 
recommend  that  you  use  ".PDS"for  all  of  your  design 


again.  At  this  time,  the  menu  will  say  that  the  file  cannot  be 
opened;  this  is  just  because  you  have  not  actually  created  the  file 


yet,  so  the 


>  is  no  need  to  be  concerned.  You  can  now  create  your 


design  file  by  hitting 
<F3> 

This  will  call  up  your  editor,  and  you  can  begin  building  the  file. 

We  start  with  what  is  called  the  DECLARATION  SECTION.  This 
section  al  ows  you  to  document  your  design,  and  also  provides 
some  definitions  for  the  assembler. 


Note  that  the  pin  list  has  all  pins  in  order,  from  pin  1  to  pin  20, 
including  VCC  and  GND.  You  can  see  that  the  inputs  were 
assigned  to  pins  1-12  and  19,  whereas  the  outputs  have  been 
assigned  to  pins  13-18.  Of  course  these  assignments  must  be 
made  knowing  which  pins  are  inputs  and  outputs,  as  per  the  logic 
diagram.  Knowing  that  the  NAND  gate  requires  three  product 
terms,  we  assigned  this  function  to  pin  1 8,  since  it  can  provide  up 
to  four  product  terms. 

We  can  make  the  pin  list  easierto  read  by  adding  the  pin  numbers 
as  comments.  Anything  following  a  semicolon  (;)  is  treated  as  a 
comment,  and  ignored  by  the  processor.  This  helps  make  the 
design  file  more  readable.  The  pin  list  with  comments  is  shown 
in  Figure  8.  Notice  how  much  easier  it  is  to  read. 

CHIP     GATES  PAL12H6 

;PINS  123456789  10 
ACDFGI     JKM  GND 

;PINS  11  12  13  14  15  16  17  18  19  20 
NPBEHROLQ  VCC 

Figure  8.    Pin  List  with  Comments 
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Next  we  enter  the  equations, 
keyword 


This  section  begins  with  the 


This  gives  you  a  choice  of  processing  options  on  the  right  of  the 
screen.  To  run  all  of  the  processing  steps  automatically,  hit 


EQUATIONS 

after  which  all  of  the  equations  are  entered,  exactly  as  shown  in 
Figure  4.  Note  that  the  equations  may  be  defined  in  any  order,  not 
necessarily  in  the  order  presented  in  the  pin  list. 

The  fact  that  this  is  an  active-HIGH  device  does  have  some 
bearing  on  the  way  the  file  is  built.  This  is  discussed  at  length  on 
page  6-19  and  in  the  software  documentation;  we  will  need  to 
address  the  issue  again  below,  when  we  do  an  active-LOW 
design.  For  now,  suffice  it  to  say  that  the  design  as  specified  here 
will  work  correctly  for  an  active-HIGH  device. 

Finally,  add  comments  to  the  equations  to  document  the  design. 
Liberal  commenting  is  encouraged  to  make  it  easier  for  you  and 
others  to  understand  your  design  in  the  future. 

The  equation  section,  with  comments,  is  shown  in  Figure  9. 


EQUATIONS 

B  =  /A 

E  =  C*D 

H  -  F 
+  G 

L  -  n 

+  /J 
+  /K 


; inverter 
;  AND  gate 
;OR  gate 


;NAND  gate  after  applying 
;  DeMorgan's  theorem 


O  -  /M*/N  ;NOR  gate  after  applying 

;  DeMorgan' s  theorem 

R  =  p*/Q  ;XOR  gate,  expanded 

+  /P*Q 

Figure  9.  The  Equation  Section 


When  you  have  completed  the  file,  save  the  file  and  get  out  of  the 
editor. 

Generating  a  JEDEC  File 

Once  the  design  file  has  been  entered,  you  can  assemble  the 
designtogetaJEDECfile.  We  have  two  purposes  here:  tomake 
sure  there  are  no  basic  mistakes  in  the  file,  and  to  generate  a 
JEDEC  file  for  programming. 


To  start 

<F5> 


!,  hit 


The  first  program  will  now  run,  as  evidenced  by  the  file  scrolling 
up  the  bottom  of  the  screen.  If  there  are  any  errors,  they  will 
appear  on  the  screen  and  in  an  error  file.  These  errors  will  be 
syntactical  in  nature;  typos,  misspelled  words,  missing  declara- 
tions, malformed  expressions. 

Once  the  first  prog  ram  has  been  run  with  no  errors,  the  equations 
are  minimized.  It  is  a  good  practice  to  run  the  expander  and 
minimizer  for  all  designs.  Minimization  may  allow  you  to  fit  a 
design  into  a  smaller  device.  It  also  improves  the  testability  of 
your  design.  More  complicated  designs  may  have  to  be  ex- 
panded and  minimized. 

After  the  minimization  has  been  performed,  the  JEDEC  file  will  be 
generated.  If  there  are  any  reported  errors,  it  will  likely  be 
because  you  requested  some  function  that  the  part  you  chose 
could  not  provide.  Some  examples  are: 

•  Active-HIGH  equations  in  an  active-LOW  device 

•  Too  many  product  terms 

•  Registered  equations  in  a  device  that  has  no  registers 

Since  you  have  used  the  "autorun"  feature,  the  simulation  pro- 
gram will  automatically  be  run.  Since  we  have  not  specified  the 
simulation  yet,  this  will  generate  an  error.  Ignore  the  error 
message  for  now. 

When  processing  is  complete,  hit 

<ESO 

to  get  back  to  the  main  menu. 

If  there  were  any  errors  during  processing,  you  can  see  what  they 
were  by  viewing  the  run-time  log.  This  is  done  by  hitting 

<F7> 

to  get  a  choice  of  things  to  view.  To  view  the  run-time  log,  hit 


You  can  scroll  down  until  you  find  any  errors  and  make  a  note  of 
what  needs  correcting.  Once  you  have  found  the  error(s),  you  can 
get  back  to  the  menu  by  hitting 

<ESCXESC> 

Any  corrections  should  be  made  by  editing  the  design  file.  After 
making  any  changes,  the  design  needs  to  reprocessed. 
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The  JEDEC  file  will  have  the  same  file  name  as  your  design  file, 
except  that  it  will  have  the  extension  'JED'.  The  program  also 
creates  an  'xplot'  file  (with  file  extension  '.XPT)  which  relates  the 
final  design  implementation  to  the  logic  diagram  of  the  PLD  being 
used.  The  xplot  file  is  usually  not  needed,  but  it  confirms  the 
physical  implementation  of  the  gates.  Figure  1 0  shows  a  portion 
of  the  xplot  for  this  design;  the  inverter  and  the  AND  gate  are 
shown.  You  can  see  how  this  relates  to  the  logic  diagram  by 
referring  back  to  Figure  6. 

You  may  view  the  complete  xplot  file  by  hitting 

<F7XPgDnXl> 

To  return  to  the  main  menu  after  inspecting  the  xplot  file,  hit 

<ESO 
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Figure  10.  A  Portion  of  an  Xplot 

Simulating  the  Gates 

After  you  have  verified  that  your  design  file  is  correct,  it  is  time  to 
verify  that  the  design  itself  is  correct.  This  is  done  by  simulating 
the  design.  Simulation  provides  a  way  for  you  to  see  whether  your 
design  is  working  as  you  expect  it  to.  You  provide  a  series  of 
commands,  or  events,  which  are  then  simulated  by  the  software. 
If  requested,  the  software  can  tell  you  if  the  simulation  matches 
what  you  expect,  and,  if  not,  where  the  problems  are. 

The  simulation  section  is  the  last  part  of  the  design  file.  It  is  not 
required,  but  is  invariably  helpful  both  in  debugging  the  design, 
and  in  generating  what  can  eventually  be  used  as  a  portion  of  a 
test  vector  sequence.  It  is  introduced  by  the  keyword: 

SIMULATION 

The  events  to  be  simulated  are  specified  by  placing  logic  levels 
on  pins  of  interest.  Any  pins  not  specifically  mentioned  will  either 


maintain  their  present  level,  or,  if  no  level  was  ever  defined, 
remain  undefined.  The  SETF  command  is  used  to  set  levels. 

We  can  start  by  simulating  the  inverter.  To  prove  that  the  inverter 
works  as  expected,  first  we  wish  to  set  input  A  HIGH  to  verify  that 
the  output  goes  LOW.  This  is  done  with  the  statement: 

SETF  A 

Since  all  we  did  here  was  set  A  HIGH,  all  other  inputs  are  still 
undefined.  However,  this  is  enough  to  determine  the  output  for 
signal  B. 

The  simulator  will  calculate  the  value  for  B  and  put  it  in  the  output 
file  (to  be  discussed  later),  but  it  is  helpful  to  write  what  you  expect 
B  to  be.  If  the  simulator  calculates  a  result  that  is  different  from 
what  you  expect,  it  will  alert  you,  and  place  a  marker  in  the  output 
file  to  tell  you  where  the  problem  is.  You  can  do  this  with  the 
CHECK  statement.  Since  A  is  HIGH  here,  we  expect  B  to  be 
LOW.  To  verify  that  B  is  indeed  LOW,  use  the  statement: 

CHECK  /B 

Note  that  CHECK  statements  are  not  required  for  simulation. 
However,  without  them,  you  must  examine  the  simulation  output 
to  see  if  the  design  functioned  correctly. 

We  also  need  to  verify  that  when  A  goes  LOW,  B  goes  HIGH,  as 
expected.  This  can  be  done  with  the  pair  of  statements: 

SETF  /A 
CHECK  B 

We  have  now  fully  exercised  the  inverter.  However,  the  standard 
output  file  forthe  simulator,  called  the  history  file,  tracks  all  signals 
on  the  device,  in  increasing  pin  order.  From  Figure  8,  this  would 
be  in  the  order 

ACDFGI  JKMNPBEHROLQ 

Instead,  we  may  wish  to  place  related  signals  near  each  other. 
We  can  do  this  with  the  TRACE_ON  and  TRACE  OFF  com- 
mands. These  commands  cause  a  second  output  file  to  be 
generated,  which  will  only  show  signals  and  events  as  deter- 
mined by  the  TRACE_ON  command. 

In  our  case,  we  would  like  to  have  the  output  of  the  gate  appear 
right  after  its  inputs,  so  that  the  workings  of  the  gate  can  be  easily 
examined.  This  can  be  done  with  the  command: 

TRACE_ON  ABCDEFGHIJKLMNOPQR 

At  the  end  of  the  simulation,  the  TRACE_OFF  command  is  used 
to  end  the  special  trace. 

Note  that  both  a  history  file  and  a  trace  file  will  now  be  generated. 
The  history  file  will  have  the  simulation  results  in  increasing  pin 
order;  the  trace  file  will  have  the  signals  in  the  order  requested 
with  the  TRACE_ON  command. 
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Now  we  can  exercise  all  of  the  other  gates  in  the  circuit.  The  full 
simulation  section  is  shown  in  Figure  11. 

SIMULATION 

TRACE_ON  ABCDEFGHIJKLMNOPQR 


;look  at  the  inverter 
SETF  A 
CHECK  /B 
.SETF  /A 
CHECK  B 


/look  at  AND  gate 

SETF   /C  /D 

CHECK  /E 

SETF  D 

CHECK  /E 

SETF  C  /D 

CHECK  /E 

SETF  D 

CHECK  E 


; look  at  OR  gate 

SETF  /F  /G 

CHECK  /H 

SETF  G 

CHECK  H 

SETF  F 

CHECK  H 

SETF  /G 

CHECK  H 


; look  at  NAND  gate 

SETF  /I  /J  /K 

CHECK  L 

SETF  K 

CHECK  L 

SETF  J  /K 

CHECK  L 

SETF  K 

CHECK  L 

SETF  I  /J  /K 

CHECK  L 

SETF  K 

CHECK  L 

SETF  J  /K 

CHECK  L 

SETF  K 

CHECK  /L 


;look  at  NOR  gate 

SETF  /K  /N 

CHECK  0 

SETF  N 

CHECK  /0 

SETF  M 

CHECK  10 

SETF  /N 

CHECK  /O 


;look  at  XOR  gate 

SETF  /P  /Q 

CHECK  /R 

SETF  Q 

CHECK  R 

SETF  P  /a 

CHECK  R 

SETF  Q 

CHECK  /R 


,-set  A  HI 

; verify  that  B  is  LO 
;set  A  LO 

,-verify  that  B  is  HI 
;end  of  inverter  trace 


;set  C,   D  LO 

,-verify  E  LO 

;C  stays  LO,  D  goes  HI 

;E  should  stay  LO 

;set  C  HI,  D  LO 

;E  should  stay  LO 

;C  stays  HI,  D  goes  HI 

;E  should  now  be  HI 

;end  of  AND  gate  trace 


;set  F,   G  LO 

,-verify  H  LO 

;F  stays  LO,   G  goes  HI 

;H  should  go  HI 

,-G  stays  HI,  F  goes  HI 

,-H  should  stay  HI 

;F  stays  HI,  G  goes  LO 

;H  should  stay  HI 

;end  of  OR  gate  trace 


;IJK  =  000 
,-verify  L  HI 
;IJK  =  001 

;L  should  still  be  HI 
;IJK  -  010 

,-L  should  still  be  HI 
;IJK  =  011 

,-L  should  still  be  HI 
,-IJK  =  100 

,-L  should  still  be  HI 
;IJK  -  101 

;L  should  still  be  HI 
,-IJK  •  110 

,-L  should  still  be  HI 

;IJK  =  111 

;L  should  go  LO 

,-end  of  NAND  gate  trace 


;set  M,  N  LO 

,-verify  O  HI 

;M  stays  LO,  N  goes  HI 

;0  should  go  LO 

,-M,  N  now  both  HI 

;0  should  stay  LO 

;M  stays  HI,   N  goes  LO 

;0  should  stay  LO 

;end  of  NOR  gate  trace 


,-set  P,   Q  LO 

,-verify  R  LO 

;P  stays  LO,  Q  goes  HI 

,-R  should  go  HI 

,-now  P  HI,   Q  LO 

,-R  should  stay  HI 

;both  inputs  HI 

;R  should  go  LO 


This  completes  the  simulation.  The  entire  design  file  is  shown  in 
Figure  21  at  the  end  of  this  section.  You  can  now  run  the 
simulation  to  make  sure  that  your  design  is  correct. 

Running  the  Simulator 

After  adding  the  simulation,  the  file  needs  to  be  reprocessed.  Do 
this  again  by  hitting 

<F5> 

and  then  hitting 


This  completely  reprocesses  the  file  and  runs  the  simulation. 

Each  of  the  events  specified  in  the  simulation  section  of  the 
design  file  is  executed.  The  resulting  outputs  are  calculated,  and, 
if  CHECK  statements  are  used,  the  calculated  outputs  are  com- 
pared to  the  expected  outputs.  If  the  two  values  do  not  match, 
then  you  are  expecting  an  output  that  the  circuit  is  not  generating. 
This  means  either  that  the  expected  values  are  wrong,  or  that 
there  is  a  problem  with  the  circuit  itself.  In  either  case,  the  design 
file  must  be  modified  to  fix  the  problem,  and  then  the  file  should 
be  completely  reprocessed. 

The  main  output  of  the  simulator  is  the  history  file.  In  addition,  a 
tracefile  will  be  generated  if  the  TRACE_ON  statement  was  used, 
as  in  the  example  above.  These  files  will  have  the  same  name  as 
your  original  design,  except  that  the  extension  for  the  history  file 
is  '.HST'  and  that  for  the  trace  file  is  '.TRF'. 

You  may  look  at  the  history  and  trace  files  with  your  editor,  or  print 
them  out  for  closer  analysis.  Note  that  the  history  file  contains  the 
results  of  every  signal  and  every  event  in  increasing  pin  order, 
whereas  the  trace  file  contains  signals  in  the  order  specifically  re- 
quested by  the  TRACE_ON  commands. 

It  is  usually  easier  to  look  at  the  simulation  output  as  waveforms. 
To  generate  the  waveforms,  hit 

<F7> 

To  see  the  waveforms  for  the  trace  file,  hit 


Once  the  waveforms  are  on  the  screen,  you  may  scroll  around 
using  the  arrow  keys.  The  vertical  bar  cursor  will  help  you  line  up 
events. 

After  you  are  through  examining  the  file,  hit 

<ESO 

to  get  out  of  the  waveform  generator,  and  then, 

<ESO 

to  get  back  to  the  main  menu. 


TRACE  OFF 


,-end  of  trace 

Figure  11.  The  Simulation  Section 
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This  makes  it  much  easier  to  debug  your  design.  You  can  look  at 
the  entire  simulation  in  the  history  file,  or  break  it  up  into  manage- 
able bits  in  the  trace  file. 

The  simulator  also  converts  the  simulation  results  into  test 
vectors,  and  appends  the  vectors  to  the  JEDEC  file.  This  gives 
you  two  JEDEC  files;  one  with  vectors  and  one  without.  The  new 
file  has  the  same  name  as  your  original  design  file,  but  with  the 
extension  'JDC.  This  file  can  be  used  with  programmers  that 
provide  functional  tests. 

Constructing  a  Registered  Design- 
Basic  Flip-Flops 

Next  we  will  do  a  very  simple  registered  design:  we  will  be 
designing  all  of  the  basic  flip-flop  types  (Figure  12).  We  will 
conceptualize  the  design  by  reviewing  briefly  the  behavior  of  the 
D-type  flip-flop.  We  will  then  present  the  results  for  T,  J-K,  and  S- 
R  flip-flops.  More  detail  on  these  flip-flop  types  can  be  found  in 
the  logic  reference,  on  page  6-8. 

Rememberthat  the  devices  we  will  be  using  only  have  D-type  flip- 
flops.  Thus  we  will  be  emulating  the  other  flip-flops  with  D-type 
flip-flops. 


CLK 
D 


D  C  o 

>  P  Q 


T  C  Q 
>  P° 


J  C  Q 
t> 


S  C  Q 

-t> 

R  p  Q 


DT 
DC 


Building  the  D-Type  Flip-Flop 
Equations 

A  D-type  flip-flop  merely  presents  the  input  data  at  the  output  after 
being  clocked.  Its  basic  transfer  function  can  be  expressed  as 

DT  :=  D 

where  we  have  used  pins  DT  ("D  True")  and  D  as  shown  in 
Figure  12. 

Note  the  use  of  ':='  here  instead  of  '='.  This  indicates  that  the 
output  is  registered  for  this  equation.  The  difference  is  illustrated 
in  Figure  13. 


Figure  13.  Registered  vs.  Combinatorial  Equations 

We  can  also  generate  the  complement  signal  (named  DC)  with 
the  statement 


/D 


As  per  Figure  12,  we  want  to  add  synchronous  preset  and  clear 
functions  to  the  flip-flops.  This  can  be  done  with  two  input  pins, 
called  PR  and  CLR.  To  add  these  functions  to  the  true  flip-flop 
signal,  we  add  /CLR  to  every  product  term  and  add  one  product 
term  consisting  only  of  PR.  Likewise,  for  the  complement 
functions,  we  add  /PR  to  every  product  term,  and  add  one  product 
term  consisting  only  of  CLR.  This  is  shown  in  Figure  14. 

DT  D*/CLR 
+  PR 


Figure  12.  Basic  Flip-Flops 


DC  /DVPR 
+  CLR 


Figure  14.  D-Type  Flip-Flops  with  Clear  and  Preset 
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In  this  way,  when  clearing  the  flip-flops,  the  active-HIGH  flip-flops 
have  no  product  terms  true,  and  go  LOW;  the  active-LOW  flip- 
flops  have  the  last  product  term  true,  and  will  therefore  go  HIGH. 
The  reverse  will  occur  for  the  preset  function. 

There  is  still  one  hole  in  this  design:  what  happens  if  we  preset 
and  clear  at  the  same  time?  As  it  is  right  now,  both  outputs  will  go 
HIGH.  This  makes  no  sense  since  one  signal  is  supposed  to  be 
the  inverse  of  the  other.  To  rectify  this,  we  can  give  the  clear 
function  priority  over  the  preset  function.  We  can  do  this  by 
placing  /CLR  on  every  product  term  for  the  true  flip-flop  signal. 
The  results  are  shown  in  Figure  15. 


DT  :-  D*/CLR 
+    PR* /CLR 

DC   :=  /D*/PR 
+  CLR 

Figure  15.  D-Type  Flip-Flops  with  Clear  Priority 

Building  the  Remaining  Equations 

The  same  basic  procedure  can  be  applied  to  all  of  the  other  flip- 
flops.  The  equations  are  shown  in  Figure  16. 


EQUATIONS 


; emulating  all  flip-flops  with  D-type  flip-flops 


DT 

D*/CLR 

/output  is  D  if  not  clear 

+ 

PR* /CLR 

;or  1  if  preset  and  not  clear  at  the  same  time 

DC 

/D*/PR 

/output  is  /D  if  not  preset 

+ 

CLR 

/or  1  if  clear 

TT 

T*/TT*/CLR 

/go  HI  if  toggle  and  not  clear 

+ 

/T*TT*/CLR 

/ stay  HI  if  not  toggle  and  not  clear 

+ 

PR* /CLR 

/go  HI  if  preset  and  not  clear  at  the 

same  time 

TC 

T*/TC*/PR 

/go  HI  if  toggle  and  not  preset 

+ 

/T*TC*/PR 

/stay  HI  if  not  toggle  and  not  preset 

+ 

CLR 

/go  HI  if  clearing 

JKT 

J*/JKT*/CLR 

/go  HI  if  J  and  not  clear 

+ 

/K*JKT*/CLR 

/ stay  HI  if  not  K  and  not  clear 

+ 

PR* /CLR 

/go  HI  if  preset  and  not  clear  at  the 

same  time 

JKC 

/J*/JKC*/PR 

/go  HI  if  not  J  and  not  preset 

+ 

K*/JKC*/PR 

/stay  HI  if  K  and  not  preset 

+ 

CLR 

/go  HI  if  clear 

SRT 

S*/CLR 

/go  HI  if  set  and  not  clear 

+ 

/R*SRT*/CLR 

/ stay  HI  if  not  reset  and  not  clear 

+ 

PR* /CLR 

/go  HI  if  preset  and  not  clear  at  the 

same  time 

SRC 

R*/PR 

/go  HI  if  reset  and  not  preset 

+ 

/S*SRC*/PR 

/stay  HI  if  not  set  and  not  preset 

+ 

CLR 

/go  HI  if  clear 

Figure  16.  Flip-Flop  Equations  Section 
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[ff]  >  r  [Trr]  VCLR 

+  /T-[W]7CLR 
+  PR7CLR 


Figure  17.  Feedback  in  the  Equation  for  TT 


Notice  that  in  some  of  the  equations  above,  the  output  signal  itself 
shows  up  in  the  equations.  This  is  the  way  in  which  feedbackfrom 
the  flip-flop  can  be  used  to  determine  the  next  state  of  the  flip-flop. 
An  equivalent  logic  drawing  of  the  TT  equation  is  shown  in 
Figure  17. 


Completing  the  Design  File 


We  are  now  in  a  position  to  select  a  device  and  complete  the 
device  file.  The  device  used  will  be  a  PAL1 6R8,  which  is  actually 
an  active-LOW  device.  The  logic  diagram  for  this  device  can  be 
found  on  page  5-47.  The  implications  of  an  active-LOW  device 
are  discussed  more  fully  on  page  6-1 9.  For  now,  suffice  it  to  say 
that  one  way  of  implementing  active-LOW  logic  is  to  declare 
outputs  in  the  pin  list  with  a  slash  (V)  in  front  of  them.  This 
essentially  defines  them  as  'negative'  or  'inverted'  pins. 

The  declaration  section  is  built  up  just  like  the  last  one,  and  is 
shown  in  Figure  18. 

Basic  flip-flops 
P0001 
A 

Stateyour  Namehere 
Nameof your  Co . ,  Inc. 
7/22/87 

Flip-flops  PAL16R8 

:.  !     1         2345       6         789  10 
CLK     J    K    T     PR    CLR    D     S    R  GND 

!    11     12        13        14  15 
OE     /SRC     /SRT     /DC  /DT 

;PINS     16       17       18         19  20 
/TC     /TT     /JKC     /JKT  VCC 

Figure  18.  Basic  Flip-Flops  Declaration  Section 

Notice  thatthis  device  has  aclock  and  an  output  enable  pin,  called 
CLK  and  OE,  respectively.  The  input  and  output  pins  are  named 
as  shown  in  Figure  12.  The  output  pins  are  all  defined  with 
slashes,  to  indicate  that  they  are  inverted  pins. 


The  design  can  now  be  processed  by  hitting 


<F5> 


and  then  hitting 


This  will  tell  you  whether  there  are  any  basic  problems  with  your 
design.  You  can  correct  any  mistakes  by  editing  the  file  and  then 
reprocessing. 

Simulating  the  Flip-Flops 

After  processing  the  design  and  correcting  any  mistakes,  we  can 
write  the  simulation.  We  have  all  of  the  simulation  instructions  we 
need,  except  for  one  new  instruction  which  simplifies  manipula- 
tion of  the  clock  signal.  It  is  possible  to  use  SETF  instructions  with 
the  clock  pin,  but  then  each  clock  transition  would  require  two 
instructions:  one  to  set  the  clock  HIGH,  and  one  to  bring  it 
back  LOW. 

Instead,  we  can  use  the  CLOCKF  instruction.  This  pulses  a  clock 
pin  in  one  instruction.  Of  course,  registered  outputs  will  not 
change  state  until  after  the  rising  edge  of  the  clock  signal.  Since 
we  have  named  the  clock  pin  CLK,  we  can  clock  this  device  with 
the  instruction 

CLOCKF  CLK 

We  can  simulate  the  true  D-type  flip-flop  as  follows: 

SETF  D  ;set  the  D  input  HI 

CLOCKF  CLK  /clock  the  device 

CHECK  DT  /verify  that  the  output  went  HI 

SETF  /D  /set  the  D  input  LO 

CLOCKF  CLK  /clock  the  device 

CHECK  /DT  /verify  that  the  output  went  LO 

Before  we  actually  simulate  any  registered  design  like  this,  two 
items  must  be  initialized:  the  clock  and  the  output  enable  pin  (OE). 
Since  the  CLOCKF  statement  executes  a  HIGH-LOW  pulse  on 
the  clock  pin,  we  must  first  make  sure  that  the  clock  is  set  LOW 
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to  begin  with.  We  must  also  enable  the  outputs,  which  is  done  by 
setting  the  OE  pin  LOW.  These  tasks  can  be  accomplished  with 
the  statement: 

SETF  /CLK  /OE 

We  also  need  to  initialize  the  flip-flops.  At  the  same  time  we  can 
verify  that  the  CLR  function  is  working  correctly; 


SETF  CLR  /PR 
CLOCKF  CLK 


; set  the  clear  pin 
; clear  the  circuit 


/make  sure  the  clear  function  worked 
CHECK  /DT  DC  /TT  TC  /JKT  JKC  /SRT  SRC 
SETF  /CLR  ;  remove  the  clear  signal 

We  are  now  in  a  position  to  sim  ulate  the  entire  circuit.  A  thorough 
simulation  is  shown  in  Figure  19. 


SIMULATION 

TRACE  ON  CLR  PR  D  DT  DC  T  TT  TC  JK  JKT  JKC  SR  SRT  SRC 


/initialize  the  circuit 
SETF  CLR  /PR 
CLOCKF  CLK 


; set  the  clear  pin 
; clear  the  circuit 


;make  sure  the  clear  function  worked 

CHECK  /DT  DC  /TT  TC  / JKT  JKC  /SRT  SRC 

SETF  /CLR  ; remove  the  clear  signal 

/check  out  the  preset  function 

SETF  PR  ;set  the  preset  pin 

CLOCKF  CLK  /preset  the  circuit 

CHECK  DT  /DC  TT  /TC  JKT  /JKC  SRT  /SRT 

SETF  /PR  /remove  the  preset  signal 

/verify  that  clear  has  priority 

SETF  PR  CLR  /set  both  clear  and  preset 

CLOCKF  CLK 

CHECK  /DT  DC  /TT  TC  /JKT  JKC  /SRT  SRC 

SETF  /PR  /CLR  /remove  both  preset,  clear 

/disable  all  flip-flop  inputs  for  now 
SETF  /D  /T  /J  /K  /S  /R 


/check  out  D-type  flip-flops 
SETF  D 
CLOCKF  CLK 

DT  /DC 


SETF  D 
CLOCKF  CLK 
CHECK  DT  /DC 

SETF  /D 
CLOCKF  CLK 
CHECK  /DT  DC 


SETF  /D 
CLOCKF  CLK 

/DT  DC 


/set  the  D  input  HI  from  LO 
/clock  the  device 

/verify  that  the  true  output  went  HI 
/and  the  complement  went  LO 

/hold  a  HI 

/verify  that  state  maintained 

/set  the  D  input  LO  from  HI 
/clock  the  device 

/verify  that  the  true  output  went  LO 
/and  complement  went  HI 

/hold  a  LO 

/verify  that  state  maintained 


/check  out  T-type  flip-flops 

SETF  /T  /hold  a  LO 

CLOCKF  CLK 

CHECK  /TT  TC  /true  output  should  still  be  LO 

/ complement  output  HI 


SETF  T 
CLOCKF  CLK 
CHECK  TT  /TC 


/ toggle  from  a  LO 

/both  outputs  should  have  changed  state 


Figure  19.  Flip-Flop  Simulation  Section 
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SETF  /T 
CLOCKF  CLK 
CHECK  TT  /TC 

SETF  T 
CLOCKF  CLK 
CHECK  /TT  TC 


;hold  a  HI 

;both  outputs  should  have  held  their  state 
/toggle  from  a  HI 

;both  outputs  should  have  changed  state 


; check  out  J-K  flip-flops 
SETF  /J  /K 
CLOCKF  CLK 
CHECK  /  JKT  JKC 

SETF  J  /K 
CLOCKF  CLK 
CHECK  JKT  /JKC 

SETF  13  /K 
CLOCKF  CLK 
CHECK  JKT  /JKC 

SETF  /J  K 
CLOCKF  CLK 
CHECK  /JKT  JKC 

CLOCKF  CLK 
CHECK  /JKT  JKC 

SETF  J  K 
CLOCKF  CLK 
CHECK  JKT  /JKC 

SETF  J  /K 
CLOCKF  CLK 
CHECK  JKT  /JKC 

SETF  J  K 
CLOCKF  CLK 
CHECK  /JKT  JKC 


,-hold  a  LO 

;both  outputs  should  have  held  their  state 
;set  a  HI 

; outputs  should  have  changed 
;hold  a  HI 

;both  outputs  should  have  held  their  state 
; reset  a  LO 

/outputs  should  have  changed 

; reset  an  output  that  is  already  LO 
;make  sure  that  the  outputs  didn't  change 

; toggle  from  a  LO 

/verify  that  outputs  changed 

;set  an  output  that  is  already  HI 

;make  sure  the  outputs  didn't  change 

/toggle  from  a  HI 

/verify  that  outputs  changed 


; check  out  S-R  flip-flops 
SETF  /S  /R 
CLOCKF  CLK 
CHECK  /SRT  SRC 

SETF  S  /R 
CLOCKF  CLK 
CHECK  SRT  /SRC 

SETF  /S  /R 
CLOCKF  CLK 
CHECK  SRT  /SRC 

SETF  S  /R 
CLOCKF  CLK 
CHECK  SRT  /SRC 

SETF  /S  R 
CLOCKF  CLK 
CHECK  /SRT  SRC 

CLOCKF  CLK 
CHECK  /SRT  SRC 

TRACE  OFF 


;hold  a  LO 

;both  outputs  should  have  held  their  state 
;set  a  HI 

/outputs  should  have  changed 
;hold  a  HI 

;both  outputs  should  have  held  their  state 
;set  an  output  that  is  already  HI 
;make  sure  the  outputs  didn't  change 
; reset  a  LO 

; outputs  should  have  changed 

/reset  an  output  that  is  already  LO 
;make  sure  that  the  outputs  didn't  change 


Figure  19.  Flip-Flop  Simulation  Section  (Cont'd.) 
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The  file  can  now  be  simulated  in  the  same  manner  as  the  basic 
gates  design.  The  simulation  results  can  be  viewed  either  by 
examining  the  history  or  trace  file,  or  by  generating  waveforms. 
The  complete  design  file  is  shown  in  Figure  22  at  the  end  of  this 
section. 

Programming  a  Device 

After  simulating  the  design,  and  verifying  that  it  works,  it  is  time  to 
program  a  device.  There  are  several  steps  to  programming,  but 
the  exact  operation  of  the  programmer  naturally  depends  on  the 
type  of  programmer  being  used.  We  will  be  as  explicit  as  we  can 
here,  but  you  will  need  to  refer  to  your  programmer  manual  for  the 
specifics. 

The  first  thing  that  must  be  done  after  turning  the  programmer  on 
is  to  select  the  device  type.  This  tells  the  programmer  what  kind 
of  programming  data  to  expect.  The  device  type  is  usually 
selected  either  from  a  menu  or  by  entering  a  device  code.  Your 
programmer  manual  will  have  the  details. 

Next  a  JEDECfile  must  be  downloaded.  To  transfer  the  JEDEC 
file  from  the  computer  to  your  programmer,  you  will  need  to 
provide  a  connection,  as  shown  in  Figure  20.  This  is  normally 
done  with  an  RS-232  cable  connected  between  the 
programmer's  port  and  a  serial  port  on  the  computer  (usually 
COM1). 
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Figure  20.  A  Connector  Must  Be  Provided  Between  the 
Computer  and  the  Programmer 

If  your  programmer  can  perform  functional  tests,  and  you  wish  for 
those  tests  to  be  performed,  you  should  download  the  '.JDC  file; 
otherwise  you  should  download  the  '.JED'  file. 


To  download  data,  the  programmer  must  first  be  set  up  to  receive 
data.  The  programmer  manual  will  tell  you  how  to  do  this.  The 
data  can  then  be  sent  from  the  computer  by  hitting 

<F4> 

This  sets  up  communication  between  the  computer  and  the 
programmer.  Whichever  communication  program  is  installed  will 
be  invoked.  This  is  used  to  transmit  the  JEDEC  file  to  the 
programmer.  The  details  below  assume  you  are  using  PC2;  if  not, 
follow  the  instructions  for  your  program  to  accomplish  the  same 
steps. 

Before  actually  sending  the  data,  you  must  verify  the  correct  com- 
munication protocol.  Check  to  make  sure  you  know  what  protocol 
the  programmer  is  expecting;  then  hit  the  <F2>  key  on  the 
computer.  This  allows  you  to  set  up  the  baud  rate,  data  bits,  stop 
bits,  and  parity. 

Once  the  protocol  has  been  set  up,  hit  the  <F1>  key.  You  will  be 
asked  for  the  file  name;  enter  the  name  of  the  JEDECfile  you  wish 
to  use.  The  computer  will  then  announce  that  it  is  sending  the 
data,  and  tell  you  when  it  is  finished.  Note  that  just  because  it  says 
it  has  finished  sending  data  does  not  mean  that  the  data  was 
received.  Your  programmer  will  indicate  whether  or  not  data  was 
received  correctly. 

Once  the  data  has  been  received,  the  programmer  is  ready  to 
program  a  device.  Place  a  device  in  the  appropriate  socket,  and 
follow  the  instructions  for  your  programmer  to  program  the 
device.  This  procedure  programs  and  verifies  the  connections  in 
the  device,  and,  if  a  '.JDC  file  was  used,  will  perform  a  functional 
test. 

The  programmer  will  announce  when  the  programming  proce- 
dure has  been  completed.  You  may  then  take  the  device  and  plug 
it  into  your  application. 

If  you  have  actually  programmed  one  of  the  examples  that  we 
created  above,  you  naturally  don't  have  a  board  into  which  you 
can  plug  the  device.  If  you  do  have  a  lab  setup,  you  may  wish  to 
play  with  the  devices  to  verify  for  yourself  that  the  devices  perform 
just  as  you  expected  them  to. 

You  will  find  much  more  detail  on  many  issues  that  were  not 
discussed  in  this  section  in  the  remaining  sections  of  this  hand- 
book. This  section  should  have  provided  you  with  the  basic 
knowledge  you  need  to  understand  the  remaining  design  ex- 
amples in  this  book,  and  to  start  your  own  designs. 
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TITLE  Basic  gates 

PATTERN  POOOO 

REVISION  A 

AUTHOR  Stateyour  Namehere 

COMPANY  Nameof your  Co . ,  Inc . 

DATE  7/22/87 

CHIP     GATES  PAL12H6 


;PINS     1  2 
A  C 


3456789  10 
D    F     G     I     J    K     M  GND 


;PINS 


11  12  13  14  15  16  17  18  19  20 
NPBEHROLQ  VCC 


EQUATIONS 

B  =  /A 

E  =  C*D 

H  =  F 
+  G 

L  =  /I 
+  /J 
+  /K 

O  -  /M*/N 

R  =  P*/Q 
+  /P*Q 

SIMULATION 


; inverter 
;  AND  gate 
;OR  gate 

;NAND  gate  after  applying  DeMorgan's  theorem 

;NOR  gate  after  applying  DeMorgan's  theorem 
;X0R  gate,  expanded 


TRACE  ONABCDEFGHIJKLMNOPQR 


;look  at  the  inverter 
SETF  A 
CHECK  /B 
SETF  /A 
CHECK  B 


;look  at  AND  gate 
SETF  /C  /D 
CHECK  /E 
SETF  D 
CHECK  /E 
SETF  C  /D 
CHECK  /E 
SETF  D 


;look  at  OR  gate 

SETF  /F  /G 

CHECK  /H 

SETF  G 

CHECK  H 

SETF  F 

CHECK  H 

SETF  /G 

CHECK  H 


j  set  A  HI 

,-verify  that  B  is  LO 
;set  A  LO 

,-verify  that  B  is  HI 
;end  of  inverter  trace 

;set  C,  D  LO 

,-verify  E  LO 

;C  stays  LO,  D  goes  HI 

;E  should  stay  LO 

;set  C  HI,  D  LO 

,-E  should  stay  LO 

;C  stays  HI,  D  goes  HI 

,-E  should  now  be  HI 

;end  of  AND  gate  trace 

;set  F,  G  LO 

,-verify  H  LO 

,-F  stays  LO,  G  goes  HI 

,-H  should  go  HI 

,-G  stays  HI,  F  goes  HI 

;H  should  stay  HI 

;F  stays  HI,  G  goes  LO 

,-H  should  stay  HI 

;end  of  OR  gate  trace 


Figure  21.  Complete  Basic  Gates  Design  File 
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; look  at  NAND  gate 

SETF  /I  /J  /K 

CHECK  L 

SETF  K 

CHECK  L 

SETF  J  /K 

CHECK  L 

SETF  K 

CHECK  L 

SETF  I  /J  /K 

CHECK  L 

SETF  K 

CHECK  L 

SETF  J  /K 

CHECK  L 

SETF  K 

CHECK  /L 


;IJK  =  000 
;  verify  L  HI 
;IJK  =  001 

;L  should  still  be  HI 
;IJK  =  010 

;L  should  still  be  HI 
;IJK  -  Oil 

;L  should  still  be  HI 
;IJK  =  100 

;L  should  still  be  HI 
;IJK  =  101 

,-L  should  still  be  HI 
;IJK  -  110 

;L  should  still  be  HI 

;IJK  =  111 

;L  should  go  LO 

;end  of  NAND  gate  trace 


;look  at  NOR  gate 

SETF  /M  /N 

CHECK  0 

SETF  N 

CHECK  /0 

SETF  M 

CHECK  /O 

SETF  /N 

CHECK  10 


;set  M,   N  LO 

/verify  0  HI 

;M  stays  LO,  N  goes  HI 

;0  should  go  LO 

;M,  N  now  both  HI 

;0  should  stay  LO 

;M  stays  HI,  N  goes  LO 

;0  should  stay  LO 

;end  of  NOR  gate  trace 


;look  at  XOR  gate 
SETF  /P  /Q 
CHECK  /R 
SETF  Q 
CHECK  R 
SETF  P  /Q 
CHECK  R 
SETF  Q_ 
CHECK  /R 


;set  P,  Q  LO 

; verify  R  LO 

;P  stays  LO,  Q  goes  HI 

;R  should  go  HI 

;now  P  HI,   Q  LO 

;R  should  stay  HI 

/both  inputs  HI 

;R  should  go  LO 


TRACE  OFF 


;end  of  trace 


Figure  21.  Complete  Basic  Gates  Design  File  (Cont'd.) 
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TITLE  Basic  flip-flops 

PATTERN  P0001 

REVISION  A 

AUTHOR  Stateyour  Namehere 

COMPANY  Nameof  your  Co . ,   Inc . 

DATE  7/22/87 


CHIP     Flip-flo  PAL16R8 


;PINS 

1 

2  3 

4 

CLK 

J  K 

T  1 

;PINS 

11 

12 

13 

OE 

/SRC 

/SRT 

;PINS 

16 

17 

18 

/TC 

/TT 

/JKC 

5  6 
PR  CLR 


14 


19 


15 
/DT 


20 


9  10 
R  GND 


EQUATIONS 

; emulating  all  flip-flops  with  D-type  flip-flops 


DT  :=  D*/CLR 
+  PR* /CLR 


; output  is  D  if  not  clear 

;or  1  if  preset  and  not  clear  at  the  same  time 


DC  :=  /D*/PR 
+  CLR 


.•output  is  /D  if  not  preset 
;or  1  if  clear 


:=  T*/TT*/CLR 
+  /T*TT*/CLR 
+  PR* /CLR 


;go  HI  if  toggle  and  not  clear 

; stay  HI  if  not  toggle  and  not  clear 

;go  HI  if  preset  and  not  clear  at  the  same  time 


TC  :=  T*/TC*/PR 
+  /T*TC*/PR 
+  CLR 


;go  HI  if  toggle  and  not  preset 

; stay  HI  if  not  toggle  and  not  preset 

;go  HI  if  clearing 


JKT  :=  J*/JKT*/CLR 
+  /K*  JKT* /CLR 
+  PR* /CLR 


;go  HI  if  J  and  not  clear 

; stay  HI  if  not  K  and  not  clear 

;go  HI  if  preset  and  not  clear  at  the 


same  time 


:=  /J*/JKC*/PR 
+  K* JKC* /PR 
+  CLR 


;go  HI  if  not  J  and  not  preset 
;stay  HI  if  K  and  not  preset 
;go  HI  if  clear 


:=  S*/CLR 
+  /R*  SRT* /CLR 
+  PR* /CLR 


;go  HI  if  set  and  not  clear 

; stay  HI  if  not  reset  and  not  clear 

;go  HI  if  preset  and  not  clear  at  the  same  time 


SRC  :=  R*/PR  ;go  HI  if  reset  and  not  preset 

+  /S*SRC*/PR  ;stay  HI  if  not  set  and  not  preset 

+  CLR  ;go  HI  if  clear 

Figure  22.  Complete  Basic  Fllp-Flops  Design  File 
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SIMULATION 

TRACE  ON  CLR  PR  D  DT  DC  T  TT  TC  JK  JKT  JKC  SR  SRT  SRC 


; initialize  the  circuit 
SETF  CLR  /PR 
CLOCKF  CLK 


; set  the  clear  pin 
/clear  the  circuit 


/make  sure  the  clear  function  worked 

CHECK  /DT  DC  /TT  TC  /JKT  JKC  /SRT  SRC 

SETF  /CLR  ;  remove  the  clear  signal 

; check  out  the  preset  function 

SETF  PR  ;set  the  preset  pin 

CLOCKF  CLK  /preset  the  circuit 

CHECK  DT  /DC  TT  /TC  JKT  /JKC  SRT  /SRT 

/PR  /remove  the  preset  signal 


/verify  that  clear  has  priority 

SETF  PR  CLR  /set  both  clear  and  preset 

CLOCKF  CLK 

CHECK  /DT  DC  /TT  TC  /JKT  JKC  /SRT  SRC 

SETF  /PR  /CLR  /remove  both  preset,  clear 

/disable  all  flip-flop  inputs  for  now 
SETF  /D  /T  /J  /K  /S  /R 


/check  out  D-type  flip-flops 
SETF  D 
CLOCKF  CLK 
CHECK  DT  /DC 


SETF  D 
CLOCKF  CLK 
CHECK  DT  /DC 

SETF  /D 
CLOCKF  CLK 
CHECK  /DT  DC 

SETF  /D 
CLOCKF  CLK 
CHECK  /DT  DC 


/set  the  D  input  HI  from  LO 
/clock  the  device 

/verify  that  the  true  output  went  HI  and  the  complement  went  LO 
/hold  a  HI 

/verify  that  state  maintained 

/set  the  D  input  LO  from  HI 
/clock  the  device 

/verify  that  the  true  output  went  LO  and  complement  went  HI 
/hold  a  LO 

/verify  that  state  maintained 


/check  out  T-type  flip-flops 
SETF  II 
CLOCKF  CLK 
CHECK  /TT  TC 


SETF  T 
CLOCKF  CLK 
CHECK  TT  /TC 

SETF  /T 
CLOCKF  CLK 
CHECK  TT  /TC 

SETF  T 
CLOCKF  CLK 
CHECK  /TT  TC 


hold  a  LO 

/true  output  should  still  be  LO,  complement  output  HI 
/toggle  from  a  LO 

/both  outputs  should  have  changed  state 
/hold  a  HI 

/both  outputs  should  have  held  their  state 
/toggle  from  a  HI 

/both  outputs  should  have  changed  state 

Figure  22.  Complete  Basic  Flip-Flops  Design  File  (Cont'd.) 


2-18 


Beginner's  Guide 


; check  out  J-K  flip-flops 

SETF  /J  /K 

,-hold  a  LO 

PT  .OPKF1  P  T  ,K 

■Virtt"h   f*>i  1 1"  tm  1 1" «?    *5hriiilH  Viav^  hial  H  t"  Vic  i  r  cl-sif-a 

SETF  J  /K 

;set  a  HI 

rT.fVKF'  PT.K 

•       it-  m  it"  «3     cV-n^i  1 1  H    Viairia       V>  a  t~i /~t*h tH 

,  uuLpui-a    oiiuuiu  i let vt;   oiicii ly fcitj 

SETF  /J  /K 

;hold  a  HI 

CLOCKF  CLK 

CHECK  JKT  /JKC 

;both  outputs  should  have  held  their  state 

CHECK  /JKT  JKC 

; outputs  should  have  changed 

PT  OPKTT    PT  K 

,-  reset  an  output  that  is  already  LO 

;make  sure  that  the  outputs  didn't  change 

SETF  J  K 

; toggle  from  a  LO 

("VTf  IT    CJ  V 
\~-Li\J\^r\£     t  .  i  if\ 

fiUPf^     TOT     /  TV/"* 

; verify  that  outputs  changed 

SETF  J  /K 

;set  an  output  that  is  already  HI 

CLOCKF  CLK 

CHECK  JKT  /JKC 

;make  sure  the  outputs  didn't  change 

SETF  J  K 

; toggle  from  a  HI 

CLOCKF  CLK 

CHECK  /JKT  JKC 

; verify  that  outputs  changed 

; check  out  S-R  flip-flops 
SETF  /S  /R 
CLOCKF  CLK 
CHECK  /SRT  SRC 

;hold  a  LO 

;both  outputs  should  have  held  their 

state 

SETF  S  /R 
CLOCKF  CLK 
CHECK  SRT  /SRC 

;set  a  HI 

/outputs  should  have  changed 

SETF  /S  /R 
CLOCKF  CLK 
CHECK  SRT  /SRC 

,-hold  a  HI 

;both  outputs  should  have  held  their 

state 

SETF  S  /R 
CLOCKF  CLK 
CHECK  SRT  /SRC 

; set  an  output  that  is  already  HI 
;make  sure  the  outputs  didn't  change 

SETF  /S  R 
CLOCKF  CLK 
CHECK  /SRT  SRC 

; reset  a  LO 

.•outputs  should  have  changed 

CLOCKF  CLK 
CHECK  /SRT  SRC 

; reset  an  output  that  is  already  LO 
;make  sure  that  the  outputs  didn't  change 

TRACE  OFF 

Figure  22.  Complete  Basic  Flip-Flops  Design  File  (Cont'd.) 
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Programmable  logic  devices  (PLDs)  are  used  in  digital  systems 
design  for  implementing  a  wide  variety  of  logic  functions.  These 
logicfunct  ons  range  from  simple  random  logic  replacement  (dis- 
cussed earlier  in  the  Beginner's  Guide  to  PLDs  on  page  2-2)  to 
complex  control  sequencers  (discussed  in  the  subsequent  sec- 
tions). Programmable  logic  devices  offer  the  multiple  advantages 
of  low  cost,  high  integration,  ease  of  use,  and  easier  design 
debugging  capability  not  available  in  other  systems  design  op- 
tions. These  are  described  in  the  introduction  (on  page  1  -1 )  to  this 
handbook.  In  the  following  discussion  we  will  detail  the  PLD 
design  process. 

Most  PLDs  have  an  AND-OR  array  structure  with  programmable 
connections  in  eitherorboth  ofthe  arrays.  Aprogrammable  array 
implies  that  the  connections  can  be  programmed  by  the  user. 
PLDs  are  classified  depending  on  which  of  the  two  arrays  is 
programmable  or  fixed.  The  popular  PAL  (Programmable  Array 
Logic)  devices  have  a  programmable  AND  array  and  a  fixed  OR 
array.  PAL  devices  are  used  for  a  wide  variety  of  combinatorial 
and  registered  logic  functions.  PROM  (Programmable  Read- 
only Memory)  devices — used  often  as  memory  and  seldom  as 
logic — have  a  fixed  AND  array  and  a  programmable  OR  array. 
Both  arrays  are  programmable  in  PLS  (Programmable  Logic  Se- 
quencer) devices.  These  devices  are  used  for  special  state 
machine  applications  discussed  on  page  2-117.  In  addition,  there 
are  other  programmable  logic  devices  that  combine  program- 
mable arrays  with  dedicated  logic  for  providing  optimal  function- 
ality for  a  specific  system  application.  The  PROSE  (PROgram- 
mable  SEquencer)  device  is  one  such  device,  with  its  architec- 
ture optimized  for  state  machine  designs.  A  discussion  of  the 
various  PLD  architectures  is  included  on  page  1  -7.  In  this  discus- 
sion we  will  also  examine  the  various  design  constraints  to  be 
considered  when  selecting  the  correct  architecture  for  a  given 
application. 

All  digital  logic  can  be  efficiently  reduced  to  two  fundamental 
gates,  AND  and  OR,  provided  both  true  and  complement  versions 
of  all  input  signals  are  available.  Such  logic  is  generally  built 
around  what  is  known  as  the  sum-of-products  (AND-OR)  form. 
Please  see  page  6-1  for  details  on  sum-of-products  form  and 
Boolean  logic.  Programmable  logic  devices  are  ideal  for  imple- 
menting such  two-stage  logic  in  the  AND  and  OR  arrays. 

Various  process  technologies  offer  many  design  options  for 
PLDs.  The  connections  in  the  programmable  arrays  can  be  fuse- 
based,  commonly  used  in  both  ECL  and  TTL  bipolar  technolo- 
gies, E/EEPROM  cell-based  in  UV-EPROM  and  EEPROM 
CMOS  technologies  and  RAM  cell-based  in  CMOS  RAMtechnol- 
ogy.  ECL  PLDs  are  used  for  very  high-speed  designs  (greater 
than  125  MHz  bandwidth),  while  CMOS  is  used  for  low-power 
designs.  Bipolar  TTL  fuse-based  PLDs  cover  the  vast  midrange 
of  applications  and  are  the  most  popular  PLDs.  The  selection  of 
technology  is  mostly  dependent  upon  the  system  speed  and 
power  constraints.  Most  design  engineers  are  familiar  with  these 


constraints,  which  not  only  dictate  the  technology  of  PLDs  but  of 
all  other  logic  used  in  a  system.  In  the  following  discussion,  we 
will  assume  TTL  fuse-based  technology  for  simplicity.  A  detailed 
discussion  of  the  technology  is  included  on  page  3-130. 

Designing  with  PLDs  involves  the  use  of  design  software  and  a 
device  programmer  (Figure  1).  The  design  software  eliminates 
the  need  to  identify  every  connection  to  be  programmed  for 
implementing  the  desired  sum-of-products  logic.  The  design 
process  begins  with  the  creation  of  a  design  file  which  specifies 
the  desired  function.  The  function  is  typically  represented  by  its 
sum-of-products  form  and  can  be  derived  directly  from  the  timing 
diagram  and/or  truth  tables.  Occasionally  Karnaugh  maps  and 
state  diagrams  are  also  used.  The  design  file  is  then  assembled 
to  produce  the  "JEDEC"  file.  The  JEDEC  file  gets  its  name  from 
the  fact  that  it  is  an  approved  JEDEC  standard  for  specifying  the 
state  of  every  connection  on  the  device.  Simulation  can  then  be 
performed.  If  the  design  is  correct,  the  JEDEC  file  is  downloaded 
into  a  device  programmer  for  programming  the  connections  on 
the  device.  The  device  can  then  be  plugged  into  the  PC  board 
where  it  will  function.  The  entire  procedure  can  often  be  per- 
formed with  the  designer  never  having  to  leave  the  desk.  Most 
programmers  interface  to  personal  computers,  so  that  the  design 
file  can  be  edited,  assembled,  simulated,  and  downloaded,  and 
the  device  programmed,  all  in  one  place. 


PLD  DESIGN 
SOFTWARE 

JEDEC 
FILE 

 i 

Figure  1.  PLDsare  Designed  using  Software 
and  a  Device  Programmer 
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The  first  stage  in  a  PLD  design  process  (Figure  2)  is  the  concep- 
tualization of  adesign  problem;the  second  is  the  selection  of  the 
correct  device;  the  third  is  the  implementation  of  the  design,  which 
also  includes  simulating  the  design  with  test  vectors;  and  finally, 
the  actual  programming  and  testing  on  a  system  board.  We  will 
take  a  simple  design  example  and  go  through  the  various  stages 
of  this  design  process. 


CONCEPTUALIZE  A 
DESIGN  PROBLEM 


SELECT 
DEVICE 


IMPLEMENT 
DESIGN 


PROGRAM 
PLD 


TEST 
PLD 


PLUG  PLD 
INTO  BOARD 


Figure  2.  Programmable  Logic  Device  Design  Process 


Conceptualizing  a  Design 

The  first  step  in  the  PLD  design  process  is  also  required  for  any 
SSI/MSI  design.  An  advantage  of  PLDs  is  that  at  this  stage  the 
designer  needs  to  be  concerned  only  with  the  required  logic 
function.  With  SSI  or  MSI,  various  device  logic  limitations  must 
be  accounted  for  before  the  design  can  be  started.  Clearly  a 
designer  needs  to  develop  a  brief  and  complete  functional  de- 
scription, based  upon  the  system  design  requirements. 

We  will  take  the  example  of  a  simple  address  decoder  circuit 
required  for  a  68000  microprocessor.  The  microprocessor  has  24 
address  lines  along  with  separate  read  and  write  signals.  It 
requires  some  ROM  to  store  the  boot-up  code  as  well  as  some 
RAM  for  storing  and  executing  programs.  The  purpose  of  the 
address  decoder  circuitry  is  to  select  one  of  the  memory  ad- 
dresses at  a  time.  The  RAMs  and  ROMs  are  assigned  addresses 
on  the  68000  microprocessor  address  space.  The  address 
decoder  circuit  has  to  select  one  of  the  RAMs  or  ROMs  for  a 
specific  range  of  addresses,  called  the  address  space.  This 
selection  is  accomplished  by  asserting  the  specific  chip-select 
signal  for  the  RAM  or  ROM  when  the  microprocessor  accesses 
one  of  the  addresses  in  the  address  space.  There  is  additional 
circuitry  in  a  typical  microprocessor  system  for  addressing  I/O 


PROM  1 


PROM  2 


DRAM  1 
DRAM  2 
DRAM  3 
DRAM  4 


000000-0FFFFF 
100000-1 FFFFF 
200000-2FFFFF 
300000-3FFFFF 
400000^tFFFFF 
500000-5FFFFF 

600000-7FFFFF 


Figure  3.  Memory  Address  Map 


RESET 
BUTTON  . 
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BOTTOM 


RW 

RESET  AS 

A  21.  A  22,  A  23 


M68000 
MICROPROCESSOR 


INTERFACE 
LOGIC 


INIT 

ROMCS1 


PROM1 


ROMCS2 


PROM2 


RAMCS 




2  A22,  A23 


DRAM 
CONTROLLER 


*  DRAM2 


DRAM3 


DRAM4 


READ 
ONLY 


READS 
WRITE 


Figure  4.  Microprocessor  to  Memory  Interface 
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devices  (such  as  disk  controllers).  These  devices  also  require 
that  chip-select  signals  be  asserted  when  the  microprocessor 
addresses  them.  Figure  3  shows  an  example  address  map  for  a 
68000  microprocessor. 

Figure4showsthecircuitdiagram.  Theaddresssignalsfromthe 
68000  microprocessor  are  inputs  tothe  interface  logic  block.  The 
outputs  generated  are  ROMCS1 .  ROMCS2,  and  RAMCS.  The 
generation  of  signals  for  selecting  device  l/Os  is  similar  and  is  not 
shown  here  for  the  sake  of  simplicity.  Other  system  inputs  to  the 
interface  are  the  address  strobe  signal  generated  by  the  68000 
microprocessor  as  well  as  the  read/write  signal.  The  truth  table 
for  generating  the  outputs  is  shown  in  Figure  5.  This  truth  table 
is  derived  from  the  memory  address  map  and  the  functional  de- 
scription of  the  design. 


ADDRESSES 
HEX 


000O0-0FFFF 
1 0000-1 FFFF 
20000-2FFFF 
30000-3FFFF 
40000-4FFFF 
50000-5FFFF 


SIZE 


1  Mbytes 
1  Mbytes 
1  Mbytes 
1  Mbytes 
1  Mbytes 
1  Mbytes 


A23 


A22 


A21 


SIGNAL 


ROMCS1 


ROMCS2 


RAMCS 


RAMCS 


RAMCS 


RAMCS 


Figure  5.  Truth  Table  for  Chip-Select  Signals 

Device  Selection  Considerations 

Thefirsttaskforthe  designer  isto  identify  the  design  problem  and 
classify  it  as  a  combinatorial  function  or  a  registered  function, 
depending  upon  whether  or  not  registers  are  required.  In  most 
cases,  this  decision  depends  upon  the  functional  nature  of  the 
problem.  Sometimes  timing  and  logic  considerations  can  also 
dictate  the  use  of  registers;  this  will  be  discussed  later.  Registers 
are  usually  not  required  for  such  simple  combinatorial  functions 
such  as  encoders,  decoders,  multiplexers,  demultiplexers,  ad- 
ders, and  comparators.  Registers  are  required,  however,  for 
functions  such  as  counters,  timers,  control  signalgeneration,  and 
state  machines.  No  registers  are  required  for  this  simple  address 
decoding  example. 

A  brief  look  at  the  places  where  various  architectural  features  are 
used  will  help  us  select  a  device  for  our  design  example.  Based 
on  historic  record  of  usage  and  architectural  functionality,  PLDs 
can  be  classified  into  three  basic  application  segments.  Figure  6 
shows  the  optimal  placement  of  PAL,  PLS,  and  PROSE/FPC 
(Fuse  Programmable  Controller)  devices  for  these  three  seg- 
ments; combinatorial  designs,  simple  registered  designs  and 
complex  registered  (sequencer)  designs.  This  classification  also 
shows  the  available  functionality  and  device  speeds.  These  are 
rough  estimates  that  most  designers  implicitly  assume  and  de- 
velop with  experience.  These  estimates  vary  drastically  among 
different  designers,  based  on  personal  preferences  and  experi- 
ence. They  have  been  included  hereto  provide  a  complete  road 
map  for  device  selection  and  should  be  considered  strictly  as  a 
rule  of  thumb. 


COMBINATORIAL 


-  FUNCTIONALITY  - 


SIMPLE 
REGISTERED  SPEED 
DESIGNS 


FUNCTIONALITY  - 


STATE  MACHINES/ 
SEQUENCERS, 

COMPLEX  SPEED 
REGISTERED 
DESIGNS 


-  FUNCTIONALITY  - 


Figure  6.  Various  PLD  Design  Options 


Based  on  these  criteria,  the  best  choice  for  our  combinatorial 
design  would  be  a  PALdevice.  The  task  now  is  to  select  a  PAL 
device  for  implementing  the  desired  function.  The  main  function- 
specific  selection  considerations  are  shown  in  Figure  7.  These 
items  are  applicable  to  most  designs.  There  are  other  function- 
specific  issues  which  need  addressing  when  selecting  a  device; 
these  will  be  discussed  in  the  latersectionsof  the  book,  when  we 
address  advanced  designs. 

1.  Number  of  Input  pins 

2.  Number  of  output  pins 

3.  Number  of  I/O  pins 

4.  Device  speed 

5.  Device  power  requirements 

6.  Number  of  registers  (If  any) 

7.  Number  of  product  terms 

8.  Output  polarity  control 

Figure  7.  General  Device  Selection  Considerations 

The  first  resource  that  must  be  provided  in  a  PLD  is  the  number 
of  pins  needed  for  the  basic  logic  function.  This  consists  of  the 
number  of  input  and  output  pins.  Many  PLDs  have  internal 
feedback,  which  allows  the  generated  output  signal  to  be  reused 
as  an  input.  The  same  feedback  also  allows  the  pin  to  be  used 
as  a  dedicated  input,  if  required.  This  is  especially  useful  for  fitting 
various  designs  with  different  input/output  requirements  on  the 
same  device.  The  I/O  pin  capability  of  certain  PLDs  can  also  be 
very  usefulforcertain  bus  applications  and  will  be  discussed  later. 


m 
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Figure  8  and  9  show  the  pin  resources  available  on  various 
combinatorial  and  registered  PLDs.  The  task  is  as  simple  as 
counting  the  number  of  input,  output  and  I/O  pins  required  by  the 
design  and  picking  a  PLD  from  the  table  which  has  the  requisite 
number  of  pins.  We  will  pick  a  PAL1 6L8  for  our  design,  since  it 
has  sufficient  pin  resources. 


The  next  selection  issue  is  the  device  speed.  The  most  important 
timing  consideration  for  combinatorial  PLDs  is  the  propagation 
delay  (tPD)  of  signals  from  the  input  to  the  output  of  the  device.  For 
registered  PLDs,  the  importanttiming  consideration  is  the  device 
clocking  frequency.  This  clocking  frequency  (shown  in  Figure  9) 
is  in  turn  determined  by  sum  of  the  register  setup  time  (t  J,  and 
clock-to-output  propagation  delay  (tCLK).  Most  systems  impose 
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DEDICATED 

DEDICATED 

I/O 

LOGICAL 

SPEED GRADES 

NUMBER 

INPUT 

OUTPUT 

PINS 
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tPDINns 

OF  PINS 

PINS 

PINS 
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20 

10 

8 

- 

2 

35 
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20 

10 

8 

- 

2 

35 
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20 

12 

6 
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35 
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20 

12 

6 

- 

2-4 

35 

14H4 

20 

14 

4 

- 

4 

35 

14L4 

20 

14 

4 

- 

4 

35 

16H2 

20 

16 

2 

- 

8 

35 

16L2 

20 

16 

2 

- 

8 

35 

16C1 

20 

16 

2 

- 

16 

40 
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20 

10 

2 

6 

7 
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20 

10 

2 

6 

7 

25 
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20 

8 
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20 

10 

8 

8 
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24 

6 

16 

1 

25 
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24 
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14 
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25 
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40 
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24 

14 

8 

2-4 

40 
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24 

16 

6 
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24 

18 

4 
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40 
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24 

20 
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8 

40 
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24 

20 

2 

16 

40 
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7 
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6 

Figure  8.  Combinatorial  PLDs  with  Pin  Resources  and  Speed  Requirements 
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DE 

.VICE 

TOTAL 

DEDICATED 

DEDICATED 

I/O 

LOGICAL 

SPEED  GRADES 

NUMBER 
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REGISTERED 

PINS 

PRODUCT 

FREQUENCY  IN 

OF  PINS 

PINS 
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TERMS/ 
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Figure  9.  Registered  PLDs  with  Pin  Resources  and  Speed  Requirements 
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Figure  10.  System  Timing  Requirements 


some  timing  restrictions  on  the  internal  logic  functions.  These 
restrictions  will  determine  the  necessary  tPD  (for  combinatorial 
devices)  or  f^  (for  registered  devices).  Details  on  clocking 
frequency  are  provided  on  page  2-64. 

In  our  design  example,  the  PLD  will  primarily  perform  address 
decoding.  The  critical  system  timing  constraint  is  determined  by 
the  read/write  cycle  time  of  the  microprocessor  and  the  memory 
access  time  available  (Figure  1 0).  Most  microprocessors  allow 
anywhere  from  10  to  35  ns  for  address  decoding.  That  is,  10-35 
ns  after  the  address  is  available,  the  correct  memory  chip-select 
signal  should  be  asserted.  In  our  design  example  the  available 
cycle  time  of  240  ns  and  memory  access  time  of  220  ns  leaves 
barely  10  ns  for  address  decode  time.  From  the  table  in 
Figure  8,  we  can  check  the  propagation  delay  and  select  the 
appropriate  speed  of  PAL1 6L8  for  our  design,  which  is  tpo=1 0  ns. 

We  have  already  briefly  discussed  the  types  of  applications 
where  registers  are  needed.  Sometimes  the  consideration  of 
system  timing  can  affect  whether  or  not  registers  are  needed. 
Devices  with  registers  can  hold  a  signal  stable  for  the  long 
durations  required  by  the  addressed  peripheral  or  memory. 
However,  this  slows  the  initial  response  or  access  time  of  the 
device  since  the  chip  select  must  wait  for  the  setup  time  beforethe 
rising  edge  of  the  clock  cycle.  Devices  without  registers  provide 
fast  access  time  but  hold  the  signal  valid  only  as  long  as  the  input 
conditions  are  valid.  In  most  address  decoders,  the  address 
signals  are  kept  asserted  by  the  microprocessor  until  the  read/ 
write  cycle  is  completed.  In  this  case,  the  registers  are  not 
required  for  holding  the  signals  asserted. 

The  remaining  two  general  design  considerations  arethe  number 
of  product  terms  and  output  polarity.  We  will  discuss  these  two  as 
we  implement  the  design  in  the  next  section. 

Implementing  a  design 

Implementing  a  design  (Figure  11)  requires  the  creation  of  a 
design  file.  This  design  file  is  called  the  PAL  device  Design 
Specification  (PDS)  in  the  PALASM  2  software  package.  Details 
of  PALASM  2  software  can  be  found  in  section  4  of  this  handbook. 
The  design  file  contains  three  types  of  information. 

1 .  Basic  bookkeeping  information 

2.  Design  syntax 

3.  Simulation  syntax 


The  first  section  provides  documentation  information,  along  with 
the  name  of  the  device  selected,  and  the  signal  names.  All  of  the 
signal  names  are  assigned  to  pins  and  are  defined  as  inputs  or 
outputs.  The  Declaration  Section  contains  the  signal  names  in 
the  ascending  order  of  pin  numbers.  Figure  12  shows  the  pin 
assignments  of  the  signals  used  in  our  example.  Signals  which 
are  used  as  l/Os  are  assigned  as  outputs  and  are  used  as  both 
inputs  and  outputs  in  the  sum-of-products  logic  explained  later. 

Once  the  design  file  is  complete,  it  is  then  assembled  and 
simulated.  Once  it  passes  assembly  and  simulation,  the  resultant 
JEDECfile  is  downloaded  to  adeviceprogrammerfor  configuring 
the  device. 


SELECT  DEVICE 


CREATE  DESIGN  FILE 
WRITE  BOOLEAN 
LOGIC  EQUATIONS 


1 


CREATE  DESIGN  FILE 
WRITE  STATE 
MACHINE 


ASSEMBLE 
DESIGN  FILE 


I  


PROGRAM  PLD 


 1 

404  11 


Figure  11.  Implementing  a  Design 
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Title  M68000_ADDRESS_DECODER 
Pattern  P8000 
Revision  A 
Author  ENGINEER 

Company  MMI  SUNNYVALE,  CALIFORNIA 
DATE  07/21/87 


CHIP  DECODER  PAL16L8 

NC     A23  A22  A21       TOP     BOTTOM     AS     RW     NC  GND 

OE     ROMCS1     ROMCS2     RAMCS   INIT  NC  NC     NC     NC  VCC 

Figure  12.  Declaration  Section  of  a  Design  File 


Design  Syntax 

As  shown  in  Figure  11,  there  are  two  options  available  to  the 
designerfor  expressing  the  design.  The  first  is  through  traditional 
Boolean  logic  equations;  the  second  is  through  a  state  machine 
syntax.  The  Boolean  logic  equations  are  the  only  option  for 
combinatorial  designs  and  can  also  be  efficient  for  some  regis- 
tered designs.  The  Boolean  equations  can  be  derived  from  a 
combination  of  the  functional  description,  the  truth  table  and/or 
the  timing  diagrams  (Figure  13).  The  state  machine  approach  is 
ideal  for  large  registered  control  designs,  and  can  be  derived  from 
the  functional  description,  state  table,  state  diagram  and/or  the 
timing  diagram  (Figure  1 4).  The  latter  approach  can  also  be  used 
with  PROSE  and  PLS  devices,  and  will  be  discussed  in  a  later 
section  on  state  machine  design. 

Boolean  Logic  Equations 

Boolean  equations  are  used  to  represent  the  sum-of-products 
logic  form  (see  page  6-1  for  a  logic  design  review).  The  Boolean 
equations  are  ideally  suited  for  representing  the  two-level 
AND-OR  logic  available  in  most  PLDs. 

A  conventional  approach  to  the  design  is  to  convert  the  design 
problem  to  its  discrete  logic  implementation.  Such  random  SSI 
and  MSI  logic  can  be  easily  implemented  in  PLDs  (see  the 
Beginner's  Guide,  page  2-2).  This  usually  involves  converting  to 
sum-of-products  Boolean  logic  form.  This  approach  can  be  a 
chore,  and  much  effort  can  be  saved  by  implementing  a  design 
with  PLDs  in  a  sum-of-products  form  right  from  the  start.  This 
essentially  means  that  the  designer  does  not  have  to  design 
around  the  limitations  of  fixed  SSI  and  MSI  functions.  A  direct 
implementation  of  a  design  in  sum-of-products  form  in  a  PLD  can 
also  yield  a  faster  circuit. 

Boolean  equations  can  be  directly  derived  from  the  truth  table  or 
timing  diagram  (Figure  13).  The  truth  table  is  used  more  often  in 
simple  combinatorial  designs.  The  timing  diagram  method  is 
used  more  often  in  registered  control  designs.  We  will  first 
discuss  the  truth  table  method  and  then  discuss  the  details  of  the 
timing  diagram  method.  The  state-machine-based  registered 
design  approach  will  be  discussed  on  page  2-101. 


In  addition  to  specifying  the  logic  function,  the  Boolean  equations 
in  the  design  file  help  document  the  design.  There  is  no  need  to 
drawoutan  equivalent  schematic.  This  allows  design  modularity; 
the  schematic  can  just  show  a  block  for  a  particular  PLD. 
Separate  supporting  documentation  (the  design  file)  provides  the 
details  without  cluttering  the  drawing. 

Truth-Table-Based  Design 

The  requirements  for  our  particular  design  example  can  be  easily 
converted  to  a  truth  table  format  (Figure  15).  This  truth  table  is 
based  upon  the  functional  description  of  the  design,  and  is 
derived  from  the  address  map  (Figure  3)  and  the  truth  table 
(Figure  5). 
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Figure  13.  Writing  Boolean  Logic  Equations 
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Figure  14.  Stats  Machine  Dsecription 
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Figure  15.  Truth  Table  for  the  Address  Decoder 


There  are  three  additional  input  signals  in  this  design  example. 
The  first,  RW,  is  generated  by  the  microprocessor,  and  distin- 
guishes between  read  and  write  cycles.  Since  the  ROM  data  is 
only  for  reading,  the  ROMCS1  and  ROMCS2  signals  are  asserted 
only  when  RW  is  high  (when  the  microprocessor  attempts  to  read 
the  ROM)  and  are  not  asserted  for  the  write  cycle.  On  the  other 
hand,  RAMCS  is  generated  for  both  read  and  write  cycles  and  the 
state  of  signal  RW  is  "don't  care". 

The  second  additional  signal,  AS,  is  the  address  strobe  signal 
generated  by  the  microprocessor,  and  is  asserted  only  when  the 
address  lines  carry  a  valid  address.  All  of  the  chip  select  signals 
need  to  be  gated  with  the  AS  signal  to  ensure  that  they  are  only 
generated  for  valid  addresses,  and  no  spurious  chip  selects  are 
generated. 

The  last  signal  is  the  INIT  signal,  which  is  a  system  initialization 
signal.  This  signal  is  used  to  initialize  the  microprocessor  for  a 
"warm  boot,"  and  none  of  the  chip  selects  is  allowed  when  this 
INIT  signal  is  asserted. 

Writing  Boolean  equations  from  the  above  logic  is  very  straight- 
forward. The  output  signal  names,  along  with  their  polarity,  are 


assigned  to  sum-of-product  equations,  which  are  based  upon 
inputs  and  their  polarities. 


/R0MCS1  -  /A23 
/ROMCS2  -  /A23 
/RAMCS     =  /A23 
+  /A23 
+  A23 
+  A23 


/A22 
/A22 
A22 
A22 
/A22 
/A22 


/A21 
A21 

/A21 
A21 

/A21 
A21 


INIT  *  /AS 
INIT  *  /AS 
INIT  *  /AS 
INIT  *  /AS 
INIT  *  /AS 
INIT  *  /AS 


RW 
RW 


Figure  16.  The  Implementation  in  Boolean  Equations 


The  equations  are  derived  directly  from  the  truth  tables.  Each  one 
of  the  AND  equations  uses  up  one  product  term  of  the  device  as 
shown  in  Figure  16.  One  device  selection  consideration  is  to 
ensure  that  all  the  outputs  have  sufficient  product  terms  to 
accommodate  the  desired  function.  The  device  we  have  selected 
for  this  design  is  the  popular  PAL16L8D.  This  device  has 
sufficient  inputs,  outputs,  and  product  terms,  and  has  the  requi- 
site speed  for  the  design. 
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This  brings  us  to  the  issue  of  output  polarity.  Suppose  we  had  to 
generate  active-HIGH  outputs.  In  that  case  the  output  equations 
for  the  ROMCS1  signal  would  be  : 

ROMCSl  -  /A23  *  /A22  *  /A21  *  INIT  *  /AS  *  RW 


PAL 


As  the  PAL16L8  has  active-LOW  outputs  only,  this  equation's 
output  polarity  needs  to  be  inverted  to  be  able  to  fit  the  device. 
Using  DeMorgan's  theorem  for  Boolean  logic  we  get: 

/ROMCSl  =  A23  +  A22  +  A21  +  /INIT  +  AS  +  /RW 

This  equation  requires  a  large  number  of  product  terms  (six). 
Some  signals  are  efficient  and  use  fewer  product  terms  in  their 
true  form,  while  others  are  more  efficient  in  their  inverted  form. 
This  poses  no  problems  until  the  signals  exceed  the  device  limit 
of  8  product  terms  provided  by  the  PAL1 6L8  and  other  standard 
PALdevices.  In  such  cases,  an  output  polarity  control  is  required. 
The  PAL16RP8  family  provides  just  the  control  for  fitting  the 
required  number  of  product  terms  on  the  device.  The  device 
selection  issues  of  product  terms  and  output  polarity  also  apply  to 
registered  designs.  A  detailed  discussion  of  output  polarity  is 
included  on  page  6-15. 

Timing-Diagram-Based  Design 

Until  now,  we  have  discussed  a  PLD  design  using  truth  tables  as 
the  primary  design  vehicle.  In  this  section  we  will  attempt  adesign 
using  a  timing  diagram  as  a  design  vehicle. 

Earlier  in  the  address  decoder  design  we  mentioned  the  INIT 
signal.  This  INIT  signal  is  essentially  an  initialization  signal  forthe 
entire  system,  and  can  be  generated  by  the  PAL1 6L8.  Note  that 
the  PAL1 6L8D  used  for  decoding  still  has  several  unused  outputs 
left,  one  of  which  can  be  used  to  generate  the  INIT  signal.  The 
INIT  signal  is  used  internally  (via feedback)  for  disabling  the  chip 
selects  during  initialization.  Externally  it  can  be  used  to  initialize 
other  system  signals.  This  INITsignal  is  generated  from  a  RESET 
switch  connected  to  the  inputs  of  the  PAL1 6L8  device  as  shown 
in  Figure  17. 

To  avoid  unwanted  initialization,  the  RESET  switch  must  be 
debounced.  That  is,  we  want  the  INITsignal  to  remain  HIGH  until 
the  switch  actually  contacts  the  bottom  side.  Once  the  bottom 
side  is  hit,  INIT  should  be  asserted  active  LOW.  Once  asserted, 
it  should  stay  LOW  and  not  change  until  the  top  side  is  hit  again. 
The  timing  requirements  of  the  debounce  circuitry  are  shown  in 
Figure  18.  Signals  TOP  and  BOTTOM  are  inputs  to  the  program- 
mable logicdevice.  These  signals  are  activated  when  the  RESET 
switch  touches  the  top  and  the  bottom  contacts,  respectively. 


vcc 


PAL  DEVICE 
DEBOUNCE 
CIRCUIT 

PAL16L8 


Figure  17.  RESET  Switch  for  System  Initialization 


We  can  formulate  the  equations  by  looking  at  the  timing  require- 
ments of  the  debounce  circuitry  shown  in  Figure  18.  The  idea  is 
to  identify  the  key  elements  of  this  timing  diagram.  The  arrows  in 
Figure  1 8  show  the  critical  events.  The  first  arrow  shows  the 
normal  state  of  all  the  pins  when  the  RESET  switch  is  not 
asserted.  Subsequent  arrows  show  each  event  in  the  timing  of 
the  INIT  signal,  depending  upon  the  movement  of  the  switch. 


BOTTOM 


f,  i.  f,  t,  f. 

Figure  18.  Timing  Diagram  for  the  Debounce  Switch 

The  logic  level  of  the  signals  at  each  critical  event  carries  useful 
logic  information  for  deriving  Boolean  equations.  This  logic 
information  for  each  event  is  converted  into  direct  Boolean 
equations  as  shown  in  Figure  19  below.  For  example,  at  instant 
1  the  INITsignal  remains  HIGH  as  long  as  the  TOP  signal  remains 
LOW;  this  is  converted  to  INIT  =  /TOP  *  BOTTOM. 


1.   Normal  state 


INIT  -  /TOP 


2.  Switch  travels  INIT 
from  TOP  to  BOTTOM 


TOP  *  BOTTOM  *  INIT 


3.  Switch  contacts 
BOTTOM 


/INIT  =  /BOTTOM 


4.  Switch  travels  /init  =  /init  *  bottom  *  top 
form  BOTTOM  to  TOP 

5.  Normal  State  Again 

Figure  19.  Boolean  Logic  at  Every  Instant  of  Timing 


Since  we  are  working  with  an  active-LOW  device,  we  can  com- 
bine the  two  active-LOW  events  into  one  equation: 

/INIT  a  /BOTTOM 

+  /INIT  *  BOTTOM  *  TOP 

Minimizing,  this  becomes: 

/INIT  =  /BOTTOM 

+  /INIT  *  TOP 

This  can  also  be  done  by  way  of  a  truth  table  and  Karnaugh  map. 
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Figure  20.  Truth  Table  of  IN  IT  logic 


Here  TOP  or  BOTTOM  will  be  LOW  rl  contacted.  Note  that  both 
TOP  and  BOTTOM  can  not  be  contacted  at  the  same  time.  The 
truth  table  of  Figure  20  yields  the  Karnaugh  map  shown  in  Figure 
21 .  Grouping  the  zeros  (because  the  PAL1 6L8  offers  active-LOW 
outputs)  yields  the  Boolean  equation  identical  to  the  one  derived 
from  the  timing  diagram. 


then  checked  for  an  appropriate  response.  The  simulation  test 
vectors  identify  any  flaws  in  the  design  equations  which  could 
affect  the  logical  operation  of  the  devices  programmed.  Thus  the 
simulation  vectors  serve  as  a  design  debugging  tool. 

Simulation  test  vectors  will  eventually  make  up  part  of  a  larger  set 
of  test  vectors  called  "functional  test  vectors".  These  functional 
test  vectors  are  used  to  exercise  a  real  device  after  programming 
to  identify  any  individual  devices  which  are  defective.  Other 
means  of  identifying  defective  devices,  such  as  signature 
analysis,  are  also  available.  All  of  these  are  discussed  in  detail  on 
page  3-128.  In  this  section  we  will  strictly  focus  on  simulation 
vectors. 

Simulation  is  included  in  the  design  file  along  with  the  logic 
equations.  There  is  little  standardization  in  these  simulation 
expressions  among  various  PLD  design  software  packages, 
although  most  of  them  rely  on  test  vectors  to  exercise  the  logic. 
PALASM  2  software  offers  an  advanced  event-driven  capability 
that  retains  a  closer  design  perspective  than  simulation  vectors. 
Details  of  PALASM  2  simulation  syntax  are  included  on  page 
4-137. 


/RESET 


/TOP 
/BOTTOM 

00     01     11  10 


1 


1 


The  simulation  vectors  or  events  can  be  directly  derived  from  the 
truth  table  and  the  timing  diagram  of  the  design.  The  logic  level 
and  functions  of  all  signals  can  be  expanded  and  rewritten  in  atest 
vector  form  by  the  software.  For  example,  the  truth  table  for  the 
address  decoder  example  discussed  earlier  can  be  easily  rewrit- 
ten as  shown  in  Figure  23. 


Figure  21.  Karnaugh  Map  of  INIT  Signal  Logic 

There  is  essentially  no  difference  between  the  truth  table  and 
timing  diagram  techniques  for  writing  Boolean  logic.  A  careful 
analysis  will  indicate  that  we  implicitly  assumed  atruth  table  in  the 
timing  diagram  example  also.  Some  designers  prefer  to  make  a 
separate  truth  table  (at  least  in  the  first  few  PLD  designs),  while 
others  prefer  to  design  directly  from  timing  diagrams.  While  the 
truth  table  method  allows  a  more  optimal  utilization  of  product 
terms,  the  timing  diagram  method  is  easier  to  visualize  as  it 
retains  the  design  perspective.  In  both  cases  the  logic  should  be 
minimized  by  the  design  software  to  ensure  that  the  design  is 
testable.  Refer  to  page  3-1 08  for  a  discussion  on  testability. 

Most  experienced  designers  understand  the  tradeoffs  for  device 
selection.  They  implicitly  go  through  the  steps  of  design  concep- 
tualization and  device  selection,  explained  earlier.  They  typically 
draw  a  block  around  the  logic  being  designed,  with  the  previous 
knowledge  that  it  would  fit  a  PLD  which  has  sufficient  inputs, 
outputs,  l/Os  and  product  terms.  Now  that  we  have  selected  a 
device  and  implemented  the  design,  the  next  step  is  design 
simulation. 

Simulation 

Design  simulation  is  an  integral  part  of  the  design  process,  as 
shown  in  Figure  22.  The  purpose  is  to  exercise  all  of  the  inputs 
and  test  the  response  of  outputs  to  verify  that  they  will  work  as 
desired  in  the  system.  These  are  essentially  test  vectors  which 
designate  the  state  of  every  input  on  the  device;  the  outputs  are 
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Figure  22.  Device  Simulation  and  Programming 
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Figure  23.  Truth  Table  Used  to  Derive  Simulation  Vectors 

These  are  essentially  the  simulation  vectors  which  will  allow  us  to 
define  the  inputs  to  the  device  and  check  the  outputs  of  the 
device.  The  PALASM  2  syntax  for  these  simulation  vectors  is 
shown  in  Figure  24. 

There  is  a  direct  relationship  between  the  truth  table  vectors  of 
Figure  23  and  the  simulation  shown  in  Figure  24.  The  simulator 
then  interprets  the  design  file  and  generates  the  output  logic 
levels  and/or  waveforms,  which  can  be  checked  by  the  designer. 


Once  the  simulation  is  complete,  the  design  file  can  be  as- 
sembled to  generate  the  JEDEC  file.  In  the  preceding  discus- 
sions we  have  assumed  prior  knowledge  of  the  design  file 
assembly.  The  procedure  for  assembly  varies  with  different 
software  packages  and  is  covered  for  the  PALASM  2  software 
package  in  the  Beginner's  Guide  (page  2-6). 
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SETF  /A23  /A22  /A21  /TOP  BOTTOM  AS  RW 
CHECK  R0MCS1  ROMCS2  RAMCS  INIT 

SETF  /AS  ;  READ  CYCLE  VALID  ROMCS1  ADDRESS 

CHECK  ROMCS1  ROMCS2  RAMCS  INIT 
SETF  AS 

SETF  /RW  /AS  ;  WRITE  TO  ROMCS1  NOT  POSSIBLE 

CHECK  ROMCS1  ROMCS2  RAMCS  INIT 
SETF  RW  AS 


SETF  /A23  /A22  A21       ;   SET  UP  ROMCS2  ADDRESS 

SETF  /AS  ;   READ  CYCLE  VALID  ROMCS2  ADDRESS 

CHECK  ROMCSI  ROMCS2  RAMCS  INIT 

SETF  AS 

SETF  /RW  /AS  ;  WRITE  TO  ROMCS2  NOT  POSSIBLE 

CHECK  ROMCSI  ROMCS2  RAMCS  INIT 
SETF  RW  AS 


SETF  /A23  A22  /A21       ;  SET  OP  RAMCS  ADDRESS 

SETF  /AS  ;  READ  CYCLE  VALID  RAMCS  ADDRESS 

CHECK  ROMCSI  ROMCS2  RAMCS  INIT 

SETF  AS 

SETF  /RW  /AS  ;  WRITE  CYCLE  VALID  RAMCS  ADDRESS 

CHECK  ROMCSI  ROMCS2  RAMCS  INIT 
SETF  RW  AS 


SETF  /A23  A22  A21         ;   SET  UP  RAMCS  ADDRESS 

SETF  /AS  ;   READ  CYCLE  VALID  RAMCS  ADDRESS 

CHECK  ROMCSI  ROMCS2  RAMCS  INIT 

SETF  AS 

SETF  /RW  /AS  ;  WRITE  CYCLE  VALID  RAMCS  ADDRESS 

CHECK  ROMCSI  R0MCS2  RAMCS  INIT 
SETF  RW  AS 


SETF  A23  /A22  /A21       ;   SET  UP  RAMCS  ADDRESS 

SETF  /AS  ;   READ  CYCLE  VALID  RAMCS  ADDRESS 

CHECK  ROMCSI  ROMCS2  RAMCS  INIT 

SETF  AS 

SETF  /RW  /AS  ;  WRITE  CYCLE  VALID  RAMCS  ADDRESS 

CHECK  ROMCSI  ROMCS2  RAMCS  INIT 
SETF  RW  AS 


SETF  A23  /A22  A21         ;   SET  UP  RAMCS  ADDRESS 

SETF  /AS  ;  READ  CYCLE  VALID  RAMCS  ADDRESS 

CHECK  ROMCSI  ROMCS2  RAMCS  INIT 

SETF  AS 

SETF  /RW  /AS  ;  WRITE  CYCLE  VALID  RAMCS  ADDRESS 

CHECK  ROMCSI  ROMCS2  RAMCS  INIT 
SETF  RW  AS 


SETF  TOP  ;  RESET  SWITCH  TRAVELING  DOWN 

SETF  /BOTTOM  ;  RESET  HITTING  BOTTOM 

CHECK  ROMCSI  ROMCS2  RAMCS  INIT 

SETF  BOTTOM  ;  RESET  TRAVELING  TO  TOP 

CHECK  ROMCSI  ROMCS2  RAMCS  INIT 
SETF  TOP  ;   RESET  HITTING  TOP 

CHECK  ROMCSI  ROMCS2  RAMCS  INIT 


Figure  24.  PALASM  2  Software  Simulation  for  Address  Decoder 


PLD  Design  Methodology 


DOWNLOAD  JEDEC 
FILE  TO  DEVICE 
PROGRAMMER 


r   1 


PROGRAM  PLD 
FUSE  MATRIX 


VERIFY  FUSE  MATRIX 


I  


  I 


PROGRAM 
PLD 


TEST  PLD  WITH 
SIMULATION  AND 
OTHER  TEST  VECTORS 


PROGRAM  SECURITY 
FUSE  IF  DESIRED 


l  


PERFORMED  BY 
DEVICE  PROGRAMMERS 


TEST 
PLD 


USE  ON  A  BOARD 


Figure  25.  Device  Programming  and  Testing 


Device  Programming  and  Testing 

Once  the  design  simulation  is  completed,  the  final  step  is  device 
programming  and  testing  (Figure  25).  Programmers  are  avail- 
able from  a  variety  of  vendors.  It  is  important  to  note  that 
Monolithic  Memories  qualifies  programmers  upon  verifying  that 
the  algorithms  used  by  the  programmers,  are  correct  and  that 
other  basic  criteria  are  met.  When  purchasing  a  programmer, 
check  that  the  programmer  is  qualified  forthe  devices  you  intend 
to  use. 

There  are  two  types  of  programmer  available;  menu-driven  or 
device  code  based.  The  menu-driven  programmer  directly  indi- 
cates the  part  type  being  programmed,  whereas  the  latter  type 
requires  the  user  to  enter  the  device  code  before  programming. 


Oncethe  JEDECfusefile  has  been  downloaded,  the  programmer 
can  program  the  device;  the  PLD  is  then  ready  for  use.  The 
programmer  also  verifies  the  connections  after  the  programming 
cycle.  Programmers  also  provide  the  capability  of  reading  a 
previously  programmed  device  and  creating  duplicates  of  that 
device. 

Testing  PLDs 

The  testing  of  PLDs  can  be  performed  by  the  device  programmer 
or  by  other  test  equipment.  For  a  manufacturing  environment 
where  high  yields  are  required,  device  testing  is  critical.  This 
subject  merits  a  separate  discussion  and  has  been  included  on 
page  3-128.  After  testing  is  complete,  the  device  security  cells 
may  be  programmed  if  desired  to  secure  the  design  from  copying. 
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In  this  section  we  will  take  a  detailed  look  at  several  aspects  of 
combinatorial  logic  design.  Most  combinatorial  design  applica- 
tions can  be  easily  segmented  into  five  major  fields. 

•  Encoders  and  Decoders 

•  Multiplexers 

•  Comparators 

•  Adders  and  Arithmetic  Logic 

•  Latches 

We  will  not  only  focus  on  the  design  methodology  for  these  func- 
tions, but  will  also  explore  furtherfunction-specific  PLD  selection 
requirements.  Generalized  designs  will  be  developed,  which  can 
be  customized  later  to  suit  specific  system  applications.  Ways  of 
optimizing  the  design  will  also  be  discussed. 

Encoders  and  Decoders 

Two  of  the  most  important  functions  required  in  digital  design  are 
encoding  and  decoding.  The  encoding  and  decoding  of  data  are 
used  extensively  in  digital  communications  as  well  as  in  periph- 
erals. Both  these  areas  use  various  complex  encoding  and  de- 
coding techniques,  which  are  described  in  more  detail  on  pages 
2-357  and  2-507.  Most  of  these  techniques  are  extensions  of  the 
simple  encoding  and  decoding  techniques  often  used  in  other 
digital  designs.  In  this  discussion  we  will  focus  on  simple  encod- 
ing and  decoding  techniques. 

Encoders 

A  binary  code  of  n  bits  can  be  used  to  represent  2"  distinct  pieces 
of  coded  data.  A  simple  combinatorial  encoder  is  a  circuit  which 
generates  n  bits  of  output  information  based  upon  one  of  the  2" 
unique  pieces  of  input  data  information.  This  encoding  of  infor- 
mation is  controlled  by  other  independent  control  signals  in  a 
typical  digital  circuit. 

An  illustration  of  a  typical  encoder  is  shown  in  Figure  1.  The 
design  methodology  typically  followed  is  based  on  truth  tables 
(Figure  2),  from  which  the  Boolean  equations  are  directly  derived 
for  the  design.  The  same  genericdevice  selection  considerations 
discussed  in  the  section  on  PAL  device  design  methodology 
apply  for  encoder  and  decoder  designs. 


INPUTS  < 


ENCODED 
OUTPUTS 


Figure  1.  A  Block  Diagram  of  an  Encoder 
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Figure  2.  Truth  Table  of  a  Typical  Encoder 

The  Boolean  equations  can  then  be  optimized  using  Karnaugh 
maps  or  the  PALASM  2  software  minimizer. 

The  resulting  Boolean  equations  are: 


ci  =  /a  *    b  *  /c  *  /D 
+  /A  *  /B  *  /C  *  D 


CO  =  /A  *  /B  *     C  *  /D 
+  /A  *  /B  *  /C  *  D 


The  complete  PALASM  2  design  file  (including  the  simulation)  for 
this  simple  encoder  design  is  shown  in  Figure  3. 

A  Priority  Encoder 

Let  us  take  another  look  at  the  encoder  example  of  Figure  2.  In 
this  example  it  is  assumed  that  only  one  of  the  inputs  A,  B,  C  or 
D  is  asserted  HIGH  at  any  one  time.  If  two  of  the  inputs  are 
asserted  HIGH  simultaneously,  a  conflict  would  be  created.  To  re- 
solve this,  a  priority  needs  to  be  assigned  to  each  of  the  inputs. 
Such  a  priority  assignment  is  used  to  select  a  particular  element 
when  several  inputs  are  asserted  simultaneously.  Each  input  is 
assigned  a  priority  with  respect  to  the  other  inputs.  The  output 
code  generated  is  the  code  assigned  to  the  highest  priority 
input  asserted. 

Thus,  a  priority  encoder  is  a  combinatorial  circuit  block  similar  to 
a  general  encoder,  except  that  the  inputs  are  assigned  a  priority. 
Such  priority  encoders  are  used  often  in  state  machine  applica- 
tions, where  they  detect  the  occurrence  of  the  highest  priority 
event.  They  are  also  used  for  microprocessor  interrupt  control- 
lers, where  they  detect  the  highest  priority  interrupt.  Another  use 
for  priority  encoders  is  in  bus  control,  where  they  are  used  in 
arbitration  schemes  for  allowing  selective  access  to  the  bus. 

The  model  of  a  priority  encoder  is  shown  in  Figure  4.  The  four 
input  signals  are  A,  B,  C  and  D.  These  are  to  be  encoded  as  LL, 
LH,  HL  and  HH  outputs.  Let  us  assign  priority  to  D  over  C,  C  over 
B,  and  B  over  A.  The  next  design  step  would  be  to  modify  the  truth 
table  (Figure  5)  to  reflect  these  priorities. 
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TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 


SIMPLE_ENCODER 
ENCODE. PDS 
01 

JOE  ENGINEER 

MONOLITHIC  MEMORIES,  INC. 
8/20/87 


This  is  an  example  of  a  simple  encoder.  The  output  CI  and  CO 
are  generated  based  upon  inputs  A,  B,  C  and  D.  The  function 
truth  table  is  following: 


A 

B 

C 

D 

CO 

CI 

1 

0 

0 

0 

L 

L 

0 

1 

0 

0 

L 

H 

0 

0 

1 

0 

H 

L 

0 

0 

0 

1 

H 

H 

CHIP  ENCODER  PAL16H2 


A  B  C  D  NC  NC  NC  NC  NC  GND 
NC  NC  NC  NC  CO  CI  NC  NC  NC  VCC 


EQUATIONS 

CI   «  /A 
♦  /A 

CO  «  /A 
+  /A 

SIMULATION 


B  *  /C 
/B   *  /C 


/B 
/B 


C 

/c 


/D 
D 

/D 
D 


TRACE_ON 

SETF         A  /B  /C  /D 
CHECK  /CO  /CI 
SETF       /A     B  /C  /D 
CHECK  /CO  CI 
SETF       /A  /B     C  /D 
CHECK     CO  /CI 
SETF       /A  /B  /C  D 
CHECK     CO  CI 


Figure  3.  PALASM  2  File  for  a  Simple  Encoder 


PRIORITY 
ENCODER 
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Figure  4.  A  Four-Input  Priority  Encoder  Block  Diagram 
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Figure  5.  Priority  Encoder  Truth  Table 
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The  Boolean  equations,  directly  derived  from  the  truth  table,  are: 


Cl  =  /A 
+  /A 
+ 

+ 


These  equations  can  be  further  optimized  by  the  design  software 
to  the  following: 


B 

/B 


/C 

/c 
/c 


/D 
D 

/D 
D 


CO 


/A 
/A 


/B 
/B 


C 
/C 

c 


m 

D 
/D 
D 


Cl 
CO 


D  +  /C 
D  +  C 


Although  a  priority  encoder  is  a  purely  combinatorial  function, 
output  registers  are  frequently  used  to  hold  the  output  signal 
stable  for  longer  durations.  An  example  of  a  registered  1 6-input 
priority  encoder  is  shown  in  Figure  6. 


TITLE  16   INPUT  REGISTERED  PRIORITY  ENCODER 

PATTERN  P7090 

REVISION  01 

AUTHOR  J.  ENGINEER 

COMPANY  MONOLITHIC  MEMORIES,  INC. 

DATE  8/20/87 


CHIP  IN_PRTY  PAL20R4 
DESCRIPTION 


THE  16   INPUT  REGISTERED  PRIORITY  ENCODER  ACCEPTS  SIXTEEN  ACTIVE-LOW  INPUTS 
(10-115)   TO  LOAD  THE  BINARY  WEIGHTED  CODE  OF  THE  PRIORITY  ORDER   INTO  THE 
OUTPUT  REGISTER   (Q3-Q0)   ON  THE  RISING  EDGE  OF  THE  CLOCK   (CLK).      A  PRIORITY 
IS  ASSIGNED  TO  EACH  INPUT  SO  THAT  WHEN  TWO  INPUTS  ARE  SIMULTANEOUSLY 
ACTIVE,   THE  INPUT  WITH  THE  HIGHEST  PRIORITY   IS  LOADED  INTO  THE  OUTPUT 
REGISTER.      THEREFORE  THE  HIGHEST  PRIORITY   INPUT   (I0=H)   PRODUCES  HHHH  IN 
THE  OUTPUT  REGISTER  AND  THE  LOWEST  PRIORITY  INPUT   (I15=H)   PRODUCES  LLLL  IN 
THE  OUTPUT  REGISTER. 


OPERATIONS  TABLE 


/OC     CLK     115-10       Q3-Q0  OPERATION 


H 

X 

Z 

HI-Z 

L 

c 

10 

=H 

15 

10 

INTERRUPT 

L 

c 

11 

=H 

14 

11 

INTERRUPT 

L 

c 

11 

=H 

13 

12 

INTERRUPT 

L 

c 

11 

-H 

12 

13 

INTERRUPT 

L 

c 

11 

=H 

11 

14 

INTERRUPT 

L 

c 

11 

=H 

10 

15 

INTERRUPT 

L 

c 

11 

=H 

9 

16 

INTERRUPT 

L 

c 

11 

=H 

8 

17 

INTERRUPT 

L 

c 

11 

=H 

7 

18 

INTERRUPT 

L 

c 

11 

=H 

6 

19 

INTERRUPT 

L 

c 

11 

=H 

5 

110 

INTERRUPT 

L 

c 

11 

=H 

4 

111 

INTERRUPT 

L 

c 

11 

=H 

3 

112 

INTERRUPT 

L 

c 

11 

=H 

2 

113 

INTERRUPT 

L 

e 

11 

=H 

1 

114 

INTERRUPT 

L 

c 

11 

=H 

0 

115 

INTERRUPT 

CLK  10  II  12  13  14  15  16  17  18  19  GND 
/OC  110   111   112  Q3  Q2  Ql  Q0   113   114   115  VCC 


EQUATIONS 


/Q0    :=  /I0*  II 

+  /I0*/I1*/I2*  13 

+  /I0*/I1*/I2*/I3*/I4*  15 

+  /I0*/I1*/I2*/I3*/I4*/I5*/I6*  17 

+  /I0*/I1*/I2*/I3*/I4*/I5*/I6*/I7*/I8*  19 

+  /I0VI1*/I2*/I3VI4*/I5VI6*/I7*/I8*/I9*/I10*  111 

+  /I0VI1VI2VI3*/I4*/I5*/I6VI7*/I8*/I9*/I10*/I11*/I12*  113 

+  /I0*/I1*/I2*/I3*/I4*/I5*/I6*/I7*/I8*/I9*/I10*/I11*/I12*/I13*/I14*I15 


Figure  6.  A  16-lnput  Registered  Priority  Encoder  Design  File 
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/Ql   :=  /I0*/I1*  12 

+  /I0*/I1*/I2*  13 

+  /I0*/I1*/I2*/I3*/I4*/I5*  16 

+  /I0*/I1*/I2*/I3*/I4*/I5*/I6*  17 

+  /I0*/I1*/I2*/I3*/I4*/I5*/I6*/I7*/I8*/I9*  110 

+  /I0*/H*/I2*/I3*/I4*/I5*/I6*/I7*/I8*/I9*/H0*  111 

+  /I0*/I1*/I2*/I3*/I4«/I5*/I6*/I7«/I8*/I9»/I10*/I11*/112*/I13*  114 

+  /I0*/I1VI2VI3»/I4*/I5*/I6VI7VI8*/I9VI10*/I11*/I12*/I13*/I14*I15 

/Q2   :=  /I0*/I1*/I2*/I3*  14 

+  /I0*/I1*/I2*/I3*/I4*  15 

+  /I0*/I1*/I2*/I3*/I4»/I5*  16 

+  /I0*/I1*/I2*/I3*/I4*/I5*/I6*  17 

+  /I0*/I1*/I2*/I3*/I4*/I5*/I6*/I7*/I8*/I9*/I10*/I11*  112 

+  /I0*/I1*/I2*/I3*/I4*/I5*/I6*/I7*/I8*/I9*/110*/111*/I12*  113 

+  /I0*/I1*/I2*/I3*/I4*/I5*/I6*/I7*/I8*/19*/I10*/I11*/I12*/I13*  114 

+  /I0*/I1*/12*/I3*/I4*/I5*/I6*/I7*/I8*/I9*/I10*/I11*/I12*/I13*/I14*I15 

/Q3  :=  /I0*/I1*/I2*/I3*/I4*/I5*/I6»/I7*  18 

+  /I0*/I1*/I2*/I3*/I4*/I5*/I6*/I7*/I8*  19 

+  /I0*/I1*/I2*/I3*/I4*/I5*/I6*/I7*/I8*/I9*  110 

+  /I0*/I1*/I2*/I3*/I4*/I5*/I6*/I7*/I8*/I9*/I10*  111 

+  /IO*/I1*/I2*/I3*/I4*/I5*/I6*/I7*/I8*/I9*/I10*/I11*  112 

+  /I0*/I1*/I2VI3VI4*/I5*/I6VI7*/I8VI9VI10*/IH*/I12*  113 

+  /I0*/I1*/I2*/I3VI4*/I5*/I6VI7*/I8VI9VI10*/I11*/H2*/I13*  114 

+  /I0*/I1VI2*/I3*/I4*/I5VI6*/I7*/I8»/I9VI10*/I11*/I12*/I13*/I14*I15 

;    SIMULATION  NOT  INCLUDED  HERE 


Figure  6.  A  16-lnput  Registered  Priority  Encoder  Design  File  (Cont'd.) 
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INPUT  SELECT  LINES 


OUTPUT  LINES 


QO    Q1    Q2    Q3    Q4    Q5    Q6    Q7    Q8    Q9  Q10  Q11  Q12  Q13  Q14  Q15 


Figure  7.  The  Truth  Table  of  an  Active-LOW  4-to-16  Decoder 


Decoders 


A  decoder  performs  the  reverse  function  of  an  encoder.  It 
converts  an  n-bit  code  to  one  of  its  2"  unique  items.  It  is  a 
combinatorial  circuit  designed  such  that  at  most  one  of  its  several 
outputs  will  be  asserted  based  upon  the  unique  input  codes. 

Adecoder  may  have  as  many  outputs  as  there  are  possible  binary 
input  selection  combinations.  As  shown  in  the  truth  table 
(Figure  7),  only  one  output  may  be  asserted  at  any  time.  When 
a  new  combination  is  applied,  another  output  is  asserted  and  the 
original  output  is  returned  to  its  non-asserted  state. 

The  Boolean  logic  equations  can  be  directly  derived  from  the  truth 
table  shown  in  Figure  7.  The  procedure  is  the  same  as  explained 
in  the  prev  ous  section  on  PLD  design  methodology  on  page  2-27. 
The  Boolean  equations  derived  are  shown  in  Figure  8. 


/QO 

/D 

* 

/c 

* 

1% 

* 

/A 

/Ql 

/D 

* 

/c 

* 

/B 

* 

A 

/Q2 

/D 

* 

/c 

* 

B 

* 

/A 

/Q3 

/D 

* 

/c 

* 

B 

* 

A 

/Q4 

/D 

* 

c 

* 

/B 

* 

/A 

/Q5 

/D 

* 

c 

* 

/B 

* 

A 

/Q6 

/D 

* 

c 

* 

B 

* 

/A 

/Q7 

/D 

* 

c 

* 

B 

* 

A 

/Q8 

D 

* 

/c 

* 

/B 

* 

/A 

/Q9 

D 

* 

/c 

* 

/B 

* 

A 

/Q10 

D 

★ 

/c 

* 

B 

* 

/A 

/Qll 

D 

★ 

/c 

* 

B 

* 

A 

/Q12 

D 

* 

c 

/B 

* 

/A 

/Q13 

D 

* 

c 

* 

m 

* 

A 

/Q14 

D 

* 

c 

* 

B 

* 

'A 

/Q15 

D 

* 

c 

* 

B 

* 

A 

Notice  from  the  truth  table  that  there  is  no  combination  of  inputs 
that  will  send  all  the  outputs  to  their  non-asserted  state.  Many 
designs  actually  need  to  be  able  to  make  all  outputs  inactive.  This 
can  be  done  simply  by  putting  enable  lines  in  all  of  the  output  AND 
gates.  Many  such  design  modifications  can  be  easily  added  once 
the  basic  Boolean  equations  have  been  derived,  instead  of 
redoing  the  truth  table.  Figure  9  shows  the  PALASM  2  software 
design  file  of  one  such  4-to-1 6  decoder  with  the  enable  function. 

Probably  the  most  commonly  used  decoders  are  the  address 
decoders  required  by  most  microprocessors  and  bus  interfaces. 
These  also  constitute  the  most  common  application  of  PLDs  in 
digital  designs.  The  design  considerations  for  address  decoders 
have  been  covered  earlier  in  the  PLD  Design  Methodology 
section,  page  2-23.  Later  we  will  develop  a  general  Boolean 
equation  for  an  address  decoder  circuit  when  we  discuss  range 
decoders. 


Figure  8.  Decoder  Boolean  Logic  Equations 
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Title  4tol6  Decoder 

Pattern  4-16DEC . PDS 

Revision  A 

Author  Mehrnaz  Hada 

Company  Monolithic  Memories,  Santa  Clara,  CA 

Date  1/9/85 


CHIP  Decoder  PAL6L16 


QO  Ql  Q2  A  B  C  D  EN1  EN2  Q3  Q4  GND 
Q5  Q6  Q7  Q8  Q9  Q10  Qll  Q12  Q13  Q14  Q15  VCC 

;The  4  to  16  decoder  decodes  four  binary  encoded  inputs 
;into  one  of  16  mutually  exclusive  outputs,  whenever  the 
;two  enable  lines  EN1  and  EN 2  are  high.  When  one  or  both 
;of  the  enable  lines  are  low  the  outputs  are  all  set  to 
;high  values. 


EQUATIONS 


/QO 

/D*/C*/B*/A* 

EN1* 

EN  2 

; Decode 

0000 

/Ql 

/D*/C*/B*  A* 

EN1  * 

EN  2 

; Decode 

0001 

/Q2 

/D*/C*  B*/A* 

EN1* 

EN2 

; Decode 

0010 

/Q3 

/D*/C*   B*  A* 

EN1* 

EN  2 

; Decode 

0011 

/Q4 

/D*  C*/B*/A* 

EN1* 

EN  2 

; Decode 

0100 

/Q5 

/D*  C*/B*  A* 

EN1  * 

EN  2 

; Decode 

0101 

/Q6 

/D*  C*  B*/A* 

EN1* 

EN  2 

; Decode 

0110 

/Q7 

/D*  C*   B*  A* 

EN1* 

EN  2 

; Decode 

0111 

/Q8 

D*/C*/B*/A* 

EN1  * 

EN2 

; Decode 

1000 

/Q9 

D*/C*/B*  A* 

EN1* 

EN2 

; Decode 

1001 

/Q10 

D*/C*  B*/A* 

EN1* 

EN2 

; Decode 

1010 

/Qll 

D*/C*  B*  A* 

EN1* 

EN  2 

; Decode 

1011 

/Q12 

D*  C*/B*/A* 

EN1* 

EN2 

; Decode 

1100 

/Q13 

D*  C*/B*  A* 

EN1* 

EN  2 

; Decode 

1101 

/Q14 

D*  C*  B*/A* 

EN1* 

EN  2 

; Decode 

1110 

/Q15 

D*  C*   B*  A* 

EN1* 

EN  2 

; Decode 

1111 

SIMULATION 

TRACE_ON     D  B  C  A  Q0  Ql  Q2  Q3  Q4  Q5  Q6  Q7  Q8  Q9  Q10 
Qll  Q12  Q13  Q14  Q15 

SETF  /D  /C  /B  /A  EN1  EN 2 

SETF  A 

SETF  B 

SETF  C 

SETF  D 

SETF  /D 

SETF  /C 

SETF  /B 

SETF  /A 

SETF  /EN1 

SETF  EN1  /EN2 
SETF  /EN1 

TRACE  OFF 


Figure  9.  Design  File  of  Four-to-Slxteen  Decoder  with  Enable 


;Set  outputs  to  high 

;Set  outputs  to  high 
;Set  outputs  to  high 


Encoder/Decoder  Device  Selection  Considerations 

The  general  device  selection  considerations  are  listed  in 
Figure  10.  Based  upon  the  number  of  inputs  and  outputs 
required,  a  device  can  be  selected  from  the  table  in  Figure  11  for 
combinatorial  devices  or  Figure  12  for  registered  devices. 


Number  of  Input  Pins 
Number  of  Output  Pins 
Number  of  I/O  Pins 
Device  Speed 

Device  Power  Requirements 
Number  of  Registers 
Number  of  Product  Terms 
Output  Polarity  Control 


Figure  10.  General  Encoder/Decoder  Device  Selection 
Considerations 
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DEVICE 

TOTAL 

DEDICATED 

DEDICATED 

I/O 

LOGICAL 

SPEED  GRADES 

NUMBER 

INPUT 

OUTPUT 

PINS 

PRODUCT 

tPD  "N  ns 

OF  PINS 

PINS 

PINS 

TERMS/ 

OUTPUT 

TTL 

10H8 

20 

10 

8 

- 

2 

35 

10L8 

20 

10 

8 

2 

35 

12H6 

20 

12 

6 

2-4 

35 

12L6 

20 

12 

6 

2-4 

35 

14H4 

20 

14 

4 

4 

35 

14L4 

20 

14 

4 

4 

35 

16H2 

20 

16 

2 

- 

8 

35 

16L2 

20 

16 

2 

- 

8 

35 

16C1 

20 

16 

2 

16 

40 

16L8 

20 

10 

2 

6 

7 

10  1=;  ?r  *35 

16P8 

20 

10 

2 

5 

7 

25 

1 6RA8 

20 

8 

g 

4 

30 

18P8 

20 

10 

8 

8 

15  25 

6L16 

24 

6 

16 

1 

25 

8L14 

24 

8 

14 

- 

1 

25 

12L10 

24 

12 

10 

2 

40 

14L8 

24 

14 

8 

2-4 

40 

16L6 

24 

16 

6 

2-4 

40 

18L4 

24 

18 

4 

4-6 

40 

20L2 

24 

20 

2 

8 

40 

20C1 

24 

20 

2 

16 

40 

20L8 

24 

14 

2 

6 

7 

15,  25,35 

20L10 

24 

12 

2 

8 

3 

15,  20,  25,  30 

20RA10 

24 

10 

10 

4 

20,  30 

20S1 0 

24 

12 

2 

8 

8-16 

35 

22P10 

24 

12 

10 

8 

15,  25 

22XP10 

24 

12 

10 

8 

20,  30,  40 

22RX8 

24 

14 

8 

8 

25 

22V10 

24 

12 

10 

8-16 

15,  25,  35 

32VX10 

24 

12 

10 

8-16 

25,  30 

CMOS 

C16L8 

20 

10 

2 

6 

7 

25 

C20L8 

24 

14 

2 

6 

7 

35,  45 

C22V10 

24 

12 

10 

8-16 

25,  35 

C29M16 

24 

5 

16 

8-16 

35,  45 

C29MA1 

S 

24 

5 

16 

4-12 

35,45 

ECL 

10H20P! 

I 

24 

12 

8 

4-8 

6 

Figure  11.  Combinatorial  Programmable  Logic  Devices 
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DEVICE 

TOTAL 

DEDICATED 

DEDICATED 

I/O 

LOGICAL 

SPEED  GRADES 

NUMBER 

INPUT 

REGISTERED 

PINS 

PRODUCT 

FREQUENCY  IN 

OF  PINS 

PINS 

OUTPUT 

TERMS/ 

MHz 

PINS 

OUTPUT 

^MAX 

TTL 

16R8 

20 

8 

8 

8 

55  5  37  25  1 6 

16R6 

20 

8 

6 

2 

8 

55  5  37  25  1 6 

16R4 

20 

8 

4 

4 

8 

55  5  37  25  1 6 

16RP8 

20 

8 

8 

8 

22.2 

16RP6 

20 

8 

6 

2 

8 

22.2 

16RP4 

20 

8 

4 

4 

8 

22.2 

16RA8 

20 

8 

0-8 

8-0 

4 

20 

16X4 

20 

8 

4 

4 

8 

14 

23S8 

20 

g 

4 

4 

8-1 2 

OJ.O,  £0.0 

20R8 

24 

12 

8 

8 

37  25  16 

20R6 

24 

12 

6 

2 

8 

37,  25,  16 

20R4 

24 

12 

4 

4 

8 

37,  25,  16 

20X10 

24 

10 

10 

A 
"» 

OA 

I  u 

o 
o 

A 

99  9 

20X4 

24 

10 

4 

6 

4 

22.2 

20RP1 0 

24 

10 

10 

Q 
O 

9*;  *37 

20RP8 

24 

1 0 

8 

2 

g 

9**  ^7 

20RP6 

OA 

10 

a 

A 

a 
o 

9R  97 

20RP4 

24 

10 

4 

C 
D 

p 

0 

9^  V7 

20XRP10 

24 

10 

10 

8 

30,22.2,  14 

20XRP8 

24 

10 

8 

2 

8 

30,22.2,  14 

20XRP6 

24 

10 

6 

4 

8 

30,22.2,  14 

20XRP4 

24 

10 

4 

6 

8 

30,  22.2,  14 

20RS10 

24 

10 

10 

8-16 

19.2 

20RS8 

24 

10 

8 

2 

8-16 

19.2 

20RS4 

24 

10 

4 

6 

8-16 

19.2 

20RA10 

24 

10 

0-10 

10-0 

4 

33,  20 

22RX8 

24 

14 

0-8 

8-0 

8 

28.5 

22V10 

24 

12 

0-10 

10-0 

8-16 

40,  28.5,  18 

32VX10 

24 

12 

0-10 

10-0 

8-16 

25,  22.2 

32R16 

40 

16 

16 

8-16 

16 

CMOS 

C16R8 

20 

8 

8 

8 

28.5 

C16R6 

20 

8 

6 

2 

8 

28.5 

C16R4 

20 

8 

4 

4 

8 

28.5 

C20R8 

24 

12 

8 

8 

20,  15.3 

C20R6 

24 

12 

6 

2 

8 

20,  15.3 

C20R4 

24 

12 

4 

4 

8 

20, 15.3 

C22V10 

24 

12 

0-10 

10-0 

8-16 

33.3,  20 

C29M16 

24 

5 

16 

8-16 

20,  15 

C29MA16 

24 

5 

16 

4-12 

20,  15 

ECL 

1 0H/020EV/G8 

24 

12 

0-8 

8-0 

8-12 

125 

10H20G8 

24 

12 

0-8 

8-0 

4-8 

NA 

(Latched) 

Figure  1 2.  Registered  Programmable  Logic  Devices 
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Encoders  typically  require  a  large  number  of  inputs  and  fewer 
outputs,  whereas  decoders  typically  require  a  large  number  of 
outputs  and  fewer  inputs.  We  can  see  from  these  tables  that  the 
PAL  device  for  the  priority  encoder  application  (Figure  6)  should 
be  the  PAL20R4  and  for  the  decoder  application  (Figure  7)  it 
should  be  the  PAL6L16. 

Another  important  device  selection  consideration  for  encoders 
and  decoders  is  the  number  of  product  terms  required  for  a 
design.  A  careful  selection  of  code  values  (and  priority  assign- 
ments in  priority  encoders)  can  often  reduce  the  required  number 
of  product  terms.  This  can  sometimes  determine  whether  or  not 
a  design  fts  a  device  successfully.  Figure  13  shows  the  truth 
tables  of  two  simple  partial  3-to-2  encoders.  The  product  terms 
required  for  the  two  designs  are  different  due  to  the  different 
assignment  of  encoded  bits. 


XI  = 


XO  - 


INPUTS 

OUTPUTS 

INPUTS 

OUTPUTS 

ABC 

x,  x0 

ABC 

x,  \ 

1    o  o 
o     i  0 
0      0  1 

0  0 

0  1 

1  0 

1       0  0 
0       1  0 
0       0  1 

0  1 

1  0 

1  1 

/A  *  /B  *  C 


/A  *     B  *  /C 


XI  -  /A  *  B  *  /C 

+  /A  *  /B  *  C 

XO  =  A  *  /B  *  /C 

+  /A  *  /B  *  C 


Figure  13.  Two  Encoders  With  Different  Product  Term 
Requirements 

Another  way  of  looking  at  a  decoder  is  as  a  logic  function  which, 
depending  upon  the  select  code  applied,  connects  one  data  input 
to  the  seeded  outputs.  Also  known  as  a  Demultiplexer,  a 
decoder  essentially  connects  an  input  to  one  of  2"  outputs  based 
upon  n  select  code  bits.  The  reverse  logic  function,  which 
combines  data  from  multiple  sources  to  an  output  signal,  is  called 
a  multiplexer  and  is  discussed  next. 


Multiplexers 

A  multiplexer  (sometimes  referred  to  as  a  data  selector)  is  a 
special  combinatorial  circuit,  widely  used  in  digital  design.  It  is 
designed  to  gate  one  of  several  inputs  to  a  single  output.  The 
input  selected  for  connection  to  the  output  is  controlled  by  a 
separate  set  of  select  inputs. 

The  traditional  use  of  a  multiplexer  is  for  "time  division  multiplex- 
ing" in  data  communication,  when  gating  several  data  lines  to  a 
single  data  transmission  line  for  short  intervals  of  time.  The  data 
received  is  then  demultiplexed  by  using  a  demultiplexer,  de- 
scribed earlier. 

The  design  methodology  employed  for  multiplexer  design  is  the 
truth-table  approach.  As  an  example,  we  can  look  at  a  three- 
input-to-one-output  (3:1)  multiplexer,  which  uses  two  select  sig- 
nals A  and  B.  Based  on  these  two  select  bits,  the  data  on  one  of 
the  three  inputs  is  sent  to  the  output.  The  truth  table  is  shown  in 
Figure  14. 


SELECT 

INPUTS 

OUTPUT 

B  A 

11  CO 

11 C1 

11 C2 

01 Y 

0  0 

0 

X 

X 

0 

0  0 

1 

X 

X 

1 

0  1 

X 

0 

X 

0 

0  1 

X 

1 

X 

1 

1  0 

X 

X 

0 

0 

1  0 

X 

X 

1 

1 

Figure  14.  Truth  Table  for  a  Three-to-One  Multiplexer 

Deriving  the  Boolean  equation  from  this  truth  table  is  a  straight- 
forward task.  Inthiscasenofurtherminimizationispossible.  The 
Boolean  equation  is: 

/01Y  =  /B  *  /A  *  /I1C0 
+  /B  *  A  *  /I1C1 
+     B  *  /A  *  /I1C2 

The  complete  PALASM  2  design  file  for  this  design  example 
showing  four  such  three-to-one  multiplexers  is  shown  in 
Figure  15. 


KHD 


2-43 


Combinatorial  Logic  Design 


TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 

.■DESCRIPTION 


QUAD  3:1  MULTIPLEXER 

P7074 

01 

COLI/VOLPIGNO 

MONOLITHIC  MEMORIES,  INC. 

8/20/87 


;THIS   IS  AN  EXAMPLE  OF  A  QUAD  3-TO-l  MULTIPLEXER  USING  A  PAL14L4.  SELECT 
,-LINES  A.B  ARE  ENCODED  IN  BINARY,   WITH  A  REPRESENTING  THE  LSB.      THE  OUTPUTS 
;(Y)   ARE  ALL  HIGH  IF  THE  SELECT  LINES  ARE  BOTH  HIGH   (B,A=H) . 

•  OPERATIONS  TABLE: 


INPUT 

OUTPUTS 

SELECT 

B  A 

Y 

L  L 

CO 

L  H 

CI 

H  L 

C2 

H  H 

H 

CHIP  QUAD_MUX  PAL14L4 

I1C0  I1C1  I1C2  I2C0  I2C1  I2C2  I3C0  I3C1  I3C2  GND 
I4C0   I4C1   I4C2     04Y     03Y     02Y     OlY       B         A  VCC 

EQUATIONS 


/OlY 


/03Y 


/04Y 


/B*/A 

/I1C0 

; SELECT 

INPUT 

ICO 

+ 

/B*  A 

* 

/I1C1 

; SELECT 

INPUT 

1C1 

+ 

B*/A 

/I1C2 

; SELECT 

INPUT 

1C2 

/B*/A 

* 

/I2C0 

; SELECT 

INPUT 

2C0 

+ 

/B*  A 

* 

/I2C1 

; SELECT 

INPUT 

2C1 

+ 

B*/A 

* 

/I2C2 

; SELECT 

INPUT 

2C2 

/B*/A 

* 

/I3C0 

; SELECT 

INPUT 

3C0 

+ 

/B*  A 

/I3C1 

; SELECT 

INPUT 

3C1 

+ 

B*/A 

A 

/I3C2 

; SELECT 

INPUT 

3C2 

/B*/A 

/I4C0 

,■  SELECT 

INPUT 

4C0 

+ 

/B*  A 

* 

/I4C1 

; SELECT 

INPUT 

4C1 

+ 

B*/A 

* 

/I4C2 

; SELECT 

INPUT 

4C2 

;    SIMULATION  NOT  INCLUDED  HERE 

Figure  15.  Quad  3:1  Multiplexer 
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The  equations  derived  in  the  above  example  can  be  easily 
generalized  for  other  multiplexers.  The  symbol  for  a  general 
2"-inputs-to-one-output  multiplexer  is  shown  in  Figure  16  where 
n  select  lines  are  used. 


I2n. 


MULTIPLEXER 


■  Y 
OUTPUT 


So  Si  Sn-1 
INPUT  SELECT  LINES 

Figure  16.  General  Model  of  a  2n-to-1  Multiplexer 

The  Boolean  equations  are: 


n=3 


/S1VS0*  (10) 
/SI*  SO* (II) 
S1*/S0*(I2) 
SI*  SO* (13) 


=  /S2*/S1*/S0* (10) 

+  /S2*/S1*  SO* (II) 

+  /S2*  S1*/S0* (12) 

+  /S2*  SI*  SO*  (13) 

+  S2*/S1*/S0* (14) 

+  S2*/S1*  SO*  (15) 

+  S2*  S1*/S0* (16) 
S2*  SI*  SO*  (17) 

Multiplexer  Device  Selection  Considerations 

Multiplexe  s  typically  require  more  inputs  than  outputs,  so  the 
devices  wi  h  a  large  number  of  inputs  and  l/Os  are  usually  more 
useful.  Careful  consideration  must  also  be  given  to  the  number 
of  product  terms  available  on  each  output. 


Several  multiplexers  are  often  used  simultaneously  to  route 
multiple  address  and  data  bits,  under  the  control  of  the  same 
select  lines.  In  such  cases,  multiple  devices  can  be  cascaded 
when  the  number  of  inputs  and  outputs  exceeds  device  limits. 
Cascading  is  also  possible  for  large  multiplexers  that  do  not  fit  in 
a  single  device.  In  such  cases,  the  select  bits  should  also  be 
judiciously  selected  for  each  PLD,  to  minimize  the  number  of 
product  terms. 

Another  common  trick  for  designing  a  multiplexer  is  to  connect  a 
number  of  outputs  together  and  control  the  output  enables  using 
the  select  bits  to  multiplex  data.  This  multiplexing  technique  is 
used  extensively  in  DRAM  controllers  for  multiplexing  address 
signals.  Timing  considerations  for  such  designs  include  the 
output  enable  and  disable  times,  which  should  be  carefully 
selected  to  avoid  output  contentions. 

Comparators 

A  comparator  is  a  combinatorial  circuit  designed  primarily  to 
compare  the  relative  magnitude  of  two  binary  numbers.  Figure  1 7 
shows  the  truth  table  for  a  two-bit  comparator. 


INPUTS 

OUTPUTS 

A 

B 

EQL 

LES 

GTR 

A, 

A, 

B2 

B, 

A=B 

A<B 

A>B 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

1 

0 

1 

0 

0 

0 

1 

0 

0 

1 

0 

0 

0 

1 

1 

0 

1 

0 

0 

1 

0 

0 

0 

0 

1 

0 

1 

0 

1 

1 

0 

0 

0 

1 

1 

0 

0 

1 

0 

0 

1 

1 

1 

0 

1 

0 

1 

0 

0 

0 

0 

0 

1 

1 

0 

0 

1 

0 

0 

1 

1 

0 

1 

0 

1 

0 

0 

1 

0 

1 

1 

0 

1 

0 

1 

1 

0 

0 

0 

0 

1, 

1 

1 

0 

1 

0 

0 

1 

1 

1 

1 

0 

0 

0 

1 

1 

1 

1 

1 

1 

0 

0 

Figure  17.  Truth  Table  for  a  Comparator 

A  basic  comparator  compares  two  numbers  only  for  equality, 
and  generates  the  EQL  signal  (indicating  A=B).  An  extension, 
called  a  magnitude  comparator,  also  generates  the  LES  signal 
(indicating  A<B)  and  GTR  signal  (indicating  A>B).  Based  on  this 
truth  table,  the  equations  for  the  three  output  signals  EQL,  LES 
and  GTR  can  be  easily  derived.  These  equations  can  then  be 
optimized  by  using  Boolean  algebra,  Karnaugh  maps  or  the 
minimization  routine  available  with  PALASM  2  software. 
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The  final  Boolean  equations  are: 


EQL 


LES  -  B2 
+  (B2 


'  /A2 

:  +  :  /A2)   *  Bl  *  /Al 


LES 


=    /A2  *  /Al  * 

/B2 

/Bl 

+   /AZ    *      Al  * 

/B2 

* 

Bl 

+    A2  *  /Al  * 

B2 

* 

/Bl 

+    A2  *    Al  * 

B2 

* 

Bl 

-    /A2  *  /Al  * 

/B2 

* 

Bl 

+  /A2  *  /Al  * 

B2 

* 

/Bl 

+  /A2  *  /Al  * 

B2 

* 

Bl 

+  /A2  *    Al  * 

B2 

* 

/Bl 

+  /A2  *    Al  * 

B2  * 

Bl 

+    A2  *  /Al  * 

B2 

* 

Bl 

-  /Al  *    B2  * 

Bl 

+  /A2  *  /Al  * 

Bl 

+  /A2  *  B2 

GTR  -    A2  *  /B2 

+  (A2  :  +  :  /B2)   *  Al  *  /Bl 

These  equations  can  then  be  extended  for  a  general  comparison 
of  n-bit  comparands  as  follows: 

LES  -    Bn  *  /An 

+  (Bn  :+:  /An)  *  Bn-1  *  /An-1 

+  (Bn  :  +  :  /An)  *  (Bn-1  :+:  /An-1)  *  Bn-2  * 
/An-2 

+  ... 
+  ... 
+  ... 

+  (Bn  :  +  :  /An)  *  (Bn-1  :  +  :  /An-1)  ... 

(B2  :  +  :  /A2)  *  Bl  *  /Al 


/Bn-1 

:  +  :  /Bn-1)  *  An-2  * 


GTR  = 

/A2 

* 

Al 

/B2 

* 

/Bl 

+ 

A2 

/Al 

/B2 

* 

/Bl 

GTR  = 

An  *  /Bn 

+ 

A2 

* 

Al 

* 

/B2 

/Bl 

+ 

(An  :+:  /Bn)  * 

An-1  * 

+ 

A2 

* 

/Al 

* 

/B2 

* 

Bl 

+ 

(An  :  +  :  /Bn)  * 

(An-1 

+ 

A2 

* 

Al 

* 

/B2 

* 

Bl 

/Bn-2 

+ 

A2 

* 

Al 

* 

B2 

* 

/Bl 

+ 

Al 

* 

/B2 

* 

/Bl 

+ 
+ 

+ 

A2 

* 

Al 

* 

/Bl 

+ 

(An  :  +  :  /Bn)  * 

(An-1 

+ 

A2 

* 

/B2 

(A2  :  +  :  /B2) 

*  Al  * 

Comparator  Device  Selection  Considerations 

The  number  of  product  terms  needed  is  directly  related  to  the 
number  of  bits  compared.  For  LES  (less  than)  and  GTR  (greater 
than)  functions,  the  number  of  product  terms  required  depends 
upon  the  number  of  bits  in  the  two  operands  compared,  as  well  as 
their  value.  The  LES  and  GTR  equations  can  be  rewritten  as 
follows: 


:+:  /Bn-1) 
/Bl 


The  total  number  of  product  terms  required  for  an  n-bit  compari- 
son is  2"-1 .  Comparators  require  a  large  number  of  product  terms 
and  devices  that  offer  many  product  terms  can  be  used  very  ef- 
fectively. Figure  1 8  shows  a  list  of  devices  that  offer  many  product 
terms.  It  also  includes  registered  devices,  since  registers  are 
sometimes  used  to  hold  the  comparator  output  signals  asserted 
for  long  durations. 


DEVICE 

TOTAL 

DEDICATED 

DEDICATED 

I/O 

LOGICAL 

SPEED  GRADES 

NUMBER 

INPUT 

OUTPUT 

PINS 

PRODUCT 

tPD  IN  ns 

OF  PINS 

PINS 

PINS 

TERMS/ 

OUTPUT 

TTL 

16C1 

20 

16 

2 

16 

40 

20C1 

24 

20 

2 

16 

40 

20S10 

24 

12 

2 

8 

8-16 

35 

22V10 

24 

12 

10 

8-16 

25,  35 

32VX10 

24 

12 

10 

8-16 

25,  35 

CMOS 

C22V10 

24 

12 

10 

8-16 

25,  35 

C29M16 

24 

5 

16 

8-16 

35,45 

C29MA16 

24 

5 

16 

4-12 

35,  45 

Figure  18.  PLDs  With  Large  Numbers  of  Product  Terms 
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As  is  obvious  from  these  equations,  comparators  require  exclu- 
sive-OR  functions,  and  can  also  be  efficiently  implemented  in  the 


PAL1 6X4,  which  offers  exclusive-OR  functions.  An  example  of  a 
four-bit  comparator  implemented  on  a  PAL16X4  is  shown  in 
Figure  19. 


PAL16X4  PAL  DESIGN  SPECIFICATION 

P7031  BIRKNER/COLI  07/12/81 

BETWEEN  LIMITS  COMPARATOR /REGISTER 

MMI   SUNNYVALE,  CALIFORNIA 

CLK  LOAD  CLEAR  BO  Bl  B2  B3  NC  /OC2  GND 
/OC1   /NE       /EQ     A3  A2  Al  AO  /LT  /GT  VCC 

DESCRIPTION 

THE  DEVICE  CONTINUOUSLY  COMPARES  THE  BUS  VALUE   (B3-B0)   WITH  THE  VALUE  OF 
THE  REGISTER   (A3-A0)   AND  REPORTS  THE  STATUS  ON  OUTPUTS  LT,    EQ,   NE,   AND  GT: 

*  LT  INDICATES  THAT  B  IS     LESS       THAN  A 

*  EQ  INDICATES  THAT  B  IS       EQUAL       TO  A 

*  NE  INDICATES  THAT  B  IS  NOT  EQUAL  TO  A 

*  GT  INDICATES  THAT  B  IS  GREATER  THAN  A 

STATUS  BUS  REG 


;/OCl 

/OC2 

CLK 

LOAD 

CLEAR 

LT 

EQ 

NE 

GT 

B3-B0 

A3-A0 

OPERATION 

•  H 

X 

X 

X 

X 

X 

X 

X 

X 

X 

Z 

REG  HI-Z 

•  X 

H 

X 

X 

X 

Z 

z 

z 

z 

X 

X 

STATUS  HI-Z 

;  L 

X 

X 

L 

L 

X 

X 

X 

X 

X 

A 

READ  REG 

;  x 

X 

C 

H 

L 

X 

X 

X 

X 

B 

B 

LOAD  REG 

;  x 

X 

C 

L 

L 

X 

X 

X 

X 

X 

A 

HOLD 

;  X 

X 

c 

X 

H 

X 

X 

X 

X 

X 

L 

CLEAR  REG 

;  X 

L 

X 

L 

L 

STATUS 

B 

A 

COMPARE 

IF(OC2)      LT  =  (A3*/B3)  ;B3=L,  A3=H 

+  (A3:*:B3)    *    (A2*/B2)  ;B2=L,  A2=H 

+  (A3:*:B3)    *    (A2:*:B2)    *    (A1*/B1)  ;B1=L,  A1=H 

+  (A3:*:B3)    *    (A2:*:B2)   *    (A1:*:B1)  *    (A0*/B0)      ;B0=L,  A0=H 

IF(OC2)      GT  =  (/A3*B3)  ;B3=H,  A3=L 

+  (A3:*:B3)   *    (/A2*B2)  ;B2=H,  A2=L 

+  (A3:*:B3)   *   (A2:*:B2)   *    (/A1*B1)  ;B1=H,   A1  =  L 

+  (A3:*:B3)    *    (A2:*:B2)   *    (A1:*:B1)  *    (/A0*B0)      ;B0=H,  A0=L 

IF(OC2)      EQ  f  (A3:*:B3)    *    (A2:*:B2)    *    (A1:*:B1)  *    (A0:*:B0)    ;COMPARE  EQUAL 

IF(OC2)      NE  =  (A3:+:B3)    +   (A2:+:B2)    +    (A1:+:B1)  +    (A0:+:BO)    ;NOT  EQUAL 

/A3    :=  (/A3)*/LOAD  ; HOLD     REG  A3 

+  </B3)*   LOAD  ; LOAD     REG  A3 

+     CLEAR  ; CLEAR  REG  A3 

/A2    :=  (/A2)*/LOAD  ;HOLD     REG  A2 

+  (/B2)*  LOAD  ; LOAD     REG  A2 

+     CLEAR  ; CLEAR  REG  A2 

/Al    :=  (/Al)*/LOAD  ; HOLD     REG  Al 

+  (/Bl)*  LOAD  ; LOAD     REG  Al 

+  CLEAR  ; CLEAR  REG  Al 

/AO    :=  (/A0)*/LOAD  ; HOLD     REG  AO 

+  (/BO)*  LOAD  ; LOAD     REG  AO 

+  CLEAR  ; CLEAR  REG  AO 

Figure  19.  PAL16X4  Based  Four-Bit  Comparator 
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FUNCTION  TABLE 


CLK 

/OC1 

/OC2 

LOAD  CLEAE 

B3  B2 

Bl  BO 

A3 

A2  Al  AO  LT 

EQ  NE  GT 

: CONTROL 

BUS 

REG 

/OC 

OPERATIONS 

BBBB 

AAAA 

STATUS  

COMMENTS 

"  CLK 

1 

2 

LOAD  CLEAR 

3210 

3210 

LI 

FO 

N.E 

GT 

(HEX  VALUES) 

Q 

x 

X 

H 

XXXX 

LLLL 

X 

x 

X 

x 

CLEAR  REG 

r 

L 

H 

L 

LLLL 

LLLL 

X 

x 

X' 

x 

LOAD  REG   ( 0 ) 

X 

L 

L 

L 

L 

LLLL 

LLLL 

L 

L 

L 

COMPARE   ( 0  EQ 

0  ) 

L 

L 

L 

L 

LLLH 

LLLL 

L 

L 

g 

COMPARE   ( 1  GT 

0  ) 

X 

L 

w 

L 

L 

XXXX 

LLLL 

X 

x 

x 

x 

READ  REG   ( 0 ) 

Q 

X 

X 

H 

XXXX 

LLLL 

X 

x 

X 

x 

CLEAR  REG 

n 
M 

L 

H 

L 

LHLH 

LHLH 

X 

J[ 

x 

LOAD  REG  (5) 

L 

L 

L 

L 

LLLL 

LHLH 

H 

L 

H 

L 

COMPARE   { 0  LT 

5  ) 

X 

L 

L 

L 

L 

LHLH 

LHLH 

L 

H 

L 

L 

COMPARE   ( 5  EQ 

5  } 

X 

L 

L 

L 

L 

HHHH 

LHLH 

L 

L 

H 

H 

COMPARE   ( F  GT 

5  ) 

X 

L 

X 

L 

L 

XXXX 

X 

X 

X 

X 

READ  REG    ( 5 ) 

c 

L 

X 

X 

H 

XXXX 

LLLL 

X 

X 

X 

X 

CLEAR  REG 

c 

L 

X 

H 

L 

HLHL 

HLHL 

X 

X 

X 

X 

LOAD  REG  (A) 

X 

L 

L 

L 

L 

LHLL 

HLHL 

H 

L 

H 

L 

COMPARE   ( 4  LT 

A) 

X 

L 

L 

L 

L 

HLHL 

HLHL 

L 

H 

L 

L 

COMPARE   (A  EQ 

A) 

X 

L 

L 

L 

L 

HLHH 

HLHL 

L 

L 

H 

H 

COMPARE   (B  GT 

A) 

X 

L 

X 

L 

L 

XXXX 

HLHL 

X 

X 

X 

X 

READ  REG  (A) 

c 

L 

X 

X 

H 

XXXX 

LLLL 

X 

X 

X 

X 

CLEAR  REG 

c 

L 

X 

H 

L 

HHHH 

HHHH 

X 

X 

X 

X 

LOAD  REG  (F) 

X 

L 

L 

L 

L 

HHHL 

HHHH 

H 

L 

H 

L 

COMPARE   (E  LT 

F) 

X 

L 

L 

L 

L 

HHHH 

HHHH 

L 

H 

L 

L 

COMPARE   (F  EQ 

F) 

X 

L 

X 

L 

L 

XXXX 

HHHH 

X 

X 

X 

X 

READ  REG  (F) 

c 

L 

X 

L 

L 

XXXX 

HHHH 

X 

X 

X 

X 

HOLD  (F) 

X 

H 

X 

X 

X 

XXXX 

ZZZZ 

X 

X 

X 

X 

TEST  HI-Z  (/0C1=H) 

X 

X 

H 

X 

X 

XXXX 

XXXX 

z 

z 

z 

z 

TEST  HI-Z  (/0C2=H) 

Figure  19.  PAL16X4  Based  Four-Bit  Comparator  (Cont'd.) 


The  values  of  the  comparands  themselves  affect  the  number  of 
product  terms  used.  When  the  comparison  is  made  with  compar- 
ands which  are  power-of-two  numbers,  the  number  of  product 
terms  required  can  be  reduced  drastically.  This  essentially  relies 
on  the  fact  that  when  the  lower  bits  of  a  comparand  are  all  zeros 
only  the  highest  bit  needs  to  be  compared,  requiring  only  one 
product  term.  For  example  in  a  two-bit  comparator,  if  A1  is  zero 
and  A2  is  one,  the  equation  for  the  greater-than  function  becomes 
very  simple  and  requires  only  one  product  term: 

GTR  =  /B2 

The  general  equation  for  the  GTR  signal  can  also  be  simplified 
when  comparing  a  number  B  to  a  fixed  power-of-two  comparand 
A  with  p  least  significant  zeros. 

A  =  000010000  ...  00 
n      p  1 

GTR  =     /Bn  *  /Bn-1  ...  *  /Bp+1  *  /Bp 


This  general  GTR  equation  can  also  be  considered  as  an  equa- 
tion for  comparing  a  number  to  a  range  of  numbers  extending 
from  zero  to  number  A.  In  factthis  trick  is  used  very  often  by  many 
system  designers  for  address  decoder  functions.  In  PLD  Design 
Methodology  (page  2-22)  the  ROMCS1  signal  is  one  such  signal 
which  is  generated  for  the  address  range  from  (000000)  hex  to 
(0FFFFF)  hex.  Forthis  design  n=23,  the  comparand  A=(0FFFFF 
+  1  )=1 00000  and  p=21 .  Substituting  in  the  general  equation  we 
get  the  same  address  decoder  Boolean  logic  equation. 

ROMCSl  =  /A23  *  /A22  *  /A21 

As  such  designs  require  few  product  terms  and  no  XOR  gates, 
they  are  efficiently  implemented  on  standard  combinatorial  PLDs. 
A  list  of  such  standard  combinatorial  PLDs  is  shown  in  Figure  20. 
A  general  form  of  range  comparators  with  two  boundary  compar- 
ands will  be  discussed  later. 
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DEVICE 

TOTAL 

DEDICATED 

DEDICATED 

I/O 

LOGICAL 

SPEED  GRADES 

NUMBER 

INPUT 

OUTPUT 

PINS 

PRODUCT 

t„„  IN  ns 

PD 

OF  PINS 

PINS 

PINS 

TERMS/ 

OUTPUT 

TTL 

1 6L8 

20 

10 

2 

6 

7 

10,  15,  25,  35 

16P8 

20 

10 

2 

6 

7 

25 

18P8 

20 

10 

8 

8 

15,25 

20L8 

24 

14 

2 

6 

7 

15,  25,35 

20L10 

24 

12 

2 

8 

3 

15,20,  25,30 

20S10 

24 

12 

2 

8 

8-16 

35 

22P10 

24 

12 

10 

8 

15,25 

22XP10 

24 

12 

10 

8 

20,  30,  40 

22RX8 

24 

14 

8 

8 

25 

24 

12 

10 

8-16 

25,  35 

32VX10 

24 

12 

10 

8-16 

25,30 

CMOS 

C16L8 

20 

10 

2 

6 

7 

25 

C20L8 

24 

14 

2 

6 

7 

35,45 

C22V10 

24 

12 

10 

8-16 

25,  35 

C29M16 

24 

5 

16 

8-16 

35,  45 

C29MA16 

24 

5 

16 

4-12 

35,  45 

Figure  20.  Standard  Combinatorial  PLDs  with  Sum-ot-Products  Logic 


The  third  output  signal  is  the  EQL  signal.  The  EQL  Boolean 
equation  tells  us  whether  the  two  numbers  are  identical.  Such 
information  is  useful  not  only  in  address  decoders,  but  also  in 
digital  signal  processing  designs.  This  equation  requires  a  large 
number  of  product  terms.  A  closer  examination  reveals  that  it  is 
essentially  an  exclusive-OR  function. 


EQL 


+  A?  * 


EQL  =  (Al 


/B2  ' 
B2  I 

»:  Bl) 


(/Al 
(/Al 

'  (A2 


/Bl  +  Al 
/Bl  +  Al 


Bl) 
Bl) 


B2)   ;  Exclusive-NOR 
;  function 


Inverting  this: 

/EQL  =   (Al   :  +  :  Bl)   +  (A2 


B2) 


Exclusive-OR 
function 


This  equation  can  be  extended  to  give  a  general  equation  for 
equal-to  comparison  for  two  n-bit  comparands. 

/EQL  =  (An  :  +  :  Bn) 

+  (An-1   :  +  :  Bn-1) 

+  (An-2  :+:  Bn-2) 

+  (An-3  :+:  Bn-3) 
+ 

+  ... 

+  (Al  :  +  :  Bl) 


This  inverted  equation  is  implementable  in  the  sum-of -products 
form  of  the  exclusive-OR  functions,  and  can  be  easily  expanded 
to  the  following: 

/EQL  =  Al  *  /Bl  +  /Al  *  Bl 
+  A2  *  /B2  +  /A2  *  B2 
+  A3  *  /B3  +  /A3  *  B3 

+  ... 
+  ... 

+  An  *  /Bn  +  /An  *  Bn 

This  gives  us  a  general  sum-of-products  form  of  a  comparator 
equation  which  is  easily  implemented  in  PAL  devices.  An  n-bit 
comparator  requires  2n  product  terms.  An  eight-bit  comparator 
is  implemented  in  the  design  shown  in  Figure  21.  This  design 
uses  the  PAL16C1,  which  is  specifically  designed  for  comparator 
applications,  and  provides  a  large  number  of  product  terms 
connected  to  EQL  and  /EQL  (NE)  outputs.  All  of  the  devices  that 
have  a  large  number  of  product  terms  are  illustrated  in  Figure  18, 
and  would  be  appropriate  for  such  designs. 
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Title  OctalComparator 
Pattern  OctComp.pds 
Revision  A 

Author      Mehrnaz  Hada 

Company  Monolithic  Memories  Inc.,  Santa  Clara, CA 
Date  1/29/85 


;The  octal  comparator  establishes  when  two  8-bit  data 
;strings  (A7-A0)  and  (B7-B0)  are  equivalent  ( EQ=H )  or  not 
;equivalent   ( NE=H ) . 


CHIP  OctalCom  PAL16C1 


A7  AO  BO  Al  Bl  A2  B2  A3  B3  GND 
A4  B4  A5  B5  EQ  NE  A6  B6  B7  VCC 


EQUATIONS 


A0*/B0 

+ 

/AO* 

BO 

AO 

:  +  : 

BO 

+ 

A1*/B1 

+ 

/Al* 
/A2* 

Bl 

Al 

:  + : 

Bl 

+ 

A2*/B2 

+ 

B2 

A2 

:  + : 

B2 

+ 

A3*/B3 

+ 

/A3* 

B3 

A3 

:  + : 

B3 

+ 

A4*/B4 

+ 

/A4* 

B4 

A4 

:  + : 

B4 

+ 

A5*/B5 

+ 

/A5* 

B5 

A5 

:  + : 

B5 

+ 

A6*/B6 

+ 

/A6* 

B6 

A6 

:  +  : 

B6 

+ 

A7*/B7 

+ 

/A7* 

B7 

A7 

:  +  : 

B7 

SIMULATION 


TRACE_ON  A7  A6  A5  A4  A3  A2  Al  AO  NE 
B7  B6  B5  B4  B3  B2  Bl  BO 


SETF     A7  /A6  /A5  /A4  /A3  /A2  /Al  /AO  ;A7=H,  B7=L 

/B7  /B6  /B5  /B4  /B3  /B2  /Bl  /BO 

SETF  /A7  A6  ;A6=H,  B6=L 

SETF  /A6  A5  ;A5=H,  B5=L 

SETF  /A5  A4  ;A4=H,  B4=L 

SETF  /A4  A3  ;A3=H,  B3=L 

SETF  /A3  A2  ;A2=H,  B2=L 

SETF  /A2  Al  ;A1=H,  B1=L 

SETF  /Al  AO  ;AO=H,  B0=L 

SETF  /A7  /A6  /A5  /A4  /A3  /A2  /Al  /AO  ;A7=L,  B7=H 

B7 

SETF  /B7  B6  ;A6=L,  B6=H 

SETF  /B6  B5  ;A5=L,  B5=L 

SETF  /B5  B4  ;A4=L,  B4=H 

SETF  /B4   B3  ;A3=L,  B3=H 

SETF  /B3  B2  ;A2=L,  B2=H 

SETF  /B2  Bl  ;A1=L,  B1=H 

SETF  /Bl   BO  ;A0=L,  B0=H 

SETF  /BO  ;Test  all  L's 

SETF  A7  A6  A5  A4  A3  A2  Al  AO  ;Test  all  H's 

B7  B6  B5  B4  B3  B2  Bl  BO 

SETF  /A7  A6  /AS  A4  /A3  A2  /Al  AO  ;Test  even  ones 

/B7  B6  /B5  B4  /B3  B2  /Bl  BO 

SETF  A7   /A6  A5  /A4  A3  /A2  Al  /AO  ;Test  odd  ones 

B7  /B6  B5  /B4  B3  /B2  Bl  /BO 

TRACE  OFF 


Figure  21.  Octal  Comparator  Design  Example 
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Note  that  the  EQL  equation,  as  well  as  GTR  and  LES  equations, 
rely  upon  the  XOR  function.  Often  the  logic  represented  by  the 
equations  is  implemented  in  two  or  more  devices.  There  are 
several  PAL  devices  that  provide  dedicated  XOR  capability,  and 
can  efficiently  implement  the  logic.  These  devices,  listed  in  Figure 
22,  can  be  used  for  comparator  applications.  The  PAL32VX10 
and  the  PAL22RX8  offer  an  architecture  with  a  multiple  number 
of  product  terms  on  one  side  of  the  XOR  gate,  which  allows  a 
number  to  be  compared  against  multiple  comparands.  The  de- 


sign example  in  Figure  1 9  shows  the  XOR  implementation  of  the 
LES,  GTR  and  EQL  functions. 

Let  us  analyze  these  equations  further.  The  LES  and  GTR 
outputs  indicate  whether  one  number  is  greater  than  or  less  than 
another.  In  fact,  these  equations  can  also  be  judiciously  com- 
bined to  get  a  comparison  of  a  range  of  numbers  such  as 
A  >  X  >  B.  Such  range  comparisons  are  very  useful  for  address 
decoder  circuits. 


DEVICE 

TOTAL 
NUMBER 
OF  PINS 

DEDICATED 
INPUT 
PINS 

DEDICATED 
REGISTERED 
OUTPUT 
PINS 

I/O 
PINS 

LOGICAL 
PRODUCT 
TERMS/ 
OUTPUT 

SPEED  GRADES 
FREQUENCY  IN 
MHz 

^MAX 

TTL 

16X4 

20 

8 

4 

4 

8 

14 

20X10 
20X8 
20X4 

24 
24 
24 

10 
10 
10 

10 
8 
4 

2 
6 

4 
4 
4 

22.2 
22.2 
22.2 

22XP10 

24 

12 

10 

8 

tro- 
20,  30,  40 

20XRPK 
20XRP8 
20XRP6 
20XRP4 

1 

24 
24 
24 
24 

10 
10 
10 
10 

10 
8 
6 
4 

2 
4 
6 

8 
8 
8 
8 

30,  22.2,  14 
30,  22.2, 14 
30,  22.2,  14 
30,  22.2,  14 

22RX8 
32VX10 

24 
24 

14 
12 

0-8 
0-10 

8-0 
10-0 

8 
8-16 

28.5 
25,  22.2 

Figure  22.  PAL  Devices  with  XOR  Capability  for  Comparators 
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Range  Decoders 

Range  decoders  implemented  as  address  decoders  are  one  of 
the  most  commonly  used  applications  of  PLDs  in  digital  systems. 
A  good  example  is  the  address  decoder  illustrated  earlier  on  page 
2-22,  in  the  PLD  Design  Methodology  section.  Range  decoders 
compare  a  number  (address)  to  a  given  range  of  comparands 
(addresses).  One  way  to  arrive  at  the  range  decoder  Boolean 
equations  is  to  use  the  traditional  truth  table  approach.  Another 
way  is  to  use  the  Boolean  equations  generated  earlier  in  the 
comparator  section  for  greater-than  and  less-than  functions.  To 
decode  a  range  of  three-bit  numbers  from  B  to  A,  we  must 
compare  another  number  X  such  that  A>X>B.  The  Boolean 
equations  for  the  GTR  (A>X)  and  LES  (B<X)  functions  are 
illustrated  below: 

GTR  =    A3  *  /X3 

+  (A3  :  +  :  /X3)  *  A2  *  /X2 

+   (A3  :  +  :  /X3)   *   (A2  :  +  :  /X2)   *  Al  *  /XI 


Combining  these  two  equations  can  give  us  a  range  signal  which 
will  be  asserted  only  when  Ais  greater  than  X  and  X  is  greaterthan 
B.  The  combined  Boolean  equation  follows: 

RANG  -   (  A3  *  /X3 

+   (A3  :+:   /X3>   *  A2  *  /X2 

+   (A3  :+:  /X3)   *   (A2  :+:   /X2)   *  Al  *  /XI  ) 
* 

(  X3  *  /B3 
+  (X3  :+:  /B3)   *  X2  *  /B2 

+   (X3  :+:  /B3)   *   (X2  :+:  /B2)   *  XI  *  /Bl  ) 


LES  =    X3  *  /B3 

+   (X3  :  +  :  /B3)   *  X2  *  /B2 

+   (X3  :+:  /B3)   *   <X2   :+:  /B2)   *  XI  *  /Bl 





Using  Boolean  algebra  we  get  the  following  equation: 
RANG  - 

(A3  :+:   /X3)   *   <X3  :+:  /B3)   *   (A2   :+:  /X2)   *  Al  *  /XI  *  X2  *  /B2 
+  (A3  :+:  /X3)   *   <X3  :+:  /B3)   *   (X2  :+:   /B2)   *  A2  *  /X2  *  XI  *  /Bl 
+   (A3  :+:   /X3)   *   (A2  :+:  /X2)   *  Al  *  /XI  *  X3  *  /B3 
+   (X3  :+:  /B3)   *   <X2  :+:  /B2>   *  A3  *  /X3  *  XI  *  /Bl 
+   (A3  :+:  /X3)   *  A2  *  /X2  *  X3  *  /B3 
+   (X3  :+:   /B3)   *  A3  *  /X3  *  X2  *  /B2 

The  general  equation  for  n-bit  comparands  can  also  be  obtained  by  extending  these  equations. 

RANG  =  (  An  *  /Xn 

+  (An  :  +  :  /Xn)  *  An-1  *  /Xn-1 

+  (An  :  +  :  /Xn)  *  (An-1  :  +  :  /Xn-1)  *  An-2  *  /Xn-2 
+  ... 

+  ... 

+  (An  :  +  :  /Xn)  *  (An-1  :  +  :  /Xn-1)  ...  (A2  :+:  /X2)  *  Al  *  /XI) 

* 

(Xn  *  /Bn 

+  (Xn  :  +  :  /Bn)  *  Xn-1  *  /Bn-1 

+  (Xn  :  +  :  /Bn)  *  (Xn-1  :  +  :  /Bn-1)  *  Xn-2  *  /Bn-2 
+  ... 
+  .. 
+  ... 

+  (Xn  :  +  :  /Bn)  *  (Xn-1  :  +  :  /Bn-1)  ...  (X2  :  +  :  /B2)  *  XI  *  /Bl) 
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The  number  of  product  terms  required  is  clearly  very  large  and 
can  easily  exceed  one  hundred  for  an  eight-bit  range  comparator. 
Most  microprocessors  have  addresses  which  exceed  16  bits.  In 
order  to  fit  the  design  on  a  PAL  device,  one  commonly  used 
technique  is  to  select  the  address  range  defined  by  A  and  B  such 
that  the  range  extends  from  address  B+1  toA-1 ,  where  Aand  B+1 
are  power-of-two  numbers.  Because  the  address  space  is 
aligned  on  the  power-of-two  boundaries,  a  number  of  bits  of  the 
address  comparands  will  be  zero.  When  implemented  in  Boolean 
equations,  this  substantially  reduces  the  number  of  product  terms 
required. 

The  maximum  number  of  product  terms  required  for  a  three-bit 
range  decoder  shown  above,  with  any  comparand  values,  is  28. 
If  the  address  is  chosen  from  2  to  3,  resulting  in  A=4  and  B=1 ,  then 
only  one  product  term  will  be  required. 


/: 


3  *  X2 


RANG 


Similarly  for  a  range  from  one  to  five,  B  =  1  and  A  =  6  (a  multiple- 
of-two),  and  the  number  of  product  terms  required  is  only  two. 


RANG  =  /X3 
+  X3 


X2 
/X2 


Thus  a  careful  selection  of  range  boundaries  allows  such  logic 
functions  to  be  implemented  easily  in  PLDs.  Such  reduction  in 
logic  obviously  also  holds  true  for  discrete  implementations.  Most 
address  decoders  are  designed  with  address  ranges  with 
boundariesthat  are  power-of-two  numbers,  and  require  few  prod- 
uct terms  for  implementation. 


For  a  power-of-two  address  range,  the  comparand  A  would  be  a 
power-of-two;  2,  4  or  8.  These  are  numbers  whose  least  signifi- 
cant bits  are  all  zeros.  Similarly,  comparand  B  will  be  a  power-of- 
two  number  (minus  one);  1 , 3,  and  7.  These  are  numbers  whose 
least  significant  bits  are  all  ones.  Substituting  these  in  the  general 
equation  for  range  comparators  we  arrive  at: 


oooo 


00000000000 

I  1 


B  =  0000000000011111 


RANG  =  /Xn  *  /Xn-1  *...*  /Xp  *  (Xp-1  +  Xp-2  +  ...  +Xq) 

This  is  a  general  equation  for  a  power-of-two  range  comparison. 
In  the  example  on  page  2-22,  the  ROMCS2  signal  addresses  the 
range  from  100000  to  1FFFFF,  in  which  case  B=0FFFFF  and 
A=2FFFFF  Here  n=23,  p=22  and  q=21 .  The  address  decode 
equation  for  the  ROMCS2  signal  can  be  arrived  at  by  substituting; 


ROMCS2 


/A23  *  /A22  *  A21 


This  is  the  same  equation  as  was  found  from  the  truth  table. 

Such  designs  are  very  common  for  address  decoder  applica- 
tions. These  do  not  require  any  XOR  gates,  and  can  be  imple- 
mented in  standard  combinatorial  PLDs  with  only  sum-of-prod- 
ucts  logic.  A  list  of  such  PLDs  is  shown  in  Figure  20. 


Adders/Arithmetic  Circuits 

Digital  systems  are  designed  to  carry  out  a  variety  of  arithmetic 
instructions  on  binary  numerical  data.  A  good  example  is  the 
ALU  (Arithmetic  Logic  Unit)  used  in  all  digital  computers.  The 
basic  function  of  an  ALU  is  that  of  an  adder  performing  addition 
on  two  binary  numbers.  A  binary  adder  takes  two  inputs,  adds 
them,  and  generates  the  binary  sum.  A  full  adder  is  a  one-bit 
adder  with  carry-in  and  carry-out;  this  is  the  basic  building  block 
of  any  adding  circuit.  The  truth  table  of  such  an  adder  is  shown 
in  Figure  23. 


INPUTS 

OUTPUTS 

A            B  C,N 

v  com 

0              0  0 
0             0  1 
0             1  0 

0  1  1 

1  0  0 
1              0  1 
1               1  0 

1         1  1 

0  0 

1  0 
1  0 

0  1 

1  0 
0  1 

0  1 

1  1 

Figure  23.  Truth  Table  for  a  Full  Adder 

This  truth  table  is  then  used  to  form  the  Boolean  equations  in  the 
manner  described  earlier. 


=  A 

+  /A 
+  A 
+  /A 


Cout 


■  A 
+  A 
+  B 


/B  *  /Cin 
/B  *  Cin 
B  *  Cin 
B  *  /Cin 

Cin 
B 

Cin 


Bigger  binary  adders  can  be  made  by  cascading  these  full 
adders.  Each  carry-out  is  directed  to  carry-in  for  the  next  stage. 
Such  adders  are  known  as  Ripple  Adders. 

Combinatorial  PAL  devices  are  ideal  for  this  purpose,  since  they 
also  provide  internal  feedback.  Thus  one  strong  consideration  in 
such  designs  is  the  internal  feedback  capability  of  the  device,  in 
addition  to  other  general  device  selection  considerations. 

These  ripple  adders  have  an  advantage  in  that  they  can  be 
cascaded  to  any  length  possible.  But  since  the  carry-out  from  the 
least  significant  bit  has  to  travel  all  the  way  to  the  highest 
significant  bit,  this  can  take  a  long  time,  making  such  large  adders 
inefficient.  Adders  with  built  in  carry-look-ahead  circuitry  can 
save  time  by  simultaneously  generating  the  carry-in  signal  for  all 
of  the  bits. 
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Rewriting  the  equations  for  a  full  adder  from  above  gives: 
YO  -  AO  :+:  BO  :+:  Cin 
where  the  carry-out  signal  is: 

CO  =  AO  *  BO  +    (AO  +  BO)    *  Cin 

Extending  these  equations  for  an  n-bit  carry-look-ahead  adder, 
we  can  directly  get  the  following  equations: 

YO  =  AO  :+:  BO  :+:  Cin 

Yl  -  Al   :+:  Bl  :+:  CO 

where 

CO  =  AO  *  BO 

+  (AO  +  BO)  *  Cin 

Y2  =  A2  :+:  B2  :+:  CI 

where 

Cl  -  Al  *  Bl 

+  (A1  +  B1)*(A0  *  BO) 

+  (A1  +  B1)*(A0  +  BO)   *  Cin 

Y3  =  A3  :  +  :  B3  :  +  :  C2 

where 

C2  =  A2  *  B2 

+  (A2  +  B2)  *  (Al  *  Bl) 

+  (A2  +  B2)*(A1  +  B1)*(A0  *  BO) 

+  (A2  +  B2)*(A1  +  B1)*(A0  +  BO)   *  Cin 

In  general 

Yn  =  An  :  +  :  Bn  :  +  :  Cn-1 

and  Cn-1  =  An-1  *  Bn-1 

+  (An-l  +  Bn-l)*(An-2*  Bn-2) 

+  ... 
+  ... 

+  (»n-l  +  Bn-1)  *  ...  *  (AO  +  BO)  *  Cin 

and  finally  the  carry-out  is: 

Cn  =  An  *  Bn 

+  (An  +  Bn)  *  (An-1  *  Bn-1) 

+  (An  +  Bn)  *  (An-1  +  Bn-1)  *  (An-2  *  Bn-2) 

+  ... 

+  ... 

+  (An  +  Bn)  *  ...  *  (AO  +  BO)  *  Cin 

These  equations  are  essentially  a  combination  of  the  traditional 
generate  and  propagate  logic  for  ALU  design. 


Adder  Device  Selection  Considerations 

The  number  of  product  terms  required  for  implementing  a  carry- 
look-ahead  adder  is  enormous.  The  carry-out  function  alone  for 
a  four-bit  carry-look-ahead  adder  requires  over  36  product  terms 
in  the  sum-of-products  form.  For  a  single-level  AND-OR  implem- 
entation the  number  of  product  terms  required  for  the  most 
significant  bit  Y3  is  28.  This  will  not  fit  into  any  of  the  available 
standard  combinatorial  devices.  A  PLD  with  this  many  product 
terms  per  output  will  not  only  be  slow,  but  will  also  allow  very  low 
silicon  utilization. 

Special  circuitry,  known  as  bit-pair  decoding,  has  been  provided 
on  the  PAL1 6X4 to  allow  implementation  of  such  functions  with  up 
tosixproductterms.  Designed primarilyforarithmeticoperations, 
the  PAL1 6X4  device  has  an  architecture  which  implements  three 
different  stages  of  logic.  The  first  is  the  bit-pair  decoding  between 
an  input  and  a  feedback.  The  second  is  the  standard  AND-OR 
array  logic.  The  third  is  the  XOR  gate  provided  on  selected 
outputs. 

Although  adders  are  strictly  a  combinatorial  function,  the 
PAL16X4  provides  four  registered  outputs.  Not  only  are  these 
registers  used  for  storing  the  result,  but  they  are  also  used 
temporarily  to  hold  one  of  the  two  operands  being  added.  The 
other  operand  is  provided  directly  at  the  inputs.  To  keep  the 
discussion  simple  we  will  not  discuss  how  the  first  operand  is 
loaded  into  the  registers.  This  load  function  can  be  implemented 
by  using  an  extra  product  term  on  each  output. 

The  logic  trick  lies  in  the  bit-pair  decoding  function.  All  of  the  bits 
of  the  first  operand  in  the  registers  (A)  and  the  second  operand  at 
the  inputs  (B)  are  bit-pair  decoded.  As  illustrated  in  Figure  24,  the 
results  of  this  bit-pair  decoding  are  A  +  B,  A  +  /B,  /A  +  B,  and 
/A  +  /B.  These  outputs  are  then  fed  to  the  AND  array  as  inputs. 


Figure  24.  Bit-Pair-Decoding  Function 
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Sixteen  AND  combinations  of  these  tour  inputs  can  then  be 
formed  on  every  product  term  of  the  AND-OR  array.  These  are 
shown  in  Figure  25,  and  include  the  standard  true  and  comple- 
ments of  both  the  bits  as  well  as  XOR,  XNOR  and  various  other 
combinations.  This  bit  pair  decoding  essentially  provides  an  extra 
two-level  AND-OR  logic  level  before  the  AN  D-OR  array.  The  cost 
as  well  as  extra  propagation  delay  of  the  extra  logic  level  is 
minimal,  since  the  array  size  does  not  increase. 

The  equationsfor  the  adder  can  obviously  benefit  from  multi-level 
logic.  The  bit-pair  decoding  is  used  to  implement  the  first  two 


levels  of  logic.  The  next  level  of  logic  is  implemented  in  the 
standard  AND-OR  array.  Every  product  term  of  this  AND-OR 
array  can  combine  one  of  the  sixteen  possible  functions  of 
different  inputs/feedbacks  of  the  device. 

The  product  terms  are  then  combined  together  through  an  OR 
gate  to  implement  the  CARRY-OUTfunction,  shown  in  Figure  26. 
For  adder  outputs  YO,  Y1,  Y2,  and  Y3,  the  product  terms  are 
combined  through  the  XOR  gate,  as  shown  in  Figure  27.  This 
XOR  gate  on  the  outputs  is  the  final  logic  level  on  the  device. 


-o 

-D- 
-O 
-O 
-O 
-D- 

o- 

-O 


PRODUCT  TERMS 


-o 

-D- 
-O 
-D- 
-O 
O- 
-D- 
-O 


A  +  B 


A  +E 


A:':B 
A'B 
A  ♦  B 

A:  +  :B 


A  *  B 


Figure  25.  Sixteen  Possible  Input  Logic  Combinations 
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A3*B3 
(A3  +  B3)  *A2*  B2 
(A3  +  B3)'(A2  +  B2)'A1'B1 
(A3  +  B3)  •  (A2  +  B2)*  (A,  +B0  -Ao'Bo 
(A3  +  B3)  •  (A2  +  B2)  *  (A,  +  B, )  1  (A0 *  B0 )  *  Cin 


A3 


A, 


Ac 


C',„ 


ts3 

B2 
I 

til 

BIT  PAIR 
DECODING 

BIT  PAIR 
DECODING 

BIT  PAIR 
DECODING 

BIT  PAIR 
DECODING 

7 

> 


p>    o    n    M     cjrgojevi  *-t-t-    *-  0000 

micocoicomimcaicD  mlmmlm  mimmm 

+   +  +   ++  +  +  +  +  +  ++  ++  +  + 

COCT«C10JC\J<MC\l  OOOO 

<   <l<|<    <<l<|<  <<KI<  <  <|<l< 


CARRY-OUT  =    A3  *  B3 

(A3+  B3)  •  A2  *B2 

(A3+B3) '(Aj+BaJ'A^B,  40526 

(As+BaJ^Az+BzJMAi+B,)*  Ao'Bo 

(A3  +  B3)  •  (A2+  B2) '  (A,  +  B,) '  (A0 '  Bo)  *C in 

Figure  26.  Implementation  of  CARRY-OUT  Function 


A3 :+:  B3 


(A2-  B2) 
(A2+B2)-(A,'B,) 
(A2  +  B2)  *(Ai+B,)*  (Ao'Bo) 
(A2+B2)'(A1  +  B1)'(Ao'B0)'Cin 


eoncoco  evj    <N    cvj    t\i  •—--.-»-  OOOO 

On  i  CD    03  |CQ  CD  |C0  CO  |C0  CO|CD   CD|CO  CO|CQ  CD  |  CD 

+    ++    +  +    +    +    +  ++     +    +  ++    +  + 

enoeio  cvj   cm    c\j   t\i  t-  t-   t-  oooo  .E.E 

<<KI<  <<|<|<  <<KI<  <<KI<  OIO 


Y3  =  A3:+:B3 

:+:«A2*Bz) 
+(A2+  B2)  *  (A1 "  B1) 
+(A2  +  B2)'(A1  +  B,),(Ao*B0) 
+(A2+B2)'(A1  +  Bl)*(A0-Bo)*Cin) 

Figure  27.  Relationship  Between  Adder  Boolean  Equation  and  PAL16X4  Device  Logic 
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Thus  the  most  significant  bit  of  the  output  can  be  implemented 
using  only  five  product  terms;  the  carry-out  function  can  be 
implemented  using  only  six  product  terms.  All  of  this  is  possible 
because  he  PAL1 6X4  offers  a  modified  programmable  array  with 
hardware  dedicated  to  bit-pair  decoding  at  the  inputs  and  an  XOR 
gate  at  the  outputs. 

PAL16X4 
PNEW 
ADDER 

HMI   SUNNYVALE,  CALIFORNIA 


Figure  28  illustrates  an  example  of  one  such  adder,  which  also 
includes  the  logic  required  for  loading  the  registers  with  one  of  the 
operands. 


PAL  DESIGN  SPECIFICATION 
J.   ENGINEER  9/15/87 


CLK  LOAD  ADD  BO  Bl  B2  B3 
CIN  NC       NC     A3  A2  Al  AO 


; DESCRIPTION 


NC  NC  GND 
G     P  VCC 


; THE  DEVICE  FUNCTIONS  AS  AN  ADDER.    IT  FIRST  STORES  ONE  OPERAND  IN  THE  FOUR- 
;BIT  OUTPUT  REGISTER  UNDER  THE  CONTROL  OF  LOAD  SIGNAL.   USING  FEEDBACK  AND  A 
; SECOND  OPERAND  AT  THE  INPUTS,    IT  ADDS  THE  TWO  UNDER  CONTROL  OF  SIGNAL  ADD. 
; THE  RESULTS  ARE  STORED  BACK   IN  THE  OUTPUT  REGISTERS.   THE  DEVICE  ALSO 
; GENERATES  THE  PROPAGATE  AND  GENERATE  SIGNALS  WHICH  ALLOW  A  NUMBER  OF 
; SIMILAR  DEVICES  TO  BE  CASCADED. 


IF(VCC) 


P  =  (A3  +  B3) 

*  (A2   +  B2) 

*  (Al   +  Bl) 

*  (AO   +  BO) 


IF(VCC)  G 


( A3*B3 ) 


PROPAGATE  SIGNAL 


GENERATE  SIGNAL 


(A3 
(A3 
(A3 


B3) 
B3) 
B3) 


(A2*B2) 
(A2  +  B2) 
(A2  +  B2) 


(A1*B1 ) 

(Al   *   Bl)    *  (A0*B0) 


CARRY 0 . EQU     ADD  *  CIN 


CARRY  1 .  EQU     ADD  *  (A0*B0) 

+  ADD  *    (AO   +  BO)    *  CIN 


CARRY 2. EQU  ADD 
+  ADD 
+  ADD 


( A1*B1 ) 
(Al  +  Bl) 
(Al  +  Bl) 


CARRY 3 . EQU     ADD  *  (A2*B2) 

+  ADD  *  (A2  +  B2) 

+  ADD  *  (A2   +  B2) 

+  ADD  *  (A2  +  B2) 


( A0*BO) 
(AO  +  BO) 


( A1*B1 ) 
(Al   +  Bl) 
(Al   +  Bl) 


CIN 


(A0*B0) 
(AO  +  BO) 


CIN 


/A3 


/A2 


/Al 


/AO 


(/A3)*/LOAD  *  /ADD 
(/B3)*  LOAD 
(A3:*:B3)*AD 
:  CARRY3 


:=   (/A2)*/LOAD  * 
+   (/B2)*  LOAD 
+  (A2:*:B2)*ADD 
CARRY 2 


/ADD 


:=    (/Al)*/LOAD  *  /ADD 
+   (/Bl)*  LOAD 
+  (A1:*:B1)*ADD 

:  +  :  CARRY1 


(/A0)*/LOAD  * 
(/BO)*  LOAD 
(A0:*:B0)*ADD 
:  CARRYO 


/ADD 


HOLD  REG  A3 
LOAD  REG  B3 
ADD  A3  AND  B3 


; HOLD  REG  A 2 
; LOAD  REG  B2 
; ADD  A2  AND  B2 


; HOLD  REG  Al 
; LOAD  REG  Bl 
; ADD  Al  AND  Bl 


; HOLD  REG  AO 
; LOAD  REG  BO 
; ADD  AO  AND  BO 


FUNCTION  TABLE 

;   NOT   INCLUDED  HERE 


Figure  28.  A  Four-Bit  Adder  Implemented  in  a  PAL16X4 
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Latches 

PAL  devices  are  often  used  to  implement  latches.  One  of  the 
most  common  uses  for  latches  is  as  temporary  storage  for  data 
or  addresses.  PLD-based  latches  are  often  used  in  address 
decoders  to  assert  the  decoded  signal  for  long  durations.  These 
latches  are  also  very  useful  for  asynchronous  digital  designs,  and 
are  used  often  for  control  and  arbitration  functions. 

A  latch  is  essentially  a  simple  combinatorial  circuit  in  which  the 
output  is  a  function  of  inputs  and  feedback.  The  most  commonly 
used  latch  is  the  D-type  latch.  When  the  control  signal  latch- 
enable  (LEN)  is  HIGH,  the  latch  is  in  the  "transparent  mode"  and 
the  input  signal  /D  is  available  at  the  outputs.  When  the  LEN 
signal  is  LOW,  the  input  data  is  latched  on  the  outputs  and  is 
retained  until  LEN  goes  back  HIGH.  In  atypical  address  decoder, 
the  input  will  be  a  combination  of  various  address  signals, 
decoded  as  explained  earlier  for  range  comparators.  The  latch- 
ing signal  in  most  microprocessors  is  called  AS  (address  strobe) 
or  ALE  (address  latch  enable). 

The  truth  table  for  a  latch  can  be  derived  directly  from  this 
functional  description,  and  is  shown  in  Figure  29. 


INPUTS 

OUTPUTS 

ID 

LEN 

IQ 

0 

1 

0 

1 

1 

1 

X 

0 

/Q  (previous) 

are  prime  candidates  for  such  hazards.  As  the  input  changes,  the 
two  outputs  will  not  respond  simultaneously  and  change  state  at 
the  same  instant.  Although  this  will  not  change  the  steady-state 
output  of  the  circuit,  it  may  cause  a  spurious  pulse  or  a  "glitch". 
Such  hazards  are  even  more  dangerous  in  latches,  where  the 
glitch  can  cause  incorrect  data  to  be  latched. 

There  are  two  types  of  hazards,  static  and  dynamic.  Static 
hazards  occur  when  the  steady-state  output  of  combinatorial 
logic  is  not  supposed  to  change  due  to  an  input  transition,  but  a 
momentary  change  does  occur.  Such  a  glitch  can  be  further 
classified  as  a  static  1  or  a  static  0  hazard  as  shown  in  Figure  31 . 


STATIC  0  HAZARD  STATIC  1  HAZARD 


405  31 

Figure  31.  Static  Hazards 

Dynamic  hazards  involve  situations  where  the  the  steady-state 
output  is  supposed  to  change  due  to  an  input  transition.  The 
hazard  occurs  when  the  transient  output  changes  several  times 
before  settling.  Figure  32  shows  dynamic  hazards. 

_TLT 

405  32 

Figure  32.  Dynamic  Hazards 


Figure  29.  Truth  Table  of  a  Simple  Latch 

The  Boolean  equations  for  this  latch  can  be  directly  derived  from 
the  truth  table: 

/Q  =     /D  *  LEN 
+  /Q  *  /LEN 

The  logic  implementation  for  this  latch  is  shown  in  Figure  30. 


A  Karnaugh  map  is  a  very  good  way  of  detecting  hazard  condi- 
tions. When  trying  to  detect  a  static  0  or  static  1  hazard,  only  the 
mapping  of  the  zeros  and  the  ones,  respectively,  are  required. 
For  example,  the  latch  equations  in  Figure  30  can  be  mapped  to 
a  Karnaugh  map  shown  in  Figure  33.  The  relationship  between 
the  Karnaugh  mapping  and  the  Boolean  equation  product  terms 
is  also  illustrated. 


D- 
LEN  - 


5> 


Figure  30.  A  Transparent  Latch 


Hazards 


0 


Q.  D'LEN— Q 
+  Q-LEN  —Q 


1 


l_  STATIC  1 
HAZARD. 


COVERING 
"HAZARD 


Even  when  a  combinatorial  circuit  has  been  designed  correctly, 
it  may  still  have  erroneous  outputs  due  to  "hazards".  Hazards 
exist  because  physical  circuits  do  not  behave  ideally.  Combina- 
torial complementary  output  functions  based  on  the  same  inputs 


Figure  33.  Karnaugh  Map  for  Transparent  Latch  Design 
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The  possibility  of  a  hazard  exists  when  the  signal_LEN  changes. 
Initially,  when  D  and  LEN  are  HIGH,  the  output  Q  is  also  HIGH. 
When  LEN  switches  to  LOW,  it  is  possible  for  the  output  to  go 
LOW  momentarily.  This  is  because  when  LEN  goes  LOW  the  first 
product  term  is  disabled,  and  to  maintain  the  output  HIGH  the 
second  product  term  should  be  enabled  exactly  at  the  same 
instant.  Due  to  the  uneven  gate  delays  or  routing  conditions  on 
the  board,  these  two  events  will  not  take  place  simultaneously. 
This  is  a  static  1  hazard.  It  can  also  be  identified  directly  in  the 
Karnaugh  map  by  the  two  adjacent  but  disjoint  sets  of  ones, 
grouped  together  to  form  a  product  term  each. 

The  hazard  conditions  can  be  easily  avoided  in  the  PLDs  by 
providing  an  extra  cover  product  term.  This  product  term  is  shown 
with  a  dotted  line  in  the  Karnaugh  map.  This  third  product  term 
will  keep  the  output  asserted  during  the  transition  of  the  LEN 
signal,  when  the  control  changes  from  the  first  product  term  to  the 
second.  The  modified  Boolean  equation  is  shown  above. 


/Q  -  /D  *  LEN 
+  /Q  *  /LEN 

+  /D  *  /Q     (Cover  product  term) 

Devices  on  which  latches  are  implemented  need  to  provide 
output  feedback.  All  devices  with  I/O  pins  provide  this  necessary 
feedback.  The  only  other  consideration  for  selecting  a  device 
would  be  the  provision  of  sufficient  number  of  product  terms  for 
addressing  the  needs  of  glitch-free  and  testable  design. 

Acomplete  latch  design  implemented  in  a  PAL20L10  is  shown  in 
Figure  34.  Although  the  design  is  now  glitch-free,  the  cover 
product  term  introduces  further  complications  in  the  testability  of 
this  circuit.  This  design  does  not  have  the  additional  circuitry  to 
address  the  testability  requirements.  Details  on  designing  for 
testability  are  discussed  on  page  3-1  OS. 


TITLE  CLEAN  OCTAL  LATCH 

PATTERN  P7096 

REVISION  A 

AUTHOR  VINCENT  COLI 

COMPANY  MONOLITHIC  MEMORIES,  INC. 

DATE  03/10/83 

CHIP  OCT_LAT  PAL20L10 

G  NC  DO  Dl  D2  D3  D4  D5  D6  D7  NC  GND 
/OC  NC  Q7  Q6  Q5  Q4  Q3  Q2  Ql  QO  NC  VCC 

; DESCRIPTION 

;THIS  PAL16L8   IMPLEMENTS  AN  8-BIT  LATCH  FUNCTION  WITH  THREE-STATE  OUTPUTS. 
; THE  LATCH  PASSES  EIGHT  BITS  OF  DATA   (D7-D0)   TO  THE  EIGHT  OUTPUTS  (Q7-Q0) 
; WHEN  LATCH  ENABLE  IS  TRUE   (G=HIGH )  .     THE  DATA  IS  LATCHED  WHEN  LATCH  ENABLE 
;IS  FALSE   (G=LOW).     THE  OUTPUTS  WILL  BE  DISABLED   (HI-Z)   WHEN  OUTPUT  ENABLE 
;IS  TRUE   (/OC=TRUE)   REGARDLESS  OF  ANY  OTHER  INPUTS . 


/OC 

G 

D7-D0 

Q7-Q0 

COMMENTS 

H 

X 

X 

Z 

HI-Z 

L 

L 

X 

0 

LATCH  OUTPUT 

L 

H 

D 

D 

LOAD  LATCH 

;THIS  DESIGN  SHOWS  HOW  TO  IMPLEMENT  A  "CLEAN"  LATCH  SINCE  THERE  ARE  NO 
.•OUTPUT  GLITCHES  AS  THE  DEVICE  CHANGES  STATE. 


Figure  34.  Clean  Octal  Latch  Design 


^MonomhlaMMemorleS^ 
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EQUATIONS 

/QO  = 


G*/D0 
/G*/Q0 
+  /D0*/Q0 

QO.TRST  =  OC 


/Ql 


G*/D1 
+  /G*/Q1 
+  /D1*/Q1 


Ql.TRST  =  OC 

/Q2  =  G*/D2 
+  /G«/Q2 
+  /D2*/Q2 

Q2.TRST  =  OC 

/Q3  =  G*/D3 
+  /G*/Q3 

+  /D3*/Q3 
Q3.TRST  =  OC 


/Q4 


G*/D4 
+  /G*/Q4 
+  /D4*/Q4 


Q4.TRST  =  OC 


/Q5 


G*/D5 
+  /G*/QS 
+  /D5*/Q5 


Q5.TRST  =  OC 

/Q6  -  G*/D6 
+  /G*/Q6 
+  /D6*/Q6 

Q6.TRST  =  OC 

/Q7  =  G*/D7 
+  /G*/Q7 
+  /D7*/Q7 

Q7.TRST  =  OC 


; LOAD  LATCH   (QO ) 

; LATCH  OUTPUT 

,-COVER  ALWAYS  HIGH  HAZARD 

; OUTPUT  ENABLE 

;LOAD  LATCH  (Ql) 

; LATCH  OUTPUT 

; COVER  ALWAYS  HIGH  HAZARD 

; OUTPUT  ENABLE 

; LOAD  LATCH  (Q2) 

; LATCH  OUTPUT 

; COVER  ALWAYS  HIGH  HAZARD 

.■OUTPUT  ENABLE 

; LOAD  LATCH  (Q3) 
; LATCH  OUTPUT 

; COVER  ALWAYS  HIGH  HAZARD 

; OUTPUT  ENABLE 

; LOAD  LATCH  (Q4) 

; LATCH  OUTPUT 

; COVER  ALWAYS  HIGH  HAZARD 

; OUTPUT  ENABLE 

; LOAD  LATCH  (Q5) 

; LATCH  OUTPUT 

; COVER  ALWAYS  HIGH  HAZARD 

; OUTPUT  ENABLE 

; LOAD  LATCH  (Q6) 

; LATCH  OUTPUT 

,-COVER  ALWAYS  HIGH  HAZARD 

; OUTPUT  ENABLE 

; LOAD  LATCH  (Q7) 

; LATCH  OUTPUT 

; COVER  ALWAYS  HIGH  HAZARD 

; OUTPUT  ENABLE 


SIMULATION  NOT  INCLUDED  HERE 


Figure  34.  Clean  Octal  Latch  Design  (Cont'd.) 
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In  the  previous  section  we  discussed  combinatorial  designs, 
circuits  whose  outputs  are  totally  independent  of  any  system 
clock.  In  this  section  we  will  discuss  sequential  circuits,  where 
outputs  store  their  previous  values  until  a  new  clock  is  applied. 
The  storage  elements  which  retain  the  previous  output  values  are 


called  flip-flops.  A  bank  of  these  flip-flops  forms  a  register, 
although  individual  flip-flops  are  often  called  registers.  Monolithic 
Memories  and  Advanced  Micro  Devices  offer  a  number  of  PLDs 
which  provide  registered  outputs.  The  table  in  Figure  1  lists  all  of 
the  PLDs  which  provide  registered  outputs. 


DE\ 

ICE 

TOTAL 

DEDICATED 

DEDICATED 

I/O 

LOGICAL 

SPEED  GRADES 

NUMBER 

INPUT 

REGISTERED 

PINS 

PRODUCT 

FREQUENCY  IN 

OF  PINS 

PINS 

OUTPUT 

TERMS/ 

MHz 

PINS 

OUTPUT 

16RI 

I 

Q 
O 

O 

o 

o 

DO.D,  Of  ,  eLO,   I  O 

16R( 

20 

8 

6 

2 

8 

55.5,  37,  25,  16 

16R4 

20 

8 

4 

4 

8 

55.5,  37,  25,  16 

16RP8 

on 

D 

o 

g 

a 

oo  o 

C.C..C. 

16R 

»6 

on 

o 
o 

D 

o 
a 

Q 

o 

oo  o 

16R 

M 

on 

o 
o 

A 

4 

4 

Q 
O 

OO  O 

16RA8 

Q 

o 

n  q 
U-o 

d  n 
o-u 

4 

on 

16X- 

I 

20 

8 

4 

4 

8 

14 

23SI 

t 

20 

9 

4 

4 

8-12 

33.3,  28.5 

20R 

3 

24 

12 

8 

. 

8 

37,25,  16 

20R 

24 

12 

6 

2 

8 

37,  25,  16 

20R 

I 

24 

12 

4 

4 

8 

37,25,  16 

20X 

0 

24 

10 

10 

4 

22.2 

20X( 

24 

10 

8 

2 

4 

22.2 

20* 

; 

24 

10 

4 

6 

4 

22.2 

20RP10 

24 

10 

10 

8 

37,  25 

20RP8 

24 

10 

8 

2 

8 

37,  25 

20RP6 

24 

10 

6 

4 

8 

37,  25 

20RP4 

24 

10 

4 

6 

8 

37,  25 

20XRP10 

24 

10 

10 

8 

30,22.2,  14 

20XRP8 

24 

10 

8 

2 

8 

30,  22.2,  14 

20X1 

24 

10 

6 

4 

8 

30,22.2,  14 

20X 

3P4 

24 

10 

4 

6 

8 

30,  22.2,  14 

20RS10 

24 

10 

10 

8-16 

19.2 

20RS8 

24 

10 

8 

2 

8-16 

19.2 

20RS4 

24 

10 

4 

6 

8-16 

19.2 

20RA10 

24 

10 

0-10 

10-0 

4 

33,  20 

22RX8 

24 

14 

0-8 

8-0 

8 

28.5 

22V10 

24 

12 

0-10 

10-0 

8-16 

40,  28.5,  18 

32VX10 

24 

12 

0-10 

10-0 

8-16 

25,  22.2 

32R16 

40 

16 

16 

8-16 

16 

PLS167 

24 

14 

6 

Total  48 

33 

PLS168 

24 

12 

8 

Total  48 

33 

PLS 

05 

28 

16 

8 

Total  48 

37 

Figure  1.  Registered  PLDs 
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DEVICE 

TOTAL 

DEDICATED 

DEDICATED 

I/O 

LOGICAL 

SPEED  GRADES 

NUMBER 

INPUT 

REGISTERED 

PINS 

PRODUCT 

FREQUENCY  IN 

OF  PINS 

PINS 

OUTPUT 

TERMS/ 

MHz 

PINS 

OUTPUT 

^MAX 

CMOS 

C16R8 

20 

8 

8 

8 

28.5 

C16R6 

20 

8 

6 

2 

8 

28.5 

C16R4 

20 

8 

4 

4 

8 

28.5 

C20R8 

24 

12 

8 

8 

20,  15.3 

C20R6 

24 

12 

6 

2 

8 

20,  15.3 

C20R4 

24 

12 

4 

4 

8 

20,  15.3 

C22V10 

24 

12 

0-10 

10-0 

8-16 

33.3,  20 

C29M16 

24 

5 

16 

8-16 

20,  15 

C29MA16 

5 



16 





20,  15 

ECL 

10H20EV/EG8 

24 

12 

0-8 

8-0 

8-12 

125 

10020EV/EG8 

24 

12 

0-8 

8-0 

8-12 

125 

Figure  1.  Registered  PLDs  (Cont'd.) 


Before  we  discuss  purely  registered  designs,  let  us  take  a  look  at 
designs  which  combine  both  registered  and  combinatorial  por- 
tions. Registered  and  combinatorial  outputs  are  often  mixed  on 
a  single  device.  There  can  be  two  distinct  designs,  one  registered 
and  one  combinatorial  (often  glue  logic)  combined  on  a  single 
device  for  higher  integration.  There  may  also  be  a  design 
requirement  where  registered  outputs  need  to  be  decoded  using 
combinatorial  logic. 

There  are  a  number  of  devices  which  provide  both  registered  and 
combinatorial  outputs.  Some  devices  provide  programmable 


register  bypass,  which  allows  outputs  to  be  programmed  as 
registered  or  combinatorial.  Figure  2  shows  a  list  of  PLDs  with 
both  registered  and  combinatorial  outputs.  Figure  3  shows  a  list 
of  PLDs  with  programmable  register  bypass. 

In  most  design  software  packages  the  output  registers  are 
signified  by  the  ":="  assignment  symbol,  as  opposed  to  the  "=" 
sign  for  acombinatorial  output.  This  helps  easily  identify  registers 
in  each  equation.  In  devices  which  provide  outputs  configurable 
as  either  registered  or  combinatorial,  this  sign  is  also  used  by  the 
software  to  configure  the  outputs. 
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DEVICE 


TOTAL 
NUMBER 
OF  PINS 


DEDICATED 
INPUT 
PINS 


DEDICATED 
REGISTERED 
OUTPUT 
PINS 


I/O 
PINS 


LOGICAL 
PRODUCT 
TERMS/ 
OUTPUT 


SPEED  GRADES 
FREQUENCY  IN 
MHz 


TTL 


16R6 

16R4 

16RP6 

16RP4 

16RA8 

16X4 

23S8 


20 
20 
20 
20 
20 
20 
20 


6 
4 
6 
4 

0-8 
4 
4 


2 
4 
2 
4 

8-0 
4 
4 


8 
8 
8 
8 
4 
8 
8-12 


55.5,  37,  25,  16 
55.5,  37,  25,  16 
22.2 
22.2 
20 
14 

33.3,  28.5 


20R6 
20R4 
20X8 
20X4 


24 
24 
24 
24 


12 
12 
10 
10 


37,25,  16 
37,  25,  16 

22.2 

22.2 


20RP8 

20RP6 

20RP4 

20XRP8 

20XRP6 

20XRP4 


24 
24 
24 
24 
24 
24 


10 
10 
10 
10 
10 
10 


37,  25 
37,25 
37,  25 
30,  22.2,  14 
30,  22.2,  14 
30,  22.2,  14 


20RS8 
20RS4 


24 
24 


10 
10 


8-16 
8-16 


19.2 
19.2 


20RA10 
22RX8 
22V10 
32VX1 0 
32R16 


24 
24 
24 
24 
40 


10 
14 
12 
12 
16 


0-10 
0-8 
0-10 
0-10 
16 


10-0 
8-0 
10-0 
10-0 


4 

8 
8-16 
8-16 
8-16 


33,  20 
28.5 
40,  28.5,  1£ 
25,  22.2 
16 


CMOS 


C16R6 
C16R4 
C20R6 
C20R4 


C22V10 


20 
20 
24 
24 


24 


8 
8 
12 
12 


12 


0-10 


10-0 


8-16 


28.5 
28.5 
20,  15.3 
20,  15.3 


33.3,  20 


C29M16 
C29MA16 


24 
24 


16 
16 


8-16 
4-12 


20,  15 
20,  15 


ECL 


Latched 
10H20G8 
10H/020EG8 
10H/020EV8 


24 
24 
24 


12 
12 
12 


0-8 
0-8 
0-8 


8-0 
8-0 
8-0 


4-8 
8-12 
8-12 


N.A.  (latch) 
N.A.  (latch) 
125 


Figure  2.  PLDs  with  Both  Registered  and  Combinatorial  Outputs 


RBI 
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DEVICE 

TOTAL 

DEDICATED 

DEDICATED 

I/O 

LOGICAL 

SPEED  GRADES 

NUMBER 

INPUT 

REGISTERED 

PINS 

PRODUCT 

FREQUENCY  IN 

OF  PINS 

PINS 

OUTPUT 

TERMS/ 

MHz 

PINS 

OUTPUT 

^MAX 

TTL 

16RA8 

20 

8 

0-8 

8-0 

4 

20 

23S8 

20 

9 

4 

4 

8-12 

33.3,  28.5 

20RA10 

24 

10 

0-10 

10-0 

4 

33,  20 

22RX8 

24 

14 

0-8 

8-0 

8 

28.5 

22V10 

24 

12 

0-10 

10-0 

8-16 

40,  28.5,  18 

32VX10 

24 

12 

0-10 

10-0 

8-16 

25,  22.2 

32R16 

40 

16 

16 

8-16 

16 

CMOS 

C22V10 

24 

12 

0-10 

10-0 

8-16 

33.3,  20 

C29M16 

24 

5 

16 

8-16 

20,  15 

C29MA16 

24 

5 

16 

4-12 

20,  15 

ECL 

Latched 

10H20G8 

24 

12 

0-8 

8-0 

4-8 

N.A.  (latch) 

10H/020EG8 

24 

12 

0-8 

8-0 

8-12 

N.A.  (latch) 

10H/020EV8 

24 

12 

0-8 

8-0 

8-12 
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Figure  3.  PLDs  with  Programmable  Register  Bypass 


General  Device  Selection 
Considerations 

The  same  set  of  general  device  selection  considerations  dis- 
cussed in  the  PLD  design  methodology  section  (page  2-21 )  apply 
to  registered  designs.  The  list  of  items  which  must  be  considered 
is  repeated  in  Figure  4  for  convenience.  A  device  can  be 
conveniently  selected  from  the  table  of  registered  PLDs 
(Figure  1 )  based  upon  the  specific  input  and  output  requirements. 

1.  Number  of  input  pins 

2.  Number  of  output  pins 

3.  Number  of  I/O  pins 

4.  Device  speed 

5.  Device  power  requirements 

6.  Number  of  registers 

7.  Number  of  product  terms 

8.  Output  polarity  control 

Figure  4.  General  Device  Selection  Considerations 
Maximum  Frequency 

For  registered  designs,  speed  is  a  parameter  which  needs  careful 
consideration.  Most  combinatorial  designs  use  the  propagation 
delay  (tpp)  for  ensuring  that  enough  time  is  allowed  for  the  data 
from  the  inputs  to  appear  at  the  outputs.  In  registered  designs  the 
effects  of  the  clock  must  be  taken  into  account.  This  is  reflected 


in  the  maximum  frequency  (fu<x)  parameter.  The  flexibility  inher- 
ent in  PLD  design  provides  a  choice  of  configurations  from  which 
different  fUAJ<  parameters  can  be  calculated. 

In  the  first  type  of  design,  the  PLD  is  used  for  a  stand-alone 
registered  design.  In  order  to  decide  the  next  logic  level  of  the 
registers,  the  present  logic  level  needs  to  be  available  at  the 
inputs  of  the  registers  before  they  are  clocked  (Figure  5).  Under 
these  conditions  the  clock  period  is  limited  by  the  internal  delay 
from  the  flip-flop  outputs  through  the  internal  feedback  and  logic 
to  the  flip-flops  inputs  (tsu+tCF).  For  this  configuration: 

f^*"1/'*1,,,  +  'cr*  :  designated  the  internal  f^. 

CLK 


LOGIC 

V 

REGISTER 

A 

 S 

-<su  ► 

406  01 


'max    INTERNAL:  1/(tM+  tCF) 

Figure  5.  Internal  fMAX 


2-64 


Registered  Logic  Design 


The  second  type  of  system  configuration  is  when  a  number  of 
logic  devices  with  registers,  including  PLDs,  are  clocked  with  a 
common  clock.  This  is  probably  the  most  prevalent  configuration. 
In  this  case  the  registered  outputs  are  sent  off-chip  back  to  the 
device  inputs  or  to  the  inputs  of  a  second  device.  The  slowest 
path  required  to  set  up  an  input  (Figure  6)  is  the  sum  of  the  clock- 
to-output  delay  and  the  input  setup  time  for  the  external  signals 
(tsu  +  tCLK).  The  same  speed  grade  devices  are  typically  clocked 


together, 


which  means  that  the  tJU  and  tCL(<  of  all  the  devices  will 


usually  be  the  same. 
Thus  for  this  configuration: 
f^  "  V<t„  +  tW 

This  fMAX  is  the  maximum  frequency  when  external  feedback  is 
used  or  when  several  equivalent-speed  devices  are  used.  This 
fMAX  is  designated  the  external  fMAX.  The  tables  for  registered 
devices  show  this  f  parameter. 


input 


The  third 
case, 
feedback 
by  the 
However, 
limit. 

the  lesser 


type  of  design  is  a  simple  data  path  application.  In  this 
data  is  presented  to  the  flip-flop  and  clocked;  no 
is  employed  (Figure  7).  In  this  case,  the  period  is  limited 
sum  of  data  setup  time  and  data  hold  time  (tsu+th). 
i  the  minimum  clock  period  (tWH  +  tWL)  is  usually  a  stricter 
ird  'max'  designated  "fMAX,  no  feedback  will  be 

of: 


Thus 


1/  (t, 


Flip-flop  Types 

There  are  four  basic  types  of  flip-flops;  S-R,  J-K,  Tand  the  popular 
D-type.  Details  on  the  various  types  of  flip-flops  are  provided  in 
the  logic  reference  (page  6-8).  In  this  section  we  will  assume  that 
the  reader  is  familiar  with  these  flip-flops. 

Almost  all  registered  PLDs  provide  the  basic  D-type  flip-flops. 
D-type  flip-flops  are  the  simplest  to  design  with  and  will  be  used 
throughout  this  section.  Some  PLDs  provide  the  capability  of 
configuring  output  registers  as  either  D,  T,  J-K  or  S-R.  Configur- 
able flip-flops  in  some  cases  can  reduce  the  number  of  product 
terms  required  for  certain  designs.  The  effect  of  the  configurable 
flip-flops  will  be  discussed  wherever  relevant. 

Synchronous  vs  Asynchronous 

Registered  designs  can  be  easily  classified  into  two  categories; 
synchronous  and  asynchronous.  In  synchronous  designs  the 
clock  inputs  of  all  the  registers  are  tied  together  to  a  common 
clock.  This  is  the  most  commonly  used  type  of  registered  design. 
With  asynchronous  designs,  theflip-flops'clock  inputs  may  not  be 
tied  together,  and  the  clocks  may  be  gated  or  even  driven  by  other 
flip-flops.  We  will  first  discuss  synchronous  registered  designs 
and  then  asynchronous  registered  designs. 


J>  (SECOND  CHIP) 


->CLK  - 

'  MAX  EXTERNAL:  1(1^+  tCLK) 

Figure  6.  External  fMAX 

CLK 


f  MAX  NO  FEEDBACK:  1/(t5u+  t„)  OR  1/(twh  ♦  !«,) 

Figure  7.       with  No  Feedback 
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Synchronous  Registered  Designs 


Counters 


Synchronous  registered  designs  are  used  for  two  major  func- 
tions: data  handling  and  control.  Registered  synchronous  de- 
signsfordata  handling  include  counters  and  shift  registers.  There 
are  various  types  of  counters,  among  them  binary  counters, 
modulo  counters,  Johnson  counters,  and  Gray-code  counters. 
These  counters  are  differentiated  by  the  sequence  of  values 
through  which  the  counter  travels.  A  binary  counter  is  the 
simplest  form  of  a  counter,  and  is  used  most  often  for  data 
functions.  Any  system  requiring  a  regular  count  uses  a  binary 
counter.  Modulo,  Gray-code  and  Johnson  counters  are  also  used 
for  control. 

All  counters  are  actually  subsets  of  a  larger  class  of  digital  designs 
called  state  machines.  State  machines  are  discussed  in  detail  in 
the  next  chapter  of  this  handbook. 


Counters  are  the  most  commonly  used  sequential  circuits.  Aset 
of  registers  which  cycles  through  a  predetermined,  unvarying 
sequence  is  called  a  counter.  A  general  model  of  a  synchronous 
counter  is  illustrated  in  Figure  8.  This  shows  a  common  clock  to 
all  the  flip-flops,  whose  outputs  are  fed  back  to  a  combinatorial 
logic  array  called  the  next-state  (count)  decoder.  The  next  count 
is  generated  by  this  logic  based  upon  the  present  count  and 
control  inputs.  Most  PLDs  use  the  standard  sum-of-products 
form  of  array  for  this  logic. 

The  relationship  between  a  four-bit  counter  and  its  signal  timing 
diagram  is  illustrated  in  Figure  9.  The  counters  can  also  be 
represented  by  state  diagrams  (Figure  10).  The  state  diagrams 
are  bubble-and-arrow  diagrams,  with  each  bubble  representing  a 
count  value  and  each  arrow  a  transition  from  one  count  to  the 
next.  More  detail  on  state  diagrams  is  given  in  the  next  chapter 
on  state  machine  design.  For  counters,  the  state  diagrams  are  a 
convenient  representation  tool  and  will  be  used  in  the  discussion 
when  necessary. 


COMBINATORIAL 
LOGIC 


INPUTS 


NEXT 
STATE 
(COUNT) 
DECODER 


FLIP- 
FLOPS 


Figure  8.  General  Model  of  a  Counter 


CLK 


WLjmjiJWLnjijmjijiruijmjij 


-~L 


r 


x2 


~L 


STATE  0  1  2  3  4  5 
(COUNT) 


7      8      9      10     11     12     13     14     15      0      1       2  3 


Figure  9.  Timing  Diagram  of  a  Four-bit  Binary  Counter 
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Figure  10.  State  Diagram  of  a  Four-bit  Binary  Counter 


Binary  Counters 

Let  us  examine  a  four-bit  binary  counter.  The  truth  table  (also 
called  the  transition  table)  for  such  a  counter  is  given  in  Figure  1 1 . 
The  table  lists  the  next  state  values  of  all  the  output  registers 
based  upon  their  present  values. 


PRESENT  STATE 

NEXT  STATE 

X3 

X2 

X1 

xo 

X3 

X2 

X1 

xo 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

1 

1 

0 

0 

1 

1 

0 

1 

0 

0 

0 

1 

0 

0 

0 

1 

0 

1 

0 

1 

0 

1 

0 

1 

1 

0 

0 

1 

1 

0 

0 

1 

1 

1 

0 

1 

1 

1 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

1 

0 

1 

0 

0 

1 

0 

0 

1 

1 

0 

1 

1 

1 

0 

0 

1 

0 

0 

1 

0 

1 

1 

0 

1 

1 

1 

0 

1 

1 

0 

1 

1 

1 

1 

1 

1 

0 

0 

0 

0 

Figure  11.  The  Truth  Table  for  a  Four-bit  Binary  Counter 
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We  derive  Boolean  equations  for  each  bit  directly  from  the  above 
truth  table  by  collecting  all  the  product  terms  where  outputs  are 
asserted  HIGH  (ones).  This  yields: 


= 

/X3 

* 

X2 

* 

XI 

xo 

+ 

X3 

* 

/X2 

* 

/XI 

* 

/xo 

+ 

X3 
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/X2 

* 

/XI 

* 

xo 

+ 

X3 
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/X2 

* 

XI 

* 

/xo 

+ 

X3 

* 

/X2 

* 

XI 

* 

xo 

+ 

X3 

* 

X2 

* 

/XI  *  /xo 

X3 

ft 

X2 

*  /XI 

xo 

+ 

X3 

* 

X2 

* 

XI 

/xo 

:  = 
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X2 
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/X2 
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/XI 

* 

xo 
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X3 

* 

/X2 

* 

XI 

* 

/xo 

+ 

X3 

* 

X2 

/XI 

* 

xo 

+ 

X3 

X2 

* 

XI 

* 

/xo 

/X3 

* 

/X2 

* 

/XI 
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/xo 
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/X3 
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* 

XI 
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+ 

X3 

* 

X2 

* 

/XI 

* 

/xo 

+ 

X3 

X2  * 

XI 

* 

/xo 

These  Boolean  equations  are  for  devices  with  active-HIGH 
outputs.  These  equations  can  be  inverted  for  devices  with  active- 
LOW  outputs.  The  Boolean  equations  for  active-LOW  devices 
can  also  be  directly  derived  from  the  truth  table  by  collecting  all  the 
product  terms  where  the  active-LOW  outputs  (zeros)  are 
asserted. 

Manipulating  the  equations  with  Boolean  algebra  we  obtain  the 
Boolean  logic  equations: 

XO  /XO 

XI  :=  XI   :+:  XO 

X2        X2   :+:    (XI  *  XO) 

X3  :=  X3  :+:    (X2  *  XI  *  XO) 

Similarly  for  active-LOW  output  devices 
(since  /(A  :+:  B)  =  /A  :+:  B): 

/xo  :=  xo 

/XI  :=  /XI  :+:  XO 

/X2  :-  /X2  :+:    (XI  *  XO) 

/X3  :-  /X3  :  +  :    (X2  *  XI  *  XO) 


These  equations  could  also  be  obtained  from  the  Boolean  equa- 
tions developed  for  an  adder  in  the  combinatorial  design  section 
on  page  2-54. 

Rewriting  the  equations  for  an  adder: 

XO  -  AO  :+:  BO  :+:  Cin 

XI  =  Al   :+:  Bl   :+:  CO 
; where 

CO  -  AO  *  BO  +(A0  +  BO)   *  Cin 

X2  =  A2  :  +  :  B2  :  +  :  CI 
; where 

CI  =  Al  *  Bl  +(A1  +  B1)*(A0  *  BO) 
+  (A1  +  B1)*(A0  +  BO)    *  Cin 

X3  =  A3  :+:  B3  :+:  C2 
; where 

C2  -  A2  *  B2  +(A2  +  B2)  *  (Al  *  Bl) 
+  (A2  +  B2)*(A1  +  B1)*(A0  *  BO) 
+  (A2  +  B2)*(A1  +  B1)*(A0  +  BO)    *  Cin 

Assuming  one  of  the  operands  in  the  adder  is  the  number  itself 
and  the  second  operand  is  one  (X3-X0  =  A3-A0,  B3-B0  =  0001 
and  Cin  =  0)  we  get  the  following  equations  for  a  counter: 

XO  :=  /XO 

XI   :-  XI  :+:  XO 

X2  :-  X2  :+:    (XI  *  XO) 

X3  :=  X3  :+:    <X2  *  XI  *  XO) 

These  are  of  course  the  same  equations  as  the  ones  derived 
directly  from  the  truth  table.  The  equations  for  a  binary  counter 
are  very  regular.  The  general  equation  for  an  n-bit  binary  counter 
can  be  directly  expressed: 

Xn  :=Xn  :+:  (Xn-1  *  Xn-2  ...  XO) 

For  devices  with  active-LOW  outputs  the  general  Boolean  equa- 
tions can  be  derived  by  inverting  both  sides  of  the  equation: 

/Xn  :=  /Xn  :+:  (Xn-1  *  Xn-2  ...  XO) 

These  equations  represent  a  binary  UP  counter.  Counting 
backwards  for  a  DOWN  counter,  the  Boolean  equations  can  be 
similarly  generated,  either  from  the  truth  table  or  from  the  adder 
Boolean  equations.  The  general  equation  for  a  DOWN 
counter  is: 

Xn  :=  Xn  :+:  (/Xn-1  *  /Xn-2  ...  /XO) 

This  equation  is  for  active-HIGH  outputs.  For  active-LOW  output 
devices  the  Boolean  equation  for  a  DOWN  counter  is: 

/Xn  :=  /Xn  :+:  (/Xn-1  *  /Xn-2  ...  /XO) 

A  four-bit  binary  counter  which  combines  both  UP  and  DOWN 
capability  is  illustrated  in  Figure  12.  This  design  is  implemented 
in  a  registered  PLDand  requires  at  least  six  product  terms  for  the 
most  significant  output  bit.  The  design  logic  has  been  imple- 
mented in  sum-of-products  form.  It  uses  a  control  input  (UP) 
which  selects  between  the  "count  up"  and  "count  down"  product 
terms,  allowing  the  counter  to  count  up  or  down. 
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Further  control  functions  can  be  added  to  these  counter  equa-  another  input  signal  (LOAD).  When  the  LOAD  signal  is  HIGH  the 

tions  directly  either  at  the  truth-table  stage  or  in  the  equations.  For  counter  is  loaded  with  the  input  data,  and  when  the  LOAD  signal 

example,  a  load  data  function  is  required  in  most  counters.  This  is  LOW  the  counting  is  resumed, 
allows  registers  to  be  loaded  with  a  count  under  the  control  of 


Title  4Bit_Counter 

Pattern  4cnt.pds 

Revision  B 

Author  Mehrnaz  Hada,  Bill  Hollon,  Ali  Sebt 

Company  Monolithic  Memories  Inc.  Santa  Clara,  CA 

Date  1/14/85 

CHIP   4BitCoun  PAL16RP4 

CLK  UP  AI   BI   CI  DI  CLP.  LOAD  NC  GND 
/OC  NC  NC  D  C  B  A  NC  NC  VCC 

;The  4-bit  counter  counts  up  or  down  and  has  the  clear  and 
;load  capability.  The  clear  operation  overrides  count  and 
;load.  The  counter  counts  up  when  CLR=low,  LOAD=low,  and 
;UP=high.   It  counts  down  whenever  CLR=low,  LOAD=low,  and 
;UP=low 

EQUATIONS 


/A*/B*/C*/D*/UP*/LOAD*/CLR 
/A*  B*  C*  D*  UP*/LOAD*/CLR 
A*  B*  /UP*/LOAD*/CLR 
A*/B*  D*  /LOAD*/CLR 
A*  /C*  UP*/LOAD*/CLR 
A*  C*/D*  /LOAD*/CLR 
LOAD*/CLR* 


;When  CLR=1,  A=0. 
;Else  it  will  count 
;UP  or  DOWN. 


;New  value  is  loaded 
;when  LOAD=l,  CLR=0. 


/B*/C*/D*/UP*/LOAD*/CLR 
/B*   C*   D*  UP*/LOAD*/CLR 
B*   C*/D*  /LOAD*/CLR 
B*/C*  UP*/LOAD*/CLR 
B*  D*/UP*/LOAD*/CLR 
LOAD*/CLR* 


;When  CLR=1,  B=0. 
;Else  it  will  count. 


;New  value  is  loaded 
;when  L0AD=1,  CLR=0. 


/C*/D*/UP*/LOAD*/CLR 
/C*   D*  UP*/LOAD*/CLR 
C*/D*  UP*/LOAD*/CLR 
C*  D*/UP*/LOAD*/CLR 

LOAD*/CLR*  CI 


;When  CLR=1,  C=0. 
;Else  it  will  count. 


; New  value  is  loaded 
;when  L0AD=1,  CLR=0. 


/D« 


/LOAD*/CLR 
LOAD*/CLR*  DI 


;Count 

;New  value  is  loaded 
;when  LOAD=l,  CLR=0. 


Figure  12.  Design  File  for  a  Four-bit  Counter 
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SIMULATION 

TRACE  ON     AI   BI   CI   DI   LOAD  CLR  UP  A 


SETF  LOAD  /CLR  AI   BI   CI   DI  OC 
CLOCKF  CLK 

SETF  CLR 
CLOCKF  CLK 

SETF  /CLR  UP  /LOAD 

FOR   I  :=   1   TO  16  DO 
BEGIN 

CLOCKF  CLK 
END 

SETF  LOAD  /CLR  /UP  AI   BI   CI  DI 

CLOCKF 

SETF  /LOAD 

FOR   I : =   1  TO  16  DO 
BEGIN 

CLOCKF  CLK 
END 

SETF  LOAD  CLR  AI   /BI   CI  /DI 
CLOCKF  CLK 

SETF  /OC 

TRACE  OFF 


; Load  all  registers 
; to  HIGH  and  count  up 

;Clear  all  registers 


;Start  Counting  up 

;Count  up  16  clock 
; cycles 


;Load  all  registers 
;to  HIGH  and  count 
;  down 

;Count  down  16  clock 
; cycles 


;Test  setting  LOAD 
;and  CLR  on  at  the 
; same  time. 


Figure  12.  Design  File  for  a  Four-bit  Counter  (Cont'd.) 


Binary  Counter  Device  Selection  Considerations 

One  major  device  selection  consideration  is  the  logic  require- 
ment. The  binary  counter  Boolean  equations  make  use  of 
exclusive-OR  functions  in  the  output.  In  most  of  the  registered 
PLDs  shown  in  Figure  1 ,  the  XOR  functions  are  implemented  in 
their  sum-of-products  logic  form.  This  usually  requires  a  large 
number  of  product  terms.  We  have  seen  a  design  example  for  a 
four-bit  UP/DOWN  binary  counter  which  requires  up  to  six  prod- 
uct terms.  Most  standard  PAL  devices  provide  eight  product 
terms  per  output.  However,  for  larger  counters,  a  large  number 
of  product  terms  is  required.  Figure  1 3  shows  a  list  of  PLDs  which 
provide  more  than  eight  product  terms  per  output. 

Some  PLDs  provide  a  dedicated  XOR  gate  on  the  outputs  (Figure 
1 4).  This  allows  an  AND-OR-XOR  implementation  of  the  Boolean 
logic,  and  consequently  requires  fewer  product  terms.  A  nine-bit 
binary  counter  design  is  illustrated  in  Figure  15.  This  design  has 
been  implemented  in  the  PAL20X10,  which  provides  an  XOR 
gate.  This  design  cannot  be  implemented  in  standard  PAL 
devices. 


Cascading  Binary  Counters 

Situations  areoccasionally  encountered  in  digital  system  designs 
where  very  long  counters  are  required.  Few  PLDs  provide 
greater  than  ten  outputs  to  implement  such  counters.  Binary 
counters  can  be  easily  cascaded  into  two  or  more  devices  to 
construct  such  large  counters.  The  design  of  long  counters  is  very 
simple.  These  are  designed  as  simple  binary  counters  with  a 
count  enable  control.  The  less  significant  counters  generate  an 
extra  output  signal  at  the  penultimate  count.  These  signals  are 
ANDed  together  to  form  the  count  enable  signal  for  the  higher- 
order  counter.  For  a  down  counter  the  reverse  scheme  is 
implemented. 

Cascading  counters  is  a  lot  simpler  than  cascading  adders 
because  the  carry-look-ahead  circuitry  is  not  required.  The  only 
thing  to  remember  isthatthe  more  significant  countertoggles  only 
when  the  penultimate  count  of  all  of  the  less  significant  counters 
is  reached. 

Notice  that  the  nine-bit  counter  has  a  built-in  mechanism  for  cas- 
cading. This  count  enable  output  signal  is  represented  by  CO  and 
is  easily  implemented  in  the  PLD.  This  is  just  another  example 
of  how  such  modifications  are  easy  to  handle  in  a  PLD  design, 
unlike  standard  SSI/MSI  designs. 
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DEVI 

CE 

TOTAL 
NUMBER 
OF  PINS 

DEDICATED 
INPUT 
PINS 

DEDICATED 
REGISTERED 
OUTPUT 
PINS 

I/O 
PINS 

LOGICAL 
PRODUCT 
TERMS/ 
OUTPUT 

SPEED  GRADES 
FREQUENCY  IN 
MHz 

'max 

TTL 

23S8 

20 

9 

4 

4 

8-12 

33.3,  28.5 

20RS10 

20RS8 

20RS4 

24 
24 
24 

10 
10 
10 

10 
8 
4 

2 
6 

8-16 
8-16 
8-16 

19.2 
19.2 
19.2 

22V10 
32VX10 

24 
24 

12 
12 

0-10 
0-10 

10-0 
10-0 

8-16 
8-16 

40,  28.5.  18 
25,  22.2 

PLS167 
PLS168 

24 
24 

14 

12 

6 
8 

Total  48 
Total  48 

33 
33 

PLS105 

28 

16 

8 

Total  48 

37 

CMOS 

C22V1 0 

24 

12 

0-10 

10-0 

8-16 

33.3,  20 

C29M16 
C29MA1 

« 

24 
24 

5 
5 

16 
16 

8-16 
4-12 

20,  15 
20,  15 

ECL 

10H20E 
10020E' 

V/EG8 
//EG8 

24 
24 

12 
12 

0-8 
0-8 

8-0 
8-0 

8-12 
8-12 

125 
125 

Figure  13.  Registered  PLDs  with  More  Than  Eight  Product  Terms  Per  Output 

DEVI 

TOTAL 
NUMBER 
OF  PINS 

DEDICATED 
INPUT 
PINS 

DEDICATED 
REGISTERED 
OUTPUT 
PINS 

I/O 
PINS 

LOGICAL 
PRODUCT 
TERMS/ 
OUTPUT 

SPEED  GRADES 
FREQUENCY  IN 
MHz 

'max 

TTL 

I  OA** 

20 

8 

4 

4 

8 

14 

20X10 

20X8 

20X4 

24 
24 
24 

10 
10 
10 

10 
8 
4 

2 
6 

4 
4 
4 

22.2 
22.2 
22.2 

20XRP1 
20XRP8 
20XRP6 
20XRP4 

24 
24 
24 
24 

10 
10 
10 
10 

10 
8 
6 
4 

2 
4 

6 

8 
8 
8 
8 

30,  22.2,  14 
30,  22.2,  14 
30,  22.2,  14 
30,  22.2,  14 

22RX8 
32VX10 

24 
24 

14 

12 

0-8 
0-10 

8-0 
10-0 

8 
8-16 

28.5 
25,  22.2 

Figure  14.  Registered  PLDs  for  Binary  Counter  Design  with  XOR  Gate  Output 
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Title  9BitCounter 
Pattern  9BitCnt.pds 
Revision  A 

Author       Mehrnaz  Hada 

Company  Monolithic  Memories  Inc.,  Santa  Clara,  CA 
Date  1/28/85 


;The  9-bit  synchronous  counter  has  parallel  load,  increment, 
;and  hold  capabilities.  The  carry  out  pin  (/CO)  shows  how  to 
[implement  a  carry  out  using  a  register  by  anticipating  one 
;count  before  the  terminal  count  if  counting  and  the  terminal 
,-count  if  loading. 

[Operations  Table 


/OC 

CLK 

/LD 

D8-D0 

Q8-Q0 

Operation 

H 

X 

X 

X 

Z 

HI-Z 

L 

L 

X 

X 

0 

Hold 

L 

C 

L 

D 

D 

Load 

L 

c 

H 

X 

Q  PLUS  1 

Increment 

CHIP  9BitCoun 


CLK  DO  Dl  D2  D3  D4  D5  D6  D7 
/OC  /CO  Q8  07  Q6  Q5  Q4  Q3  Q2 


D8  /LD  GND 
Ql     QO  VCC 


EQUATIONS 
/QO 


/Ql 


AJ2 


/Q3 


/Q4 


A3S 


/Q6 


/Ql 


/Q8 


/LDVQO 
LD*/D0 
/LD 

/LD*/Q1 
LD*/D1 
/LD*  QO 

/LD*/Q2 
LD*/D2 
/LD*  QO*  Ql 

/LD*/Q3 
LD*/D3 
/LD*  QO*  Ql*  Q2 

/LD*/Q4 
LD*/D4 
/LD*  QO*  Ql*  Q2*  Q3 

/LD*/Q5 
LD*/D5 
/LD*  QO*  Ql*  Q2*  Q3* 


04 


/LD*/Q6 
LD*/D6 

/LD*  QO*  Ql*  Q2*  Q3*   Q4*  Q5 

/LD*/Q7 
LD*/D7 

/LD*  QO*  Ql*  Q2*  Q3*  Q4*  Q5*  Q6 


/LD*/Q8 
+  LD*/D8 
:  +  :    /LD*  QO*   Ql*   Q2*  Q3* 


CO   :=     /LD*/Q0*  Ql*  Q2*  Q3* 
+       LD*   DO*   Dl*   D2*  D3* 

Figure  15.  Design 


Q4*  Q5*   Q6*  Q7 
Q4*  Q5*   Q6*  Q7*  Q8 
D4*   D5*   D6*   D7*  D8 

File  for  a  Nine-bit  Counter 


;Hold  QO 
;Load  DO  (LSB) 
; Count 

;Hold  Ql 
[Load  Dl 
;Count 

;Hold  Q2 
;Load  D2 
;Count 

[Hold  Q3 
[Load  D3 
i Count 

[Hold  Q4 
,-  Load  D4 
[Count 

,-Hold  QS 
[Load  D5 
[Count 

[Hold  Q6 
[Load  D6 
[Count 

[Hold  Q7 
[Load  D7 
; Count 

[Hold  Q8 
[Load  D8  (MSB) 
[Count 

[Carry  out 

; (Anticipate  Count ) 

[Carry  out 

; (Anticipate  Load) 
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SIMULATION 

TRACE_ON  /OC  /LD  D8   D7  D6  D5  D4   D3  D2  Dl  DO 
/CO  Q8  Q7  Q6  QS  Q4  Q3  Q2  QI  QO 

SETF  OC  LD  /D8  /D7  /D6  /D5  /D4  /D3  /D2  /Dl  /DO 
CLOCKF  CLK 

SETF  /LD 
CLOCKF  CLK 

SETF  LD  /D8  /D7  /D6  /D5  /D4  /D3  /D2  /Dl  DO 
CLOCKF  CLK 

SETF  /LD 
CLOCKF  CLK 

SETF  LD  /D8  /D7   /D6  /D5  /D4  /D3  D2  Dl  DO 
CLOCKF  CLK 

SETF  /LD 
CLOCKF  CLK 

SETF  LD  /D8  /D7  /D6  /D5  /D4  D3  D2  Dl  DO 
CLOCKF  CLK 

SETF  /LD 
CLOCKF  CLK 

SETF  LD  /D8  /D7   /D6  /D5  D4  D3  D2   Dl  DO 
CLOCKF  CLK 

SETF  /LD 
CLOCKF  CLK 

SETF  LD  /D8     D7     D6     D5  D4   D3  D2  Dl  DO 
CLOCKF  CLK 


;  Load 

;  Increment 
;  Load 

; Increment 
;  Load 

; Increment 
;  Load 

; Increment 
;  Load 

; Increment 


SETF  /LD 
CLOCKF  CLK 


; Increment 


TRACE_OFF 

Figure  15.  Design  File  for  a  Nine-bit  Counter  (Cont'd.) 


ZXMonolltMcMMem, 
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Flip-flop  Selection 

Until  now,  all  of  the  designs  have  been  implemented  in  devices 
with  D-type  flip-flops.  What  happens  if  the  counter  design  is 
implemented  in  a  device  which  allows  both  J-K  and  T-type 
registers?  The  Boolean  logic  equations  for  such  a  design  can  be 
derived  from  the  truth  table.  This  requires  advance  knowledge  of 


the  functionality  of  the  J-K  and  T-type  registers.  For  the  J-K 
register  the  output  is  asserted  when  the  J  input  goes  HIGH,  and 
the  output  is  unasserted  when  the  K  input  goes  HIGH.  Toggle 
type  registers  require  the  T  input  to  be  asserted  for  every  change 
in  the  output  level. 


PRESENT  STATE 

NEXT  STATE 

X3 

X2 

X1 

xo 

X3 

X2 

X1 

xo 

D 

J 

K 

T 

D 

J 

K 

T 

D 

J 

K 

T 

D 

J 

K  T 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

1 

0  1 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

1 

1 

0 

1 

0 

0 

1  1 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

1 

0  1 

0 

0 

1 

1 

0 

0 

0 

0 

1 

1 

0 

1 

0 

0 

1 

1 

0 

0 

1  1 

0 

1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

1 

1 

0  1 

0 

1 

0 

1 

0 

0 

0 

0 

1 

0 

0 

0 

1 

1 

0 

1 

0 

0 

1  1 

0 

1 

1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

1 

1 

0  1 

0 

1 

1 

1 

1 

0 

1 

0 

0 

1 

1 

0 

0 

1 

1 

0 

0 

1  1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

1 

0  1 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

1 

1 

0 

1 

0 

0 

1  1 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

1 

0  1 

0 

1 

1 

0 

0 

0 

1 

1 

0 

1 

0 

0 

1 

1 

0 

0 

1  1 

1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

1 

1 

0  1 

1 

0 

1 

0 

0 

0 

1 

0 

0 

0 

1 

1 

0 

1 

0 

0 

1  1 

1 

1 

0 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

1 

1 

0  1 

1 

1 

1 

0 

0 

1 

1 

0 

0 

1 

1 

0 

0 

1 

1 

0 

0 

1  1 

Figure  16.  Truth  Table  for  D,  J-K  and  T-type  Flip-flops 


Figure  1 6  shows  the  truthtablefor  both  a  J-K  and  aT-type  register 
implementation  for  a  binary  counter.  Deriving  and  optimizing  the 
equations  from  the  table,  we  get  the  following  results: 

X3-J  :-  /X3  *  X2  *  XI  *  XO 
X3-K  :=  X3  *  X2  *  XI  *  XO 

X2-J  :-  /X2  *  XI  *  XO 
X2-K  :=  X2  *  XI  *  XO 

XI- J  :=  /XI  *  XO 
Xl-K  :=  XI  *  XO 

XO-J  :=  /XO 
XO-K  :=  XO 

X3-T  :=  X2  *  XI  *  XO 
X2-T  :=  XI  *  XO 
XI -T  :-  XO 
XO-T  :=  1 

As  we  can  see  from  these  equations,  the  number  of  product  terms 
used  for  J-K  and  T-type  implementations  is  smaller  than  the 
number  of  product  terms  required  for  a  D-type  implementation 
(Figure  12). 


Which  flip-flop  is  most  efficient  depends  on  the  relative  number  of 
transitions  or  holds  required.  As  a  counter  traverses  from  one 
count  (state)  to  another,  every  output  either  makes  a  "transition" 
(changes  logic  level)  or  "holds"  (stays  at  the  same  logic  level). 
Small  counters  in  general  require  more  transitions  and  fewer 
holds.  As  the  designs  get  larger,  the  higher-order  bits  require 
fewer  transitions  and  more  holds. 

D-type  flip-flops  use  up  product  terms  only  for  active  transitions 
from  logic  LOW  levelto  HIGH  level,  and  for  logic  HIGH  level  holds 
only.  J-K  and  T-type  flip-flops  use  up  product  terms  for  both  LOW- 
to-HIGH  and  HIGH-to-LOW  transitions,  but  eliminate  hold  terms. 
Generally,  the  requirements  of  transition  and  hold  terms  depends 
upon  the  count  sequence  selection.  D-type  flip-flops  are  more 
efficient  for  small  designs.  Conversely  J-K  and  T-type  flip-flops 
can  be  more  efficient  for  large  designs,  which  require  more  hold 
terms. 

» 

A  comparison  of  product  term  requirements  of  2-,  3-,  4-  and  5-bit 
binary  counters  can  be  representative  for  other  types  of  counters 
and  state  machines.  The  table  in  Figure  1 7  shows  the  transition 
terms  and  the  hold  terms  required  for  these  counters.  For  a  J-K 
type  flip-flop  implementation,  after  optimizing,  total  product  terms 
required  are  4,  6,  8,  and  10  respectively.  The  D-type  implemen- 
tation requires  3,  6, 1 0,  and  1 5  respectively,  and  is  relatively  less 
efficient  for  large  counters. 
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BINARY  COUNTER 


TRANSITIONS 


HOLDS 


D  PRODUCT 
TERMS 


J-K  PRODUCT 
TERMS 


T  PRODUCT 
TERMS 


2-  Bit 

3-  Bit 

4-  Bit 

5-  Bit 


6 
14 

30 
62 


2 
10 
34 
98 


3 
6 
10 
15 


4 

6 
8 
10 


Figure  17.  Product  Term  Requirements  for  Configurable  Flip-flops 


There  are 
mable  D, 
PLDs 


which 


number  of  PLDs  which  offerthe  capability  of  program- 
K  and  T-type  registers.  Figure  1 8  shows  a  list  of  all 
offer  configurable  flip-flops.  The  J-K  and  T-type  flip- 


flops  are  constructed  from  a  basic  D-type  flip-flop  using  an  extra 
XOR  gate  per  flip-flop. 


DEVIC 

:e 

TOTAL 
NUMBER 
OF  PINS 

DEDICATED 
INPUT 
PINS 

DEDICATED 
REGISTERED 
OUTPUT 
PINS 

I/O 
PINS 

LOGICAL 
PRODUCT 
TERMS/ 
OUTPUT 

SPEED  GRADES 
FREQUENCY  IN 
MHz 

^MAX 

TTL 

22RXI 
32VX1 

i 

0 

24 
24 

14 
12 

0-8 
0-10 

8-0 
10-0 

8 
8-16 

28.5 
25,  22.2 

Figure  18.  Registered  PLDs  with  Configurable  Flip-flops 


Modulo  Counters 


The  number  of  unique  states  a  counter  traverses  is  generally 
referred  to  as  the  modulus.  A  typical  n-bit  binary  counter  has  a 
maximum  modulus  of  2n .  It  is  often  necessary  to  introduce  signal 
delays  into  the  logic  design  to  meet  timing  requirements.  This 
makes  it  possible  to  allow  for  bus-skew,  accesstime  or  differential 
propagation  delays  between  devices  along  two  different  signal 
paths.  A  typical  example  of  this  is  the  introduction  of  wait  states 
to  allowfor  access  times  of  different  memory  elements.  Counters 
and  delay  lines  are  commonly  used  to  introduce  the  delay. 
Counters  in  PLDs  have  the  added  advantage  of  programmability 
to  select  the  required  delay.  Such  applications  where  precise 
timing  duration  control  is  required  usually  use  modulo  counters 
with  a  non-power-of-two  modulus.  Other  applications  of  modulo 
counters  include  waveform  generators  and  arbiters. 


A  good  example  of  a  modulo  counter  is  a  BCD  counter.  Such  a 
counter  is  useful  in  applications  where  thecomputer's  outputs  are 
generated  using  a  decimal  system.  While  a  four-bit  binary 
counter  can  count  up  to  sixteen,  the  BCD  counter  terminates  the 
count  at  the  modulus  of  10. 

Modulo  counters  can  be  designed  in  a  variety  of  ways.  One  direct 
way  isto  usethe  truth  table  to  implement  acount  to  a  modulus  and 
directly  derive  the  equations  from  it.  The  truth  table  for  a  BCD 
count  (from  zero  to  nine)  is  shown  in  Figure  19. 

Now  let  us  consider  what  happens  if  the  device  accidentally 
powers  up  in  one  of  the  count  values  from  ten  to  fifteen.  These 
are  illegal  counts  (states)  and,  for  a  good  design,  a  mechanism 
must  be  built  into  the  equations  to  allow  it  to  recover  back  into  a 
legal  state.  What  we  actually  need  is  to  consider  the  truth  table 
in  Figure  20  in  conjunction  with  the  one  in  Figure  1 9  for  deriving 
the  Boolean  equations. 


PRESENT  STATE 

NEXT  STATE 

Q3  Q2 

Q1 

Q0 

Q3 

Q2 

Q1 

Q0 

0  0 

0 

0 

0->1 

0 

0 

0 

1 

0  0 

0 

1 

1  ->2 

0 

0 

1 

0 

0  0 

1 

0 

2->3 

0 

0 

1 

1 

0  0 

1 

1 

3->4 

0 

1 

0 

0 

0  1 

0 

0 

4->5 

0 

1 

0 

1 

0  1 

0 

1 

5->6 

0 

1 

1 

0 

0  1 

1 

0 

6->7 

0 

1 

1 

1 

0  1 

1 

1 

7->8 

1 

0 

0 

0 

1  0 

0 

0 

8->9 

1 

0 

0 

1 

1  0 

0 

1 

9->0 

0 

0 

0 

0 

PRESENT  STATE 

NEXT  STATE 

Q3    Q2    Q1  Q0 

Q3    Q2    Q1  Q0 

10  10 
10  11 
110  0 
110  1 
1110 
1111 

10 ->0 
11  ->  0 

12 -  >0 

13 -  >0 

14  ->0 

15  ->0 

0      0      0  0 
0       0       0  0 
0       0       0  0 
0       0       0  0 
0       0       0  0 
0       0       0  0 

Figure  19.  Truth  Table  for  a  BCD  Counter 


Figure  20.  Truth  Table  for  Illegal  State  Recovery  to 
Count  Zero 


or  the  PALASM  2  software  minimizer. 


The  Boolean  equation  for  Q3  is: 


Roni^fo»o(l  ■  nnic  IY»«i»«»».  . 

+  /Q3  *  /Ql 
+  /Q2  *  QO 


/Q3 


=  /Q3  *  /Q2 
+  /Q3  *  /Q2 
+  /Q3  *  /Q2 
+  /Q3  * 
+  /Q3  * 
+  /Q3  * 
+  /Q3  * 
Q3  * 
Q3  * 
Q3  * 
Q3  * 
Q3  * 
Q3  * 
Q3  * 


/Q2 
Q2 
Q2 
Q2 
/Q2 
/Q2 
/Q2 
Q2 
Q2 
Q2 
Q2 


/Ql 
/Ql 
Ql 
Ql 
/Ql 
/Ql 
Ql 
/Ql 
Ql 
Ql 
/Ql 
/Ql 
Ql 
Ql 


/QO 

QO 
/QO 

QO 
/QO 

QO 
/QO 

QO 
/QO 

QO 
/QO 

QO 
/QO 

QO 


Q3 
Q3 


CI 
Q2 


Similar  Boolean  equations  can  be  generated  forQ2,  Q1  and  QO. 
Figure  22  shows  the  circuit  diagram  of  a  loadable  dual  BCD 
counter  implemented  in  a  PAL1 6R8.  Figure  23  shows  the  design 
file  for  such  a  counter. 


Figure  21.  State  Sequence  of  a  BCD  Counter  Showing  Illegal  State  Recovery 


CEB  - 
LDB  - 


□  o- 
D,  - 
D2- 
D3- 


CEA  — 
IDA- 
CLOCK— 


INPUT 
BUFFERS 


FOUR- 
BIT 
BCD 

COUNT 


FOUR- 
BIT 
BCD 

COUNT 


Figure  22.  Circuit  of  a  Dual  BCD  Counter 
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TITLE  DUAL  BCD  COUNTER. 

PATTERN  0  0 

REVISION  01. 

AUTHOR  CHRIS  JAY. 

COMPANY  MM I   SANTA  CLARA,  CA. 

DATE  17  JULY  1987 


CHIP  BCD  PAL16R8 


THE  PAL16R8  HAS  BEEN  PROGRAMMED  AS  A  DUAL  BINARY 
CODED  DECIMAL  COUNTER.   REGISTERS  MAY  BE  LOADED  FROM 
DATA  INPUTS  DO  -  D3 ,   THE  LDA  ENABLES  DATA  ON 
DO   -  D3  TO  BE  SYNCHRONOUSLY  LOADED  INTO  Q4   -  Q7  . 
LDB  ENABLES  DATA  ON  DO  -  D3  TO  BE  LOADED  INTO 
REGISTERS  Q0   -  Q3.   TWO  COUNT  ENABLE  INPUTS  CEA 
AND  CEB  ENABLE  A  BINARY  CODED  DECIMAL  COUNT  IN 
BOTH  COUNTERS .   THE  COUNTERS  RESET  IF  POWERED  UP 
IN  A  HEXADECIMAL  STATE,    SO  IF  USED  IN  A  STAND 
ALONE  MODE  THE  COUNTERS  WILL  NOT  LOCK  UP  INTO 
AN   INVALID  STATE. 


;P1N 

1 

2 

3 

4 

5 

CLK 

/LDA 

CEA 

DO 

Dl 

;PIN 

6 

7 

8 

9 

10 

D2 

D3 

CEB 

/LDB 

GND 

;PIN 

11 

12 

13 

14 

15 

/OE 

Q0 

Ql 

02 

03 

;  PIN 

16 

17 

18 

19 

20 

Q4 

QS 

06 

07 

VCC 

STRING     RESA  'Q5*Q7  +  Q6*Q7'  ; RESET  FROM  HEX 

STRING     RESB  'Q1*Q3  +  Q2*Q3'  ; STATES 

EQUATIONS 


/QO 


/Ql 


/Q2 


/Q3 


/Q4 


Q0*/LDB*CEB 
RESB 

/DO* LDB* /CEB 
/Q0*/CEB*/LDB 

Q3*/LDB*CEB 
Q0*Q1*/LDB*CEB 
/QO*/  Q1*/LDB*CEB 
RESB 

/D1*LDB*/CEB 
/Q1*/CEB*/LDB 

Q3*/LDB*CEB 
/Q0*/Q2*/LDB*CEB 
/Q1*/Q2*/LDB*CEB 
QO*  Q1*Q2*/LDB*CEB 
RESB 

/D 2* LDB* /CEB 
/Q2*/CEB*/LDB 

/Q3*/Q0*/LDB*CEB 
Q0*/Q2*/LDB*CEB 
/Q1*/Q3*/LDB*CEB 
RESB 

/D3*LDB*/CEB 
/Q3*/CEB*/LDB 

Q4*/LDA*CEA 
RESA 

/DO* LDA* /CEA 
/Q4  *  /CEA*  /LDA 


COUNT  EQUATION 

RESET  FROM  ILLEGAL  STATE 

LOAD  DO 

HOLD  QO 

COUNT  EQUATION 


RESET  FROM  ILLEGAL  STATE 
LOAD  Dl 
HOLD  Ql 

COUNT  EQUATION 

; 

RESET  FROM  ILLEGAL  STATE 
LOAD  D2 
HOLD  Q2 


COUNT  EQUATION 


RESET  FROM   ILLEGAL  STATE 
LOAD  D3 
HOLD  Q3 

COUNT  EQUATION 

RESET  FROM  ILLEGAL  STATE 

LOAD  DO 

HOLD  Q4 


Figure  23.  Design  File  for  a  Dual  BCD  Counter 
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/Q6 


Q7*/LDA*CEA 
Q4*Q5*/LDA*CEA 
/Q4*/Q5*/LDA*CEA 
RESA 

/D1*LDA*/CEA 
/Q5*/CEA*/LDA 

Q7*/LDA*CEA 

/Q4*/Q6*/LDA*CEA 

/Q5*/Q6*/LDA*CEA 

Q4*Q5*Q6*/LDA*CEA 

RESA 

/D2* LDA* /CEA 
/Q6*/CEA*/LDA 

/Q7*/Q4*/LDA*CEA 
Q4*/Q6*/LDA*CEA 
/Q5*/Q7*/LDA*CEA 
RESA 

/D3*LDA*/CEA 
/Q7*/CEA*/LDA 


COUNT  EQUATION 


RESET  FROM  ILLEGAL  STATE 
LOAD  Dl 
HOLD  Q5 


COUNT  EQUATION 


RESET  FROM  ILLEGAL  STATE 
LOAD  D2 
HOLD  Q6 


COUNT  EQUATION 


RESET  FROM  ILLEGAL  STATE 
LOAD  D3 
HOLD  Q7 


SIMULATION 

TRACE_ON  CLK  Q4  QS  Q6  Q7 

QO  Ql  Q2  Q3  LDA 
LDB  DO  Dl  D2  D3 
CEA  CEB  OE 

; PRLDF     QO  Ql  Q2  Q3  Q4  Q5  Q6  Q7 

SETF         /CLK  OE 

CLOCKF  CLK 

SETF         /CLK  OE  DO  Dl  D2  /D3 

SETF         LDA  LDB  /CEA  /CEB 

CLOCKF  CLK 

SETF         /LDB  /LDA 

CLOCKF  CLK 

SETF         CEA  /LDA 

FOR   I    :=   1  TO  10  DO 

BEGIN  CLOCKF  CLK 

END 

SETF  /CEA  CEB  /LDB 
FOR   I    :=   1  TO  10  DO 
BEGIN  CLOCKF  CLK 
END 

SETF  /CEB  LDB  /DO  Dl  /D2  D3 
CLOCKF  CLK 
SETF  /LDB 
CLOCKF  CLK 

SETF  LDB  DO  Dl   /D2  D3 
CLOCKF  CLK 
SETF  /LDB 
CLOCKF  CLK 

SETF  LDB  /DO   /Dl   D2  D3 
CLOCKF  CLK 
SETF  /LDB 
CLOCKF  CLK 

SETF  LDB  DO  /Dl   D2  D3 
CLOCKF  CLK 
SETF  /LDB 
CLOCKF  CLK 

SETF  LDB  /DO  Dl   D2  D3 
CLOCKF  CLK 
SETF  /LDB 
CLOCKF  CLK 

SETF  LDB  DO   Dl  D2  D3 
CLOCKF  CLK 
SETF  /LDB 
CLOCKF  CLK 
TRACE  OFF 


TRACE  ALL  SIGNALS 


SET  DATA  INPUTS  SELECT 
REGISTERS  QO   -  Q3  PERFORM 
SYNCHRONOUS  LOAD. 
SELECT  REGISTERS  Q4  -  Q7 
PERFORM  SYNCHRONOUS  LOAD. 
ENABLE  Q4   -  Q7  AS  A  BCD 
COUNTER,   CLOCK  FOR  10 
CLOCK  CYCLES 

ENABLE  QO  -  Q3  AS  A  BCD 
COUNTER,   CLOCK  FOR  10 
CYCLES 

LOAD  ILLEGAL  STATES 
TO  TEST  COUNTER  RESET 


Figure  23.  Design  File  for  a  Dual  BCD  Counter  (Cont'd.) 
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Modulo  Counter  Device  Selection  Considerations 

We  have  illustrated  a  counter  which  counts  from  zero  to  a  fixed 
modulus.  The  same  technique  can  be  applied  for  a  counter  which 
counts  down  from  a  maximum  power-of-two  number  to  a  fixed 
modulus,  or  even  a  counter  which  counts  from  one  modulus  to 
another.  The  important  considerations  will  be  the  number  of 
product  terms  used. 

The  registered  PLDs  used  for  modulo  counters  are  similar  to  the 
ones  selected  for  other  counters.  Since  the  counts  used  are 
binary,  devices  with  J-K  or  T-type  flip-flops  or  XOR  gates  will  help 
optimize  the  number  of  product  terms  used.  The  product  term 
usage  also  depends  upon  the  modulus  selected.  Generally  a 
power-of-two  or  a  multiple-of-two  modulus  will  require  fewer 
product  terms. 


Another  factor  for  flip-flop  selection  is  the  illegal  states.  D-type 
flip-flops  are  generally  better  suited  for  illegal  state  recovery  than 
the  J-K  or  T-type  flip-flops.  This  is  because  when  noproductterm 
is  asserted,  the  D-type  flip-flops  reset  to  zero.  Designers  using  J- 
K  or  T-type  flip-flops  must  design-in  illegal  state  recovery. 

Certain  devices  allow  the  use  of  a  synchronous  RESET  product 
term  for  modulo  counters.  The  idea  is  to  use  a  minimal  number 
of  product  terms  to  build  a  binary  counter  which  counts  up  to  a 
power-of-two  number.  However,  this  counter  is  RESET  to  zero 
using  the  synchronous  RESET  product  term  when  the  desired 
modulus  is  reached.  It  then  begins  counting  afresh  from  zero,  and 
the  procedure  is  repeated.  Similar  operation  can  also  be 
achieved  with  a  synchronous  PRESET  product  term  for  a  down 
counter.  Figure  24  shows  a  table  of  all  the  PLDs  that  provide  the 
RESET  and  PRESET  product  terms. 


DEVIC 

TOTAL 
NUMBER 
OF  PINS 

DEDICATED 
INPUT 
PINS 

DEDICATED 
REGISTERED 
OUTPUT 
PINS 

I/O 
PINS 

LOGICAL 
PRODUCT 
TERMS/ 
OUTPUT 

SPEED  GRADES 
FREQUENCY  IN 
MHz 

^MAX 

TTL 

16RA8 
23S8 

20 
20 

8 

9 

0-8 
4 

8-0 
4 

4 
8-12 

20 

33.3,  28.5 

20RA10 
22RX8 
22V10 
32VX10 

24 
24 
24 
24 

10 
14 
12 
12 

0-10 
0-8 
0-10 
0-10 

10-0 
8-0 
10-0 
10-0 

4 
8 

8-16 
8-16 

33,  20 
28.5 
40,  28.5,  18 
25,  22.2 

CMOS 

C22V10 

24 

12 

0-10 

10-0 

8-16 

33.3,  20 

C29M16 
C29MA16 

24 
24 

5 
5 

16 
16 

8-16 
4-12 

20,  15 
20,  15 

ECL 

10H20EV/I 
1 0020EV/E 

EG  8 
G8 

24 
24 

12 
12 

0-8 
0-8 

8-0 
8-0 

8-12 
8-12 

125 
125 

Figure  24.  PLDs  Which  Provide  RESET  or  PRESET  Product  Terms 
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Using  synchronous  RESET  and  PRESET  product  terms  allows 
the  counter  to  recover  from  illegal  states.  Notice  that  the  logic 
product  terms  in  the  counter  are  designed  for  a  complete  binary 
count.  If  the  counter  powers  up  in  any  illegal  state  (as  shown  in 
Figure  25),  it  will  continue  the  count  until  the  terminal  count  and 
return  to  zero,  where  the  correct  modulo  count  will  begin.  This 
illegal  state  recovery  will  take  an  unpredictable  number  of  clock 
cycles,  and  you  may  wish  to  design  a  more  systematic  recovery 
system. 

Cascading  Modulo  Counters 

For  large  modulo  counters,  the  technique  of  generating  Boolean 
equations  from  the  truth  tables  is  very  tedious  and  time  consum- 
ing. Another  approach  for  designing  modulo  counters  is  to  divide 
it  into  two  smaller  modulo  counters.  In  addition  to  simplifying  the 
design,  this  approach  usually  helps  optimize  the  number  of 
product  terms. 


As  an  example,  a  modulo-360  counter  can  be  directly  imple- 
mented with  nine  register  bits.  However,  instead  of  implementing 
this  as  a  straight  9-bit  counter,  we  can  implement  this  as  two 
counters:  one  four-bit  counter  (counting  from  zero  to  14)  and 
another  five-bit  counter  (counting  from  zero  to  23).  Together  the 
two  counters  count  up  to  360.  Shown  in  Figure  26  the  terminal 
count  output  MOUT  is  asserted  when  the  count  reaches  360. 

The  design  requires  nine  inputs,  nine  outputs,  one  clock  pin,  one 
LOAD  pin,  one  RESET  and  one  MOUT  (module  output  signal) 
pin.  Note  that  no  extra  flip-flops  or  pins  were  needed.  Obviously, 
the  count  values  of  this  counter  are  not  the  same  as  a  straight 
modulo-360  counter.  Actually,  this  is  what  contributes  to  the 
optimization  of  the  number  of  product  terms  used.  The  PLPL 
design  file  of  this  counter  is  shown  in  Figure  27. 


SYNCHRONOUS 
RESET  PRODUCT 
TERM  ASSERTED 


Figure  25.  A  BCD  Counter  Using  Synchronous  RESET  Product  Term 


Q24  (4)  (3)  (2)  (1)  (0)  Q15  (3)  (2)  (1)  (0) 


A  (8:4) 
LOAD 
RST 
CLK 


M_OUT 


406  10 

Figure  26.  A  Modulo-360  Counter  In  PAL22V10 
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DEVICE      MODOLO_360   COUNTER  (PAL22V10) 


CLK       =  1 
LOAD     ■  3 
A[8:0]   =  is 
Q24[4:0)  = 
Q15[3:0]  = 
M  OUT  =  23 


RST     =  2 

11,13 
18:14 
22:19 

"ASSERTED  WHEN  COUNT  REACHES  360" 


BEGIN 


"COUNTER  INITIALIZATION" 


"INITIALIZATION  COUNT" 
"INITIALIZATION  COUNT" 


IF  (RST)  THEN  ARESET ( )  ; 
IF   (LOAD)   THEN  BEGIN 

Q24[4:0]    :=  A[8:4] 

Q15[3:0]  :=  A[3:0] 
END; 

ELSE  BEGIN 

IF   (Q24[4]*/Q24[3]«  Q24[2]*  Q24[l]*  Q24[0]* 

Q15[3]*  Q15[2)*  Q15[l]*/Q15[0])   THEN  M_OUT  =  1; 
CASE   (Q24[4:0])   BEGIN   "MODULO  24" 
#B00000)  BEGIN 

Q15[3:0]    :=  Q15[3:0]  ; 
Q24[4:0]    :=  #B00001  ; 

END  ; 


#B00001)  BEGIN 

Q15(3:0)  :=  Q15[3:0] 
Q24[4:0]    :=   #B00010  ; 

END  ; 

IB00010)  BEGIN 

Q15[3:0]  :  =  Q15[3:0] 
Q24[4:0]    !=   #B00011  j 


IB00011) 


BEGIN 
Q15[3:0] 
Q24 [ 4:0] 


Q15[3:0) 
#B00100  ; 


#B00100) 

END  ; 
JB00101 ) 

END  ; 
#B00110 ) 

END  ; 
#B00111 ) 

END  ; 
#B01000) 

END  ; 
jfBOlOOl) 


BEGIN 

Q15[3:0] 

Q24[4:0] 


BEGIN 

Q15[3:0] 

Q24[4:0] 


BEGIN 

Q15[3:0] 

Q24[4:0] 


BEGIN 

Q15[3:0] 

Q24[4:0] 


BEGIN 

Q15[3:0] 

Q24[4:0] 


BEGIN 

Q15[3:0] 

Q24[4:0] 


Q15[3:0] 
#B00101  ; 


Q15[3:0] 
»B00110  ; 


Q15[3:0) 
#B00111  i 


Q15[3:01 
#B01000  ; 


Q15[3:0] 
#B01001  ; 


Q15[3:0] 
#B01010  ; 


#B01010)  BEGIN 

Q15[3:0] 
Q24(4:0] 

END  ; 


Q15(3:0] 
#B01011  ; 


#B01011)  BEGIN 

Q15[3:0]  :=  Q15[3:0]  ; 
Q24[4:0]    :=  #B01100  ; 

END  ; 

Figure  27.  Design  File  of  a  Modulo-360  Counter 
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#B01100)  BEGIN 

Q15[3:0] 
Q24[4:0] 

END  ; 


Q15(3:0] 
#B01101  i 


#B01101)  BEGIN 

Q15[3i0] 
Q24[4:0] 

END  ; 


Q15[ 3:0] 
#B01110  ; 


#B01110)  BEGIN 

Q15[3:0] 
O24[4:0] 

END  ; 


Q15[3:0] 
♦B01111  ; 


#B01111)  BEGIN 

Q15[3:0] 
Q24[4:0] 

END  ; 


Q15[3:0] 
#B10000  ; 


#B10000)  BEGIN 

Q15[3:0] 
Q24[4:0] 

END  ; 


Q15[3:0) 
#B10001  ; 


IB10001)  BEGIN 

Q15[3:0] 
Q24[4:0] 

END  ; 


Q15[3:0] 
»B10010  ; 


#B10010)  BEGIN 

Q15[3:0] 
Q24[4:0] 

END  ; 


Q15[3:0] 
#B10011  ; 


# BIO Oil)  BEGIN 

Q15[3:0] 
Q24[4:0] 

END  : 


Q15[3:0) 
#B10100  ; 


JB10100)  BEGIN 

015(3:0] 
Q24[4:0] 

END  ; 


Q15[3:0] 
#B10101  ; 


#B10101)  BEGIN 

Q15[3i0] 
Q24[4:0] 

END  ; 


Q15[3:0] 
#B10110  ; 


#B10110)  BEGIN 

Q15[3:0] 
Q24[4:0) 

END  : 


Q15[3:0J 
#B10111  | 


#B10111)  BEGIN 

Q24[3:0] 
CASE  (Q15[3:0] 
#BOO00) 


#B0001 
#B0010) 
JB0011 ) 


»B00000 
BEGIN 
Q15[3. 
Q1513 
Q15[3 
Q15[3 


#B0100)  Q15[3:0] 

#B0101)  Q15[3:0] 

#B0110)  Q15[3:0] 

#B0111)  Q15[3:0] 

#B1000)  Q15[3:0] 

#B1001)  Q15[3:0] 

#B1010)  Q15[3:0] 

#B1011)  Q15[3:0] 

IB1100)  Q15[3:0] 

IB1101)  Q15[3:0] 

#B1110)  015(3:0] 


"MODULO  15" 
#B0001 
IB0010 
#B0011 
#B0100 
#B0101 
IB0110 
#B0111 
IB1000 
#B1001 
#B1010 
IB1011 
#B1100 
#81111 
IB11I0 
#B0000 


Figure  27.  Design  File  of  a  Modulo-360  Counler  (Cont'd.) 
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TEST_VECTORS 

IN     CLK  RST  LOAD  A[8:0]  ; 
OUT  Q24[4:0]   Q15[3:0]   M_OUT  ; 

BEGIN 


C 

c 
c 
c 
c 
c 
c 
c 
c 
c 
c 
c 
c 
c 
c 
c 

END. 


RST 

LOAD 

A[f 

:  C 

] 

Q24  [  4 

0 

Qi5[: 

:  C 

] 

M  < 

1 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

L 

L 

L 

L 

L 

L 

L 

L 

L 

L 

0 

1 

1 

0 

1 

0 

1 

1 

1 

0 

0 

H 

L 

H 

L 

H 

H 

H 

L 

L 

L 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

H 

L 

H 

H 

L 

H 

H 

L 

L 

L 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

H 

L 

H 

H 

H 

H 

H 

L 

L 

L 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

L 

L 

L 

L 

L 

H 

H 

L 

H 

L 

0 

1 

0 

1 

1 

1 

1 

1 

1 

1 

0 

L 

H 

H 

H 

H 

H 

H 

H 

L 

L 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

H 

L 

L 

L 

L 

H 

H 

H 

L 

L 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

H 

L 

L 

L 

H 

H 

H 

H 

L 

L 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

H 

L 

L 

H 

L 

H 

H 

H 

L 

L 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

H 

L 

L 

H 

H 

H 

H 

H 

L 

L 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

H 

L 

H 

L 

L 

H 

H 

H 

L 

L 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

H 

L 

H 

L 

H 

H 

H 

H 

L 

L 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

H 

L 

H 

H 

L 

H 

H 

H 

L 

L 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

H 

L 

H 

H 

H 

H 

H 

H 

L 

H 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

L 

L 

L 

L 

L 

L 

L 

L 

L 

L 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

L 

L 

L 

L 

H 

L 

L 

L 

L 

L 
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X 

X 

X 

X 
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X 

X 

X 

X 

L 

L 

L 

H 

L 

L 

L 

L 

L 

L 

Figure  27.  Design  File  of  a  Modulo-360  Counter  (Cont'd.) 


PLS  Devices  for  Modulo  Counters 

Monolithic  Memories  offers  afamily  of  PLS  devices.  PLS  devices  also  provide  a  complement  array  term  which  is  very  useful  for 

provide  programmable  AND  and  programmable  OR  arrays.  designing  modulo  counters.  The  available  PLS  devices  are  listed 

These  devices  offer  S-R  type  output  and  buried  flip-flops.  They  in  Figure  28. 


DEVI 

CE 

TOTAL 

DEDICATED 

DEDICATED 

I/O 

LOGICAL 

SPEED  GRADES 

NUMBER 

INPUT 

REGISTERED 

PINS 

PRODUCT 

FREQUENCY  IN 

OF  PINS 

PINS 

OUTPUT 

TERMS/ 

MHz 

PINS 

OUTPUT 

'max 

TTL 

PLS105 

28 

16 

8 

Total  48 

37 

PLS1 

67 

24 

14 

6 

Total  48 

33 

PLS1 

68 

24 

12 

8 

Total  48 

33 

Figure  28.  PLS  Devices 
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state  recovery  and  programmable  modulo  counter  designs. 
Illegal  State  Recovery 

For  any  counter  going  through  a  sequence  of  legal  counts 
(states),  at  least  one  product  term  is  always  active  as  long  as  the 
counter  never  needs  to  hold  a  count;  such  terms  determine  the 
next  legal  state.  When  the  counter  powers  up  in  any  invalid  state, 
none  of  the  product  terms  for  deciding  the  next  state  will  be  active. 
When  none  of  the  product  terms  connected  to  the  complement 
array  is  active,  the  complement  array  is  asserted. 

The  complement  array  is  usually  connected  to  all  of  the  product 
terms  of  the  counter.  Thus  when  the  count  is  legal,  the  comple- 
ment array  output  is  always  LOW  (Figure  30).  When  the  counter 
is  in  an  illegal  state,  the  complement  array  detects  it  and  is 


Programmable  Modulus  Counter 

The  complement  array  term  can  instead  be  used  for  designing 
programmable  modulus  counters.  Acounter  is  implemented  with 
the  device  registers.  Different  product  terms  are  programmed 
with  different  modulus  values.  These  product  terms  are  under  the 
control  of  input  signals.  At  any  one  time  only  one  of  the  product 
terms  is  kept  active.  The  complement  array  is  connected  to  all 
such  terminal  count  product  terms.  Since  all  of  these  product 
terms  are  LOW  the  complement  array  is  initially  HIGH  and  is  used 
as  a  count  enable  for  the  counter.  When  the  count  reaches  the 
terminal  value  the  complement  array  term  goes  LOW  disabling 
the  normal  count  product  terms.  Simultaneously  the  terminal 
count  product  term  becomes  active,  resetting  the  counter  to 
begin  counting  afresh. 
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Figure  29.  The  Architecture  of  the  Complement  Array  Term 
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Figure  30.  Complement  Array  for  Illegal  State  Recovery 
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Figure  31.  Programmable  Modulo  Counter  Using  Complement  Array  Term 
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Counters  with  Encoding 


Until  now  we  have  discussed  counters  which  generate  binary 
output  sequences.  Most  peripherals  require  a  predetermined 
sequence  of  control  signals.  Custom  control  sequences  can  be 
generated  by  decoding  the  binary  sequence  with  combinatorial 
logic.    Figure  32  shows  a  general  model  of  a  counter  with 


COMBINATORIAL 
LOGIC 


combinatorial  output  decoding  circuitry.  This  combinatorial  circuit 
modifies  the  counter  bits  and  generates  output  signals  in  the 
manner  required  for  peripheral  timing  and  control.  Since  these 
circuits  require  extra  combinatorial  logic,  they  are  not  very  effi- 
cient. They  are  also  more  susceptible  to  hazards  and  output 
glitches. 
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Figure  32.  Counter  with  an  Output  Decoder 


It  is  possible  to  have  a  different  output  coding  for  a  four-bit 
counter,  as  shown  in  Figure  33.  This  code,  called  Gray  code, 
allows  only  one  output  bit  to  toggle  for  each  new  count  value.  This 
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code  can  be  easily  derived  from  a  four-bit  binary  counter  code 
(also  shown  in  Figure  33)  using  an  output  decoder. 

Wecan  derive  the  Boolean  equations  forthe  combinatorial  output 
decoder  from  the  truth  table.  The  equations  are: 

G3  =  X3 
G2  -  X3  :  +  :  X2 
Gl  =  X2   :  +  :  XI 
GO  =  XI  :  +  :  XO 

A  more  efficient  and  easier  technique  for  generating  control 
signals  is  to  implement  the  decode  circuitry  before  the  registers. 
This  alternative  is  shown  in  Figure  34.  This  essentially  generates 
a  non-standard  counter  with  state  values  which  are  not  a  binary 
progression.  It  can  also  be  considered  to  be  a  counter  where  the 
product  terms  for  a  binary  count  and  encoding  the  outputs  have 
been  combined. 

Many  different  codes  can  be  generated  using  such  techniques. 
We  will  limit  ourselves  to  the  ones  which  are  most  commonly 
used:  Gray-code  counters  and  Johnson  counters. 


Figure  33.  Generating  Gray  Code  from  a  Binary  Code 
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Figure  34.  Counter  with  Combined  Next  State  Generation  and  Output  Encoding  Circuit 
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Gray-Code  Counters 


The  Boolean  logic  equations  for  a  Gray-code  counter  are: 


Gray-code  counters  are  often  used  in  digital  designs  for  control 
timing  functions.  The  primary  advantage  of  Gray-code  counters 
stems  from  the  characteristic  that  only  one  output  bit  changes 
value  for  every  clock  cycle.  These  output  signals  can  then  easily 
be  decoded  using  a  combinatorial  decoder  without  any  risk  of 
hazards.  Gray-code  counters  are  used  extensively  as  system 
clocks,  since  the  different  output  bits  provide  different  clock 
phases  without  the  risks  of  hazards.  Gray-code  is  also  used  in 
high-speed  data  communication  applications,  where  data  is 
transmitted  from  one  part  of  the  system  to  another,  and  where  the 
error  susceptibility  increases  with  the  number  of  bit  changes 
between  adjacent  numbers  in  a  sequence.  These  are  also  used 
for  such  specialized  applications  as  shaft  encoders  and  real-time 
process  control. 

The  implementation  of  a  Gray-code  counter  is  very  simple.  A 
truth  table  can  be  derived  from  the  transition  table  as  is  done  for 
a  binary  counter.  The  Boolean  equations  can  then  be  directly 
derived  from  the  truth  table.  The  truth  table  for  the  Gray-code 
counter  is  shown  in  Figure  35. 
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Figure  35.  Truth  Table  for  a  Four-bit  Gray-code  Counter 
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A  Johnson  counter  is  part  of  a  family  of  counters  known  as  "ring 
counters."  These  counters  are  used  for  special  applications 
where  code  symmetry  is  desired.  Ring  counters  are  also  often 
used  for  timing  purposes,  since  all  of  the  outputs  are  essentially 
a  series  of  pulses.  This  code  symmetry  also  allows  use  of  the 
fewest  possible  product  terms  with  a  D-type  register.  Devices 
such  as  the  Logic  Cell  Array  (LCA),  which  provide  a  small  amount 
of  logic  per  cell,  can  implement  Johnson  counters  very  easily. 

Johnson  counters  are  also  known  as  circular-shift  counters.  The 
sequence  for  a  five-stage  Johnson  counter  is  shown  in  Figure  36. 
As  can  be  seen  in  the  truth  table,  the  counter  first  fills  up  with  1  's 
from  left  to  right  and  then  it  fills  up  with  zeros  again.  Note  from  the 
output  sequence  that  only  one  of  the  Johnson  counter  bits 
changes  for  every  clock  period,  like  the  Gray-code  counter.  One 
major  advantage  of  the  Johnson  counter  is  that  it  can  be  readily 
decoded  with  small  two-input  NAND  gates  and  hence  is  suitable 
for  high-speed  applications. 

Note  that  the  five-stage  sequence  has  a  table  of  1 0  legal  states 
and  22  illegal  states  (Figure  37).  In  general,  an  n-bit  Johnson 
counter  will  produce  a  modulus  of  2n.  Figure  38  shows  the  state 
diagram  of  the  five-bit  counter. 
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Figure  36.  Flve-blt  Johnson  Counter  Truth  Table 


The  implementation  of  a  Johnson  counter  is  relatively  straight- 
forward, and  is  the  same  regardless  of  the  number  of  stages. 
When  D-type  flip-flops  are  used,  the  Q  output  of  each  flip-flop  is 
connected  to  the  D  input  of  the  following  stage.  The  single 
exception  isthe  Q  output  of  the  last  stage,  which  is  complemented 
and  connected  to  the  D  input  of  the  first  stage. 
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Figure  37.  Illegal  States  for  a  Flve-blt  Johnson  Counter 


Figure  38.  State  Diagram  of  a  Flve-blt  Johnson  Counter 


One  disadvantage  of  the  counter  is  the  number  of  invalid  (or 
illegal)  states.  The  invalid  states  increase  exponentially  with  the 
length  of  the  counter.  The  bigger  the  counter  becomes,  the 
greater  are  its  chances  of  entering  an  illegal  state.  Johnson 
counters  are  very  susceptible  to  illegal  states,  and  can  "hang  up" 
very  easily.  Noise  or  improper  use  can  cause  this  counter  to  end 
up  in  an  illegal  state.  Therefore  a  design  with  illegal  state  recovery 
circuitry  is  always  recommended. 


Registered  Logic  Design 


Figure  39  shows  a  nine-bit  Johnson  counterwhich  can  be  derived 
by  directly  extending  the  design  of  afive-bit  Johnson  counter.  The 
design  file  adds  a  LOAD  function;  it  requires  nine  input  pins,  nine 
output  pins,  one  RESET  pin,  one  LOAD  pin,  one  invalid  status  pin 


and  one  clock  pin.  The  design  incorporates  the  product  terms 
required  for  illegal  state  recovery.  The  PLPL  design  file  for  this 
counter  is  shown  in  Figure  40. 
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Figure  39.  Block  Diagram  of  a  Nine-bit  Johnson  Counter 

N I NE_B I T_JOHNSON_COUNTER   ( AmPAL 2  2 VI 0 ) 


PIN  CLK     =  1 
RST     =  2 
A(8:0]   =  3:11 
Q[8:0]   =  14:21,23 
VF       =   22   "VALID  STATE  FLAG" 
LOAD  =  13  ; 


BEGIN 


IF  (RST)  THEN  ARESET(Q[ 8:0] )  ; 
IF  (LOAD)  THEN  Q[8:0]  :=  A|8:0] 
ELSE  BEGIN 

CASE  (Q[8:0]) 

BEGIN 
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END  i 


Figure  40.  Design  File  for  a  Nine-bit  Johnson  Counter 
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BEGIN 

L 

"C  R  O 

"LSA  AAAAAAAAA  QQQQQQQQQ  V" 
"KTD     876543210       876543210  F" 


X 

1 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

L 

L 

L 

L 

L 

L 

L 

L 

L 

H; 

"RESET" 

c 

0 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

H 

H 

H 

H 

H 

H 

H 

H 

H 

H; 

" LOAD" 

c 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

L 

H 

H 

H 

a 

H 

H 

H 

H 

H; 

"COUNT" 

c 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

L 

L 

H 

H 

H 

H 

K 

H 

H 

H; 

c 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

L 

L 

L 

H 

H 

H 

H 

H 

H 

H; 

"COUNT" 

c 

0 

1 

1 

0 

1 

0 

1 

0 

1 

0 

1 

H 

L 

H 

L 

H 

L 

H 

L 

H 

L ; 

"LD  INVALID  ST" 

c 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

L 

L 

L 

L 

L 

L 

L 

L 

L 

H; 

"INVALID  ST" 

c 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

H 

L 

L 

L 

L 

L 

L 

L 

L 

Hi 

"RECOVERY" 

c 

0 

1 

1 

1 

1 

1 

1 

1 

1 

1 

0 

H 

H 

H 

H 

H 

H 

H 

H 

L 

H; 

"LOAD" 

c 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

H 

H 

B 

H 

H 

H 

H 

H 

H 

H ; 

"COUNT" 

c 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

L 

H 

H 

H 

H 

H 

H 

H 

H 

H ; 

c 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

1 

L 

L 

L 

L 

L 

L 

L 

L 

H 

H; 

c 

0 

0 

X 

X 

X 

X 

X 

X 

X 

X 

X 

L 

L 

L 

L 

L 

L 

L 

L 

L 

H  ; 

END. 

Figure  40.  Design  File  for  a  Nine-bit  Johnson  Counter  (Cont'd.) 

Shift  Registers 


A  Shift  Register  is  a  special  digital  circuit  often  used  as  a  primary 
building  block  in  digital  computer  systems.  It  is  closely  related  to 
a  ring  counter.  Its  fundamental  usage  is  for  temporary  data 
storage  and  bit -wise  data  manipulation  for  advanced  arithmetic 
and  multiplication  operations.  Shift  registers  are  also  frequently 
used  in  communications,  for  converting  parallel  byte-wide  data 
from  the  microprocessor  to  a  serial  data  bit-stream  for  transmis- 
sion. An  example  of  a  serial  data  link  controller  on  page  2-453 
shows  such  a  shift  register  design.  Shift  registers  are  also  used 
in  graphics  systems  for  serializing  parallel  data  for  use  by  the 
display  monitor.  Anumber  of  examples  of  video  shift  registers  are 
included  in  the  graphics  section  (page  2-257). 

The  fundamental  purpose  of  a  shift  register  (Figure  41)  is  to  shift 
data  from  one  flip-flop  to  another.  There  are  several  types  of  shift 
registers.  They  are  classified  by  the  way  in  which  incoming  data 
is  received  (parallel  or  serial),  and  how  outgoing  data  is  transmit- 
ted (parallel  or  serial). 


In  the  following  example,  we  will  discuss  a  simple  universal  shifter 
which  provides  both  serial  and  parallel  input  and  outputf  unctions. 
Depending  upon  the  control  signals  10  and  11,  the  data  is  shifted 
from  one  flip-flop  to  another  in  the  left  or  the  right  direction.  These 
inputs  also  control  when  the  new  parallel  data  is  loaded  onto  the 
registers.  When  shifting  left  or  right,  serial  data  can  be  received 
and  transmitted  on  serial  pins  LIRO  and  RILO.  Since  the  flip-flop 
outputs  appear  on  the  output  pins  at  all  times,  the  parallel  output 
data  is  always  available.  The  truth  table  is  shown  in  Figure  42. 

The  Boolean  logic  equations  can  be  directly  derived  from  the  truth 
table,  and  are  shown  in  the  PAL  device  design  file  in  Figure  43. 

Shift  registers  can  be  modified  to  suit  various  system  design 
requirements.  This  universal  shift  register  can  be  used  for  serial 
in/serial  out,  parallel  in/parallel  out,  serial  in/parallel  out  and 
parallel  in/serial  out  functions. 


CONTROL 
SIGNALS  ' 


LEFT  SERIAL  DATA 
IN  AND  OUT  (LIRO) 


PARALLEL  DATA  IN 
D0  D,  D2  D3  D4  D6  D6  D7 

IIIUUI 


CLOCK 


SHIFT  REGISTER 


TTTTTTTT 

Q0  Q,  Q2  Q3  O.4  Qs  Q6  Cv 
PARALLEL  DATA  OUT 


RIGHT  SERIAL  DATA 
'  OUT  AND  IN  (RILO) 


Figure  41.  A  Shift  Register  Block  Diagram 
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Q7 

Q6 

Q5 

Q4 

Q3 

Q2 

Q1 

QO 

11 

10 

Q7 

Q6 

Q5 

Q4 

Q3 

Q2 

Q1 

QO 

0 

0 

;Retain  Data 

RILO 

Q7 

Q6 

Q5 

Q4 

Q3 

Q2 

Q1 

0 

1 

;Shift  Right 

Q6 

Q5 

Q4 

Q3 

Q2 

Q1 

QO 

LIRO 

1 

0 

;Shift  Left 

D7 

D6 

D5 

D4 

D3 

D2 

D1 

DO 

1 

1 

;Load  Data 

Figure  42.  The  Truth  Table  lor  a  Universal  Shift  Register 

TITLE  UNIVERSAL_SHIFT_REGI STER 

PATTERN  SHIFT.PDS 

REVISION  01 

AUTHOR  JOE  ENGINEER 

COMPANY  MONOLITHIC  MEMORIES,  INC. 

DATE  09/12/87 

CHIP  UNIV  SHI  PAL20X8 


CLK  10  DO  Dl  D2  D3  D4  D5  D6  D7  II  GND 
/OC  RILO  Q7  06  Q5  Q4  Q3  Q2  Ql  QO  LIRO  VCC 

DESCRIPTION 

THIS  PAL  DEVICE  IS  AN  8-BIT  SHIFT  REGISTER  WITH  PARALLEL  LOAD  AND 
HOLD  CAPABILITY.      TWO  FUNCTION  SELECT  INPUTS    (10,11)    PROVIDE  ONE  OF 
FOUR  OPERATIONS  WHICH  OCCUR  SYNCHRONOUSLY  ON  THE  RISING  EDGE  OF  THE 
CLOCK    (CLK).      THESE  OPERATIONS  ARE: 


/OC 

CLK 

11 

10 

D7-D0 

Q7-Q0 

OPERATION 

H 

X 

X 

X 

X 

Z 

HI-Z 

L 

C 

L 

L 

X 

L 

HOLD 

L 

C 

L 

H 

X 

SR(Q) 

SHIFT  RIGHT 

L 

c 

H 

L 

X 

SL(Q) 

SHIFT  LEFT 

L 

c 

H 

H 

D 

D 

LOAD 

TWO  OR  MORE  OCTAL  SHIFT  REGISTERS  MAY  BE  CASCADED  TO  PROVIDE  LARGER 
SHIFT  REGISTERS.      RILO  AND  LIRO  ARE  LOCATED  ON  PINS  14  AND  23 
RESPECTIVELY,   WHICH  PROVIDES  FOR  CONVENIENT  INTERCONNECTIONS  WHEN 
TWO  OR  MORE  OCTAL  SHIFT  REGISTERS  ARE  CASCADED  TO  IMPLEMENT  LARGER 
SHIFT  REGISTERS. 


EQUATIONS 


/Q0 

/I1*/I0*/Q0 

; HOLD  Q0 

+ 

/II*  I0VQ1 

;SHIFT  RIGHT 

:  +  : 

Il*/I0*/LIRO 

; SHIFT  LEFT 

+ 

11*  IOVDO 

; LOAD  DO 

/Ql 

/I1*/I0*/Q1 

; HOLD  Ql 

+ 

/II*  I0*/Q2 

; SHIFT  RIGHT 

:  + : 

I1*/I0*/Q0 

; SHIFT  LEFT 

+ 

11*  I0*/D1 

; LOAD  Dl 

/Q2 

/Il*/I0*/Q2 

; HOLD  Q2 

+ 

/ll*  I0*/Q3 

; SHIFT  RIGHT 

:  + : 

I1*/I0*/Q1 

,- SHIFT  LEFT 

+ 

11*  I0*/D2 

; LOAD  D2 

/Q3 

/Il*/I0*/Q3 

; HOLD  Q3 

+ 

/II*  I0*/Q4 

; SHIFT  RIGHT 

:  + : 

Il*/I0*/Q2 

; SHI  FT  LEFT 

+ 

11*  I0*/D3 

; LOAD  D3 

/Q4 

/Il*/I0*/Q4 

; HOLD  Q4 

+ 

/II*  I0*/Q5 

; SHIFT  RIGHT 

:  *  : 

Il*/I0*/Q3 

; SHIFT  LEFT 

+ 

11*  I0*/D4 

; LOAD  D4 

/os 

/Il*/I0*/Q5 

;HOLD  Q5 

+ 

/II*  I0*/Q6 

; SHIFT  RIGHT 

:  + : 

Il*/IO*/Q4 

; SHIFT  LEFT 

+ 

13*  I0*/D5 

; LOAD  D5 

Figure  43.  Octal  Shift  Register  Design  File 
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/Q6    :=  /11*/I0*/Q6 

+  /II*  I0*/Q7 

:+:  Il*/I0*/Q5 

+  II*  I0*/D6 

/Q7    :=  /Il*/I0*/Q7 

+     /II*  I0*/RILO 
:+:  Il*/I0*/Q6 
+        II*  I0*/D7 

/LIRO  =  /Q0 
L1RO.TRST  =  /I1*I0 

/RILO  =  /Q7 
RILO.TRST  =  I1«/I0 


HOLD  06 
SHIFT  RIGHT 
SHIFT  LEFT 
LOAD  D6 

HOLD  Q7 
SHIFT  RIGHT 
SHIFT  LEFT 
LOAD  D7 

; LEFT  IN  RIGHT  OUT 


; RIGHT   IN  LEFT  OUT 


SIMULATION  NOT  SHOWN  HERE 

Figure  43.  Octal  Shift  Register  Design  File  (Cont'd.) 


Barrel  Shifters 

In  most  data  processing  systems,  some  form  of  data  shifting  or 
rotation  is  necessary.  In  typical  computer  systems,  the  shifter  is 
located  at  the  output  of  the  ALU,  and  usually  requires  a  single- 
cycle  shift  and  add  function  (Figure  44).  For  such  applications  as 
floating-point  arithmetic  or  string  manipulation,  ordinary  shift 
registers  are  inefficient,  since  they  require  n  clock  cycles  for  an  n- 
bit  shift. 


INPUT  DATA  BUS 


>  REGISTER 

OE- 


OUTPUT  DATA  BUS 

Figure  44.  Typical  ALU  Architecture 


Aspecialized  shift  register,  called  a  "barrel  shifter,"  is  used  to  shift 
(or  rotate)  data  by  any  number  of  bits  in  a  single  clock  cycle.  The 
name  "barrel  shifter"  is  used  because  of  the  circular  nature  of  the 
shift  operation.  The  storage  registers  on  the  output  of  the  shifter 
are  used  in  this  architecture  to  pipeline  the  data  operation, 
increasing  throughput.  The  three-state  buffer  on  the  output 
registers  is  also  useful  for  providing  an  interface  to  the  data  bus. 

The  design  of  a  barrel  shifter  proceeds  in  the  same  manner  as  a 
regular  shift  register.  The  truth  table  is  drawn,  and  the  Boolean 
equations  are  then  written  based  upon  the  truth  tables.  An  eight- 
bit  barrel  shifter  requires  at  least  eight  data  inputs,  eight  regis- 
tered data  outputs,  three  control  lines  to  specify  the  shift  distance, 
a  clock  input  and  an  output  enable  that  controls  the  three-state 
buffer  on  the  register  output. 

Figure  45  shows  the  block  diagram  for  an  eight-bit  registered 
barrel  shifter,  while  Figure  46  shows  the  truth  table.  The  regis- 
tered barrel  shifter  requires  a  total  of  1 4  inputs  and  8outputs.  This 
function  can  be  easily  implemented  in  an  AmPAL22V10.  Figure 
47  shows  the  PLPL  design  file  for  this  eight-bit  barrel  shifter 
function. 


D7  D6  D5  D4  D3  D2  D,  D0 

UlltlU 


So  ■ 

s,  ■ 

S2  ■ 
CLK  ■ 


RST 
OE 


TTTTTTTT 

Q7  Q6  Q5  Q4  Q3  Q2  Q,  Q0 

Figure  45.  Block  Diagram  of  an  Eight-bit  Barrel  Shifter 
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S2 

S1 

so 

Q7 

Q6 

Q5 

Q4 

Q3 

Q2 

Q1 

QO 

0 

0 

0 

D7 

D6 

D5 

D4 

D3 

D2 

D1 

DO 

o 

o 

■1 

D6 

D5 

D4 

D3 

D2 

D1 

DO 

D7 

0 

1 

0 

D5 

D4 

D3 

D2 

D1 

DO 

D7 

D6 

0 

1 

1 

D4 

D3 

D2 

D1 

DO 

D7 

D6 

D5 

1 

0 

0 

D3 

D2 

D1 

DO 

D7 

D6 

D5 

D4 

1 

0 

1 

D2 

D1 

DO 

D7 

D6 

D5 

D4 

D3 

1 

1 

0 

D1 

DO 

D7 

D6 

D5 

D4 

D3 

D2 

1 

1 

1 

DO 

D7 

D6 

D5 

D4 

D3 

D2 

D1 

Figure  46.  Truth  Table  for  an  Eight-bit  Barrel  Shifter 

DEVICE       E IGHT_BI T_BARREL_SH I FTER  (PAL22V10) 

PIN  CLK  =  1 

D  [7:0]   =  2:9     /Q[7:0]   =  23:16 

SEL2         =  13         SEL1       =   11  SELO  «  10 

OE  =  14         RSY         =  15; 

BEGIN 

IF   (RST)   THEN  ARESET ( ) ;  "SHIFTER  INITIALIZATION" 

IF   (OE)      THEN  ENABLE (Q[ 7 : 0 )) ;    "ENABLE  OUTPUTS" 

IF   ( /SEL2   *   /SEL1   *  /SELO)   THEN  Q[7:0]    :=  D[7:0]; 

IF   (/SEL2   *   /SEL1   *     SELO)   THEN  Q[7:0]    :=  D[6:0],  D[7); 

IF   (/SEL2   *     SEL1   *  /SELO)   THEN  Q[7:0]    :=  D[5:0],  D(7:6] 

IF   (/SEL2   *     SEL1   *     SELO)   THEN  Q[7:0)    :=  D[4:0J,  D[7:5] 

IF   (   SEL2   *  /SEL1   *  /SELO)   THEN  Q[7:0]    :=  D(3:0],  D[7:4] 

IF   (    SEL2   *   /SEL1   *     SELO)   THEN  Q[7:0]    :=  D[2:0],  D[7:3) 

IF   (   SEL2   *     SEL1   *  /SELO)   THEN  Q[7:0]    :=  D[1:0],  D[7:2] 

IF   (   SEL2   *     SEL1   *     SELO)   THEN  Q[7:0]    :=  D[0],  D[7:l); 

END. 

"FUNCTION  TABLE  SPECIFICATION" 
TEST_VECTORS 

IN  CLK  RST  D[7:0]   SEL2  SEL1  SELO; 
I_0  OE; 
OUT  Q [ 7 : 0 ] ; 
BEGIN 

"CLK  RST  D(7:0]  SEL2     SEL1  SELO 


c 

1 

00000000 

1 

0 

1 

0 

ZZZZZZZZ ; 

c 

0 

11111111 

0 

1 

0 

1 

HHHHHHHH ; 

c 

1 

xxxxxxxx 

X 

X 

X 

1 

LLLLLLLL ; 

c 

0 

xxxxxxxx 

X 

X 

X 

0 

ZZZZZZZZ; 

c 

0 

00000000 

0 

0 

0 

1 

LLLLLLLL; 

c 

0 

11111111 

0 

0 

0 

1 

HHHHHHHH; 

c 

0 

01111111 

1 

0 

0 

1 

HHHHLHHH ; 

c 

0 

01111111 

0 

1 

0 

1 

HHHHHHLH ; 

c 

0 

01111111 

1 

1 

0 

1 

HHLHHHHH; 

END. 

Figure  47.  Design  File  for  an  Eight-bit  Barrel  Shifter 


Gray-code,  Johnson  Counter  and  Shift  Register  Device  Since  Gray-code  counters  are  often  used  as  system  clocks,  very 

Selection  Considerations  high  speed  PAL  devices  provide  the  highest  resolution  clocks. 


Gray-code  counters,  Johnson  counters  and  shift  registers  are  not        Barrel  shifters  are  very  logic-intensive  and  require  many  product 


very  logi 
minimal, 
mentatio 
most  PAl 

intensive;  the  number  of  product  terms  required  is  terms,  since  data  from  all  the  inputs  needs  to  be  accessible  at  any 
The  D-type  flip-flops  provide  the  most  efficient  imple-  output.  Registered  PLDs  with  a  large  number  of  product  terms 
is,  allowing  these  designs  to  be  easily  implemented  in  are  ideal  for  barrel  shifters.  Large  barrel  shifters  can  also  be 
.  devices.                                                            partitioned  into  a  number  of  PLDs. 
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OE  Q[7:0] 


Until  now  we  have  discussed  strictly  synchronous  registered 
designs,  where  a  common  system  clock  is  used.  In  asynchro- 
nous registered  designs,  a  common  clock  is  not  used.  The 
register  clock  may  be  generated  by  the  output  of  another  register, 
or  by  a  logical  combination  of  various  other  signals.  Such  designs 
are  usually  slow  for  such  applications  as  timing  generation, 
because  when  the  output  of  one  register  is  used  to  clock  another, 
multiple  delays  are  encountered  before  all  the  register  outputs 
stabilize.  On  the  other  hand,  designs  can  be  very  fast  for 
asynchronous  applications  such  as  bus  arbitration  and  control, 
where  a  fast  response  to  a  bus  signal  can  be  provided  without 
waiting  for  a  common  system  clock. 

Although  asynchronous  designs  are  easier  to  visualize,  they 
present  larger  problems  in  implementation.  Combinatorial  haz- 


worry  about  race  conditions  when  clocking  a  number  of  registers 
simultaneously.  Careful  design  analysis  is  strongly  recom- 
mended before  implementing  any  asynchronous  design. 

Ripple  counters  are  probably  the  easiest  examples  of  such 
asynchronous  designs.  Figure  48  shows  the  logic  diagram  of  a 
five-bit  binary  ripple  counter.  These  counters  clearly  have  the 
advantage  of  design  simplicity.  The  output  from  one  stage  is  fed 
as  the  clock  to  the  next  stage.  However,  this  results  in  a  slower 
counting  rate,  since  the  clock  signals  need  to  propagate  through 
all  five  registers  before  the  next  count  is  reached. 

The  ripple  counter  design  illustrated  below  is  implemented  in  a 
PAL20RA1 0.  Figure  49  shows  the  design  file  for  this  counter. 


Figure  48.  A  Five-bit  Ripple  Counter 


TITLE        Dual  5-Bit  Counter 

PATTERN     Upcount .  pds 

REVISION  01 

AUTHOR      Joe  Engineer 

COMPANY    Monolithic  Memories,  Inc. 

DATE  11/12/87 

CHIP  Upcount  PAL20RA10 


PL  CKA  CKB  ENA  ENB  NC  NC  NC  NC  NC  INIT  GND 
OE  QB4  QB3  QB2  QB1  QBO  QA4  QA3  QA2  QA1  QAO  VCC 


EQUATIONS 

;  The  First  Counter 


/QA4 

:=  QA4  * 

/ENA 

;   Toggle  if  lower  MSB 

is  zero 

+  /QA4  * 

ENA 

;  Hold  count 

QA4.CLKF 

=  /QA3 

;  Toggle 

QA4.RSTF 

=  INIT 

;  Initialize 

QA4.TRST 

=  VCC 

/QA3 

QA3  * 

/ENA 

;  Toggle  when  QA2  zero 

+  /QA3  * 

ENA 

;  Hold  count 

QA3.CLKF 

-  /QA2 

;  Toggle 

QA3.RSTF 

-  INIT 

;  Initialize 

QA3 .  TRST 

-  VCC 

Figure  49.  Design  File  for  a  Five-bit  Ripple  Counter 
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/QA2 

QA2.CLKF 
QA2.RSTF 
QA2.TRST 


:-  QA2 
+  /QA2 

-  /QA1 

-  INIT 

-  VCC 


/ENA  ;  Toggle  when  QA1  zero 

ENA     ;  Hold  count 

;  Toggle 

;  Initialize 


/QA1 

QA1.CLKF 
QA1.RSTF 
QA1.TRST 


QA1 
+  /QA1 

-  /QAO 

-  INIT 

-  VCC 


/ENA  ;  Toggle  when  QAO  zero 

ENA     ;  Hold  count 

;  Toggle 

;  Initialize 


/QAO 

QAO.CLKF 
QAO.RSTF 
QAO.TRST 


QAO 
+  /QAO 

-  CKA 

-  INIT 

-  VCC 


*  /ENA 

*  ENA 


Toggle  LSB 
Hold  count 
Toggle 
Initialize 


;  The  Second  Counter 


/QB4 

QB4 

* 

/ENB 

Toggle  if  lower  M 

SB  is  z 

+  /QB4 

ENB 

Hold  count 

QB4.CLKF 

-  /QB3 

Toggle 

Qd'J  .Kblr 

=  INIT 

Initialize 

QB4.TRST 

-  VCC 

/QB3 

:=  QB3 

* 

/ENB 

Toggle  when  QB2 

zero 

+  /QB3 

ENB 

Hold  count 

QB3.CLKF 

-  /QB2 

Toggle 

QB3.RSTF 

-  INIT 

Initialize 

QB3 . TRST 

-  VCC 

/QB2 

:-  QB2 

* 

/ENB 

Toggle  when  QB1 

zero 

+  /QB2 

* 

ENB 

Hold  count 

QB2.CLKF 

=  /QB1 

Toggle 

QB2.RSTF 

«  INIT 

t 

Initialize 

QB2.TRST 

=  VCC 

/QB1 

QB1 

* 

/ENB 

Toggle  when  QBO 

zero 

+  /QB1 

* 

ENB 

Hold  count 

QB1.CLKF 

-  /QBO 

Toggle 

QB1 .RSTF 

-  INIT 

Initialize 

QB1 . TRST 

-  VCC 

/QBO 

QBO 

* 

/ENB 

Toggle  LSB 

+  /QBO 

* 

ENB 

Hold  count 

QBO.CLKF 

-  CKB 

Toggle 

QBO. RSTF 

-  INIT 

Initialize 

QBO . TRST 

-  VCC 

SIMULATION 

TRACE_ON  /PL  QA4  QA3  QA2  QA1  QAO  QB4  QB3  QB2  QB1  QBO 

SETF  PL  INIT  /CKA  /CKB  /OE 

SETF  INIT  CKA  CKB 

SETF  /INIT  /CKA  /CKB 

CHECK  /QA4  /QA3  /QA2  /QA1  /QAO 

CHECK  /QB4  /QB3  /QB2  /QB1  /QBO 


Figure  49.  Design  File  for  a  Five-bit  Ripple  Counter  (Cont'd.) 
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SETF  CKA  CKB 

CHECK  /QA4  /QA3  /QA2  /QA1  QAO 
CHECK  /QB4  /QB3  /QB2  /QB1  QBO 
SETF  /CKA  /CKB 

SETF  CKA  CKB 

CHECK  /QA4  /QA3  /QA2  QA1  /QAO 
CHECK  /QB4  /QB3  /QB2  QB1  /QBO 
SETF  /CKA  /CKB 

SETF  CKA  CKB 

CHECK  /QA4  /QA3  /QA2  QA1  QAO 
CHECK  /QB4  /QB3  /QB2  QB1  QBO 
SETF  /CKA  /CKB 

SETF  CKA  CKB 

CHECK  /QA4  /QA3  QA2  /QA1  /QAO 
CHECK  /QB4  /QB3  QB2  /QB1  /QBO 
SETF  /CKA  /CKB 

SETF  CKA  CKB 

CHECK  /QA4  /QA3  QA2  /QA1  QAO 
CHECK  /QB4  /QB3  QB2  /QB1  QBO 
SETF  /CKA  /CKB 

TRACE_OFF 

Figure  49.  Design  File  for  a  Five-bit  Ripple  Counter  (Cont'd.) 


There  are  additional  independent  SET  and  RESET  controls  in  the 
PAL20RA1 0  which  can  be  used  to  restart  the  count.  If  a  modulo 
counter  is  desired,  these  SET  and  RESET  signals  can  be  used  as 
described  above.  Figure  48  shows  the  implementation  of  a 
modulo-20  counter  which  is  RESET  when  output  bits  Q4  and  Q2 
are  both  HIGH.  Since  the  RESET  is  implemented  with  a  product 
term,  the  extra  AND  gate  shown  can  be  implemented  directly 
within  the  PAL  device. 


Asynchronous  Designs  Device  Selection  Considerations 

The  device  selection  for  asynchronous  designs  is  easy.  As  the 
clock  signals  require  logic,  only  PLDs  which  allow  implementation 
of  Boolean  logic  on  the  clock  signals  are  useful.  Figure  50  shows 
a  list  of  devices  which  provide  programmable  clock  signals.  They 
also  provide  programmable  SET  and  RESET  product  terms,  and 
the  capability  of  bypassing  the  register. 


DEVICE 

TOTAL 

DEDICATED 

DEDICATED 

I/O 

LOGICAL 

SPEED  GRADES 

NUMBER 

INPUT 

REGISTERED 

PINS 

PRODUCT 

FREQUENCY  IN 

OF  PINS 

PINS 

OUTPUT 

TERMS/ 

MHz 

PINS 

OUTPUT 

^MAX 

TTL 

16RA8 

20 

8 

0-8 

8-0 

4 

20 

20RA10 

24 

10 

0-10 

10-0 

4 

33,  20 

CMOS 

C29MA16 

24 

5 

16 

4-12 

20,  15 

Figure  50.  Programmable  Logic  Devices  for  Asynchronous  Designs 
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Other  Applications  of 
Registered  PLDs 

Registered  PLDs  are  used  for  a  number  of  miscellaneous  appli- 
cations which  are  not  covered  by  the  synchronous  and  asynchro- 
nous design  applications  discussed  up  to  now.  In  the  subsequent 
discussion  we  will  cover  a  few  of  these  applications: 

•  Frequency  dividers 

•  Addressable  registers 

Frequency  Dividers 

Standard  synchronous  counters  provide  the  basic  capability  of 
dividing  an  input  frequency.  A  single  register  of  a  PAL  device  will 
let  us  divide  by  two.  If  we  stack  these  registers,  a  binary  counter 
provides  symmetrical  division  by  2,4,8,16,  etc.  This  divider  has 
been  a  standard  for  years,  and  the  PAL  device  has  always  been 
an  excellent  choice  for  such  applications. 

One  unique  application  of  PAL  devices  is  for  dividing  input 
frequencies  by  odd  numbers.  This  has  been  done  historically  by 
designing  a  counter  which  cycles  modulo  an  odd  number,  and 


decoding 
this  appr 
cycle  is  r 

Let's  exa 
be  imple 
at  four,  r 
shows  th 

the  specific  states  of  the  counter.  The  disadvantage  of 
oach  is  that  the  output  is  not  symmetrical  and  the  duty 
ot  50%. 

mine  a  simple  divide-by-five  counter.  This  counter  can 
rented  using  three  flip-flops  that  start  at  zero  and  reset 
^suiting  in  a  five-state  counter.  The  table  in  Figure  51 
e  outputs  of  the  three  individual  flip-flops. 

Prese 

nt  State 

Next  State 

Q2 

Q1  QO 

Q2    Q1  QO 

o  o  o  o  V 

0  0 

0  1 

1  0 

1  1 

0  0 

0      0      1      State  zero  to  one. 
0      1      0     State  one  to  two. 
0      1      1      State  two  to  three. 
10      0     State  three  to  four. 
0      0      0     State  four  to  zero. 

F 

The  Boo 

:igure  51.  Truth  Table  for  a  Five-bit  Counter 

ean  equations  are: 

Q2   :=  /Q2  *     Ql  *  Q0      ;MSB  bit 
Ql   :=  /Ql  *    QO  +  Ql  *  /Q0 
QO   :=  /Q2  *  /Q0  ;LSB  bit 


The  waveforms  for  this  divider  are  shown  in  Figure  52.  Notice  that 
the  Q2  output  goes  HIGH  for  one  state  and  that  this  output  is  one- 
fifth  of  the  input  frequency,  but  it  is  a  20%  duty  cycle.  Q1  is  active 
for  two  states;  it  provides  the  same  frequency,  but  with  a  40%  duty 
cycle.  If  we  want  a  50%  duty  cycle  we  are  going  to  have  to  divide 
a  state  in  half. 

To  provide  the  50%  duty  cycle,  the  two  edges  should  be  evenly 
spaced  in  the  count  sequence,  one  edge  in  the  middle  of  state  two 
and  one  at  the  beginning  of  state  zero.  The  first  edge  can  be 
formed  by  logically  "ANDing"  state_2  with  the  falling  edge  of  the 
clock.  The  second  edge  can  be  formed  by  decoding  state  zero. 

edge_l  =  /clock  *  /Q2  *  Ql  *  /Q0     ;edge  between 

; states  two  and 
; three 

edge_2  -  /Q2  *  /Ql  *  /Q0  ;edge  at  state 

;  zero 

The  logical  "OR"  of  these  two  equations  will  provide  the  needed 
rising  edges.  To  provide  a  clean  output,  this  signal  should  clock 
another  output  register. 

The  next  step  in  the  design  is  to  pick  the  appropriate  PAL  device 
to  fit  this  design.  Our  biggest  concern  is  that  we  need  the 
capability  of  clocking  the  counter  at  one  speed  and  the  output  flip- 
flop  at  another.  To  do  this,  we  cannot  use  a  PAL  device  that  has 
a  dedicated  clock  pin;  we  need  an  architecture  that  allows  pro- 
grammable clocks.  The  PAL20RA10,  PAL16RA8  and 
AmPALC29MA16  have  this  feature.  This  design  uses  the 
PAL20RA1 0  device. 

Since  the  clock  signal  requires  two  product  terms  (one  for  each 
edge),  and  the  PAL20RA10  clock  signal  has  only  one  product 
term,  this  implementation  is  not  possible.  Another  technique  is  to 
use  the  independent  asynchronous  SET  and  asynchronous 
RESET  product  terms  of  the  PAL20RA1 0  output  register.  A  HIGH 
on  the  SET  product  term  asserts  the  register  output,  and  a  HIGH 
on  the  RESET  product  term  unassertsthe  register  output.  Due  to 
the  asynchronous  nature  of  the  product  terms  some  adjustment 
in  timing  is  required.  The  SET  product  term  is  asserted  when  in 
state  0  (Q2=0,  Q1  =0  and  Q0=0),  and  the  RESET  product  term  is 
asserted  when  between  states  two  and  three. 

OUTPUT.  SET  -  /clock  *  /Q2  *  Ql  *  /Q0   ;  set  between 

; states  2  S  3 

OUTPUT. RESET  -  /Q2  *  /Ql  *  /Q0  ; reset  at 

; state  zero 
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TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 


DIVIDE  BY  FIVE  EXAMPLE 

DIVBY5.PDS 

A 

BARRY  SEIDNER 

MONOLITHIC  MEMORIES  INC.,   SANTA  CLARA,  CA 

7/25/1987 


CHIP  DIVIDER  PAL20RA10 

PL  CLK  13  14  15  16  17  18  19  110  111  GND 
OE  Q14  Q15  Q16  Q17  Q18  Q19  Q20     /Q21  /Q22  /Q23  VCC 


EQUATIONS 

;The  following  equations  delineate  the  three  bit  counter  of  our 
;state  machine  driver.  It  has  a  total  count  of  five  starting 
;at   zero  and  counting  thru  four,   then  resetting  to  zero. 


Q23:=  Q21   *  Q22 

Q23.CLKF=CLK 

Q23.TRST=VCC 

Q22:=/Q22   *  Q21 

Q22.CLKF=CLK 

Q22.TRST=VCC 


/Q23 


+  022   *  /Q21 


;MSB  OF  STATE  MACHINE 

; CLOCK  ON  RISING  EDGE  OF  CLK  PIN 

; OUTPUT  ALWAYS  ON 

;BIT  1  OF  STATE  MACHINE 

[CLOCK  ON  RISING  EDGE  OF  CLK  PIN 

; OUTPUT  ALWAYS  ON 


Q21:=/Q21   *  /Q23 
Q21 . CLKF=CLK 
Q21 .TRST=VCC 


LSB  OF  STATE  MACHINE 

CLOCK  ON  RISING  EDGE  OF  CLK 

OUTPUT  ALWAYS  ON 


;The  last  cell  of  our  divide  by  five  counter  is  the  actual 
;output  of  divider.   It  divides  by  setting  and  resetting  at  the 
jappropriate  time  in  the  count  sequence.  The  output  will  stay  high 
;for  count  zero,  one,  and  the  first  half  of  two-then  go  low  for 
; the  second  half  of  two,  thru  three  and  four. 

Q20.SETF  =  /Q23   *     Q22   *  /Q21   *  /CLK  ;SET  ON  COUT  THREE  AND  1/2 

Q20.RSTF  =  /Q23   *   /Q22   *   /Q21  ; RESET  ON  COUNT  ZERO 

Q20.CLKF  =  GND  ;CLOCK   INPUT  NOT  USED 

Q20  :=  GND  ; DATA  INPUT  NOT  USED 

Q20.TRST  =  VCC  ; OUTPUT  ALWAYS  ON 


SIMULATION 

;This  simulation  will  run  the  divider  through 
;several  passes  of  the  count  to  verify  operation. 

TRACEON         OE  PL  CLK  Q23  Q22  Q21  Q20 

; Initialization  of  all  inputs.  Before  the  simulator  can  be 
;executed,  all  of  the  inputs  to  the  PAL  device  should  be 
;set  to  a  specific  value.  This  section  of  simulator 
;declaration  sets  three  input  variables  inactive. 

SETF         PL  /OE     /CLK  ; SET  PRELOAD  FALSE 

;SET  OUTPUT  ENABLE 
;SET  CLOCK  LOW 

;The  PRLDF  statement  will  set  the  registers  to  a  known 
,-state  at  the  beginning  of  our  simulation.  This  line  sets 
;us  to  state  zero. 

PRLDF       /Q23  /Q22  /Q21  Q20 

;The  FOR  loop  statement  exemplifies  an  easy  way  to  perform 
[repetitive  functions.  In  this  example,   the  "RA"  cell  does 
;not  have  a  dedicated  clock  input  so  this  routine  will 
;generate  30  repetitive  clocks  to  drive  our  design. 

FOR   I:=l  TO  30  DO 
BEGIN 

SETF     CLK  ;SET  CLOCK  HIGH 

SETF  /CLK  ;SET  CLOCK  LOW 

END 

;End  of  simulation 
TRACE_OFF 

Figure  53.  Design  File  for  Frequency  Divider 
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Addressable  registers  are  commonly-used  MSI  functions,  often 
implemented  in  PAL  devices.  Addressable  registers  are  used  as 
building  blocks  for  digital  computers.  Depending  upon  the  ad- 


previous  vaiDes.  An  example  ot  a  to-dii  aaaressaoie  register  i 
shown  in  Figure  54. 


:  REGISTER 


TITLE  16-BIT  ADDRE: 

PATTERN  ADREG16.PDS 

REVISION  A 

AUTHOR  John  Birlcnar 

COMPANY  Monolithic  Haworias  Inc.   Santa  Clara,  CA 

DATE  2/11/B5 


.star  loads  one  of  16  registers 
I  data  input,  DATA. 


AO  *Q14 
/Al  *Q1* 
/A2  -Q14 
/A3*Q14 
/AO*  Al-  A2*  A3 *DATA 

■  /AO  *Q15 
/Al  *Q15 
/A2  *Q15 
/A3*Q1S 
AO*  Al*  A2*  A3 *DATA 


.•hold 
jhold 
;hold 


;hold 
.-hold 
jhold 
;hold 
I  load 


CHIP  ADREG16  PAL32R16 

QO  Ql  Q2  Q3  /El  NC  NC  AO  Al  VCC  A2  A3  DATA  NC  /PRLD2  CLK2 

Q4  Q5  Q6  Q7  QB  Q9  Q10  Qll  /E2  NC  NC  NC  NC  GND  NC  NC  NC  NC  /PRLD1  CLK1 

Q12  Q13  Q14  Q15 


EQUATIONS 
QO  : -  AO 


*Q0 
*Q0 


Al 

A2 

/A0*/A1*/A2*, 

i  /AO  *Q1 
Al  *Q1 
A2  *Q1 
A3*Q1 
A0*/A1*/A2*/A3*DATA 


r     /AO*  Al*/A2*/A3*l 


/AO 


*Q3 
*C3 


*  /A2  *Q4 

A3-Q4 

i-     /AO*/Al*  A 2 */A3 *DATA 


/A2  *Q5 
A3*Q5 
AO*/Al*  A2*/A3*DATA 


+  /AO* 
/AO 


/A2  *Q6 
A3*Q6 
.*  A2*/A3*DATA 


/A2  »Q7 
A3*Q7 
*  A2*/A3*DATA 


+  /A3*Q9 

+  A0*/A1*/A2*  A3  *  DATA 

:-  AO  *Q10 

+  /Al  *Q10 

+  A2  *Q10 

+  /A3*Q10 

+  /AO*  Al*/A2*  A3 "DATA 


/AO 


/Al 


•Qll 
•Qll 
•Qll 
/A3*Q11 
AO*  Al"/A2*  A3  *  DATA 

AO  *Q12 
Al  *Q12 
/A2  -Q12 
/A3*Q12 
/A0*/A1*  A2*  A3* DATA 


■  /AO 


*Q13 
•Q13 
/A2  *Q13 
/A3*Q13 
A0*/A1*  A2*  A3 *DATA 


;hold 
;hold 
jhold 
ihold 
;load 

;hold 


;hold 
jhold 
;load 

;hold 
ihold 
;hold 
.•hold 
;load 

;ho!d 
;hold 
;hold 
;hold 
;load 

;hold 
;hold 
ihold 
ihold 
;load 

;hold 
;hold 
;hold 
rhold 
;  load 

ihold 
;hold 
jhold 
ihold 
;  load 

;hold 
;hold 
;hold 
fhold 
;load 

;hold 
;hold 
ihold 
;hold 
I  load 

,-hold 
;hold 
;hold 
;hold 
;load 

jhold 
;hold 
jhold 
jhold 
;load 

jhold 
jhold 
,-hold 
jhold 
I  load 

;hold 
jhold 
;hold 
jhold 
;  load 

ihold 
jhold 
jhold 
jhold 
j  load 


SIMULATION 

TRACE  ON  QO  Ql  Q2  Q3  Q4  Q3  Q6  Q7  Q8  Q9  Q10  Qll  Q12  Q13  Q14  Q15 
AO  Al  A2  A3  DATA 

SETF  El  E2  /DATA  /PR1D1  /PRLD2 


SETF  AO  /Al  /A2  /A3 
CLOCKF  CLK1  CLK2 


SETF  AO  Al  A2  /A3 
CLOCKF  CLX1  CLX2 


SETF  /AO  /Al  /A2 
CLOCKF  CLK1  CLK2 


SETF  AO  /Al  /A2 
CLOCKF  CLK1  CLK2 


SETF  /AO  Al  A2 
CLOCKF  CLK1  CLK2 


Q7 
Q8 


Q12 
Q13 
Q14 

Q15 


g  g  cgcgcg 
XXXXLLLLLL 
XXXXXXLLLL 
XXXXXXXXLL 

XXXXXXXXXX 

xxxxxxxxxx 

XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXLLLHHLLH 
XXLLLLLHHH 


cgcgcgcgcg 
LLLLLLLLLL 
LLLLLLLLLL 


cgcgcgcgcg 
LLLLLLLLLL 
LLLLLLLLLL 


XXXXXXLLLL 
XXXXXXXXLL 
XXXXXXXXXX 
XXXXXXXXXX 

XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
HLLHHLLHHL 
HLLLLHHHHL 


XXTJiTJiTiI  TT-  n'  t.t.t.t.t.t.h 
.  LLLLIJ.7J.T.T.  LLILTJ.IJI.L 


LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
XXLLLLLLLL 
XXXXLLLLLL 
XXXXXXLLLL 
XXXXXXXXLL 
XXXXXXXXXX 
XXXXXXXXXX 
XXXXXXXXXX 
LHHLLKHLLH 
LLLHHHHLLL 
LLLLLLLHHH 


cgcgcgc 
LLLLLLH 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
LLLLLLL 
XXLLLLL 
XXXXLLL 
HLLHHLL 
LHHHHLL 


TJ.TJ.T.T.T.T.T.T.  LLLLLHH 


Figure  54.  A  Slxteen-bit  Addressable  Register  Design  File 
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Introduction 


State  machine  designs  are  widely  used  for  sequential  control 
logic,  which  forms  the  core  of  many  digital  systems.  State  ma- 
chines are  required  in  a  variety  of  applications  covering  a  broad 
range  of  performance  and  complexity;  low-level  control  of  micro- 
processor-to-VLSI-peripheral interfaces,  bus  arbitration  and 
timing  generation  in  conventional  microprocessors,  custom  bit- 
slice  microprocessors,  data  encryption  and  decryption  and  trans- 
mission protocols  are  but  a  few  examples. 

Typically,  the  details  of  control  logic  are  the  last  to  be  settled  in  the 
design  cycle,  since  they  are  continuously  affected  by  changing 
system  requirements  and  feature  enhancements.  Program- 
mable logic  is  a  forgiving  solution  for  control  logic  design  because 
it  allows  easy  modifications  to  be  made  without  disturbing  PC 
board  layout.  Its  flexibility  provides  an  escape  valve  that  permits 
design  changes  without  impacting  time-to-market. 

A  majority  of  registered  PAL  device  applications  are  sequential 
control  designs  where  state  machine  design  techniques  are 
employed.  As  technology  advances,  new  high-speed  and  high- 
functionality  devices  are  being  introduced  which  simplify  the  task 
of  state  machine  design.  A  broad  spectrum  of  different  function- 
ality-and-performance  solutions  is  available  for  state  machine 
design.  In  this  discussion  we  will  examine  the  functions  per- 
formed by  state  machines,  their  implementation  on  various  de- 
vices, and  their  selection.  Finally,  we  will  implement  a  state 


machine  design  and  go  through  all  of  the  stages  involved  in  a 
design  tutorial. 

What  is  a  State  Machine? 

A  state  machine  is  a  digital  device  which  traverses  through  a 
predetermined  sequence  of  states  in  an  orderly  fashion.  A  state 
is  a  set  of  values  measured  at  different  parts  of  the  circuit.  A 
simple  state  machine  can  consist  of  PAL-device-based  combina- 
torial logic,  output  registers,  and  buried  (state)  registers.  The 
state  in  such  a  sequencer  is  determined  by  the  values  stored  in 
the  buried  and/or  output  registers. 

A  general  form  of  a  state  machine  can  be  depicted  as  a  device 
shown  in  Figure  1 .  In  addition  to  the  device  inputs  and  outputs, 
a  state  machine  consists  of  two  essential  elements:  combinatorial 
logic  and  memory  (registers).  This  is  similar  to  the  registered 
counter  designs  discussed  on  page  2-66,  which  are  essentially 
simple  state  machines.  The  memory  is  used  to  store  the  state  of 
the  machine.  The  combinatorial  logic  can  be  viewed  as  two 
distinct  functional  blocks:  the  next  state  decoder  and  the  output 
decoder  (Figure  2).  The  next  state  decoder  determines  the  next 
state  of  the  state  machine  while  the  output  decoder  generates  the 
actual  outputs.  Although  they  perform  two  distinct  functions, 
these  are  usually  combined  into  one  combinatorial  logic  array  as 
in  Figure  1. 


COMBINATORIAL  LOGIC 


DEVICE 
INPUTS  ' 


NEXT 
STATE 
DECODE 


OUTPUT 
DECODE 


Figure  1.  Block  Diagram  of  a  Simple  State  Machine 


COMBINATORIAL 
LOGIC 


COMBINATORIAL 
LOGIC 


NEXT 
STATE 
DECODER 
(TRANSITION 
FUNCTION) 


MEMORY 
(REGISTERS) 


OUTPUT 
DECODER 
(OUTPUT 
DECODE 
FUNCTION) 


Figure  2.  State  Machine,  with  Separate  Output  &  Next  State  Decoders 
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The  basic  operation  of  a  state  machine  is  twofold: 

1 .  It  traverses  through  a  sequence  of  states,  where  the  next  state 
is  determined  by  next  state  decoder,  depending  upon  the 
present  state  and  input  conditions. 

2.  It  provides  sequences  of  output  signals  based  upon  state 
transitions.  The  outputs  are  generated  by  the  output  decoder 
based  upon  present  state  and  input  conditions. 

Using  input  signals  for  deciding  the  next  state  is  also  known  as 
branching.  In  addition  to  branching,  complex  sequencers  provide 
the  capability  of  repeating  sequences  (looping)  and  subroutines. 
The  transitions  from  one  state  to  another  are  called  control 
sequencing  and  the  logic  required  for  deciding  the  next  states  is 
called  the  transition  function  (Figure  2). 

The  use  of  input  signals  in  the  decision-making  process  for  output 
generation  determines  the  type  of  a  state  machine.  There  are  two 
widely-known  types  of  state  machines:  Mealy  and  Moore 
(Figure  3).  Moore  state  machine  outputs  are  a  function  of  the 
present  state  only.  In  the  more  general  Mealy-type  state  ma- 
chine, the  outputs  are  functions  of  both  the  state  and  the  input 
signals.  The  logic  required  is  known  as  the  output  function.  For 
either  type,  the  control  sequencing  depends  upon  both  states  and 
input  signals. 

Most  practical  state  machines  are  synchronous  sequential  cir- 
cuits which  rely  on  clock  signals  to  trigger  the  state  transitions.  A 
single  clock  is  connected  to  all  of  the  state  and  output  edge- 
triggered  flip-flops,  which  allows  a  state  change  to  occur  on  the 
rising  edge  of  the  clock.  Asynchronous  state  machines  are  also 
possible,  which  utilize  the  propagation  delay  in  combinatorial 
logic  for  the  memory  function  of  the  state  machine.  Such 
machines  are  highly  susceptible  to  hazards,  hard  to  design  and 
are  seldom  used.  In  our  discussion  we  will  focus  solely  on 
sequential  state  machines. 


State  Machine  Applications 

State  machines  are  used  in  a  number  of  system  control  applica- 
tions. A  sampling  of  a  few  of  the  applications,  and  how  state 
machines  are  applied,  is  described  below. 

As  sequencers  for  digital  signal  processing  (DSP)  applications, 
state  machines  offer  speed  and  sufficient  functionality  without  the 
overkill  of  complex  microprocessors.  For  simple  algorithms,  such 
as  those  involved  in  performing  a  Fast  Fourier  Transform  (FFT), 
a  state  machine  can  control  the  set  of  vectors  that  are  multiplied 
and  added  in  the  process.  For  complex  DSP  operations,  a  pro- 
grammable DSP  may  be  better.  On  the  other  hand,  the  program- 
mable DSP  solution  is  not  likely  to  be  as  fast  as  the  dedicated 
hardware  approach. 

Consider  the  case  of  a  video  controller.  It  generates  addresses 
for  scanning  purposes,  using  counters  with  various  sequences 
and  lengths.  But  instead  of  implementing  these  as  actual  count- 
ers, the  sequences  involved  can  be  "unlocked"  and  implemented, 
instead,  as  state  machine  transitions.  And  there  is  an  advantage 
beyond  mere  economy  of  parts.  A  count  can  be  set  or  initiated, 
then  left  to  take  care  of  itself,  freeing  the  microprocessor  for  other 
operations. 

In  peripheral  control  the  simple  state  machine  approach  can  be 
very  efficient.  Consider  the  case  of  run-length-limited  (RLL)  code. 
Both  encoding  and  decoding  can  be  translated  into  state  ma- 
chines, which  examine  the  serial  data  stream  as  it  is  read,  and 
generate  the  output  data. 

Industrial  control  and  robotics  offer  further  areas  where  simple 
control  functions  are  required.  Such  tasks  as  mechanical  posi- 
tioning of  a  robot  arm,  simple  decision  making,  and  calculation  of 
a  trigonometric  function,  usually  do  not  require  the  high-power 
solution  of  microprocessors  with  stacks  and  pointers.  Rather, 
what  is  required  is  a  device  that  is  capable  of  storing  a  limited 
number  of  states  and  allows  simple  branching  upon  conditions. 
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Figure  3.  The  Two  Standard  State  Machine  Models 
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Data  encryption  and  decryption  present  similar  problems  to  those 
encountered  in  encoding  and  decoding  for  mass  media,  only  here 
it  is  desirable  to  make  the  scheme  not  so  obvious.  A  program- 
mable state  machine  device  with  a  security  fuse  is  Ideal  for  this 
because  memory  is  internally  programmed  and  cannot  be  ac- 
cessed by  someone  tampering  with  the  system. 


Control  sequencing  is  represented  in  the  state  transition  diagram 
as  shown  in  Figure  5.  Direct  control  sequencing  requires  an 
unconditional  transition  from  state  A  to  state  B.  Similarly  condi- 
tional control  sequencing  shows  a  conditional  transition  from 
state  C  to  either  state  D  or  state  E,  depending  upon  input 
signal  11. 


Functions  Performed 

All  of  the  system  designf  unctions  performed  by  controllers  can  be 
categorized  as  one  of  the  following  state  machine  functions: 


Arbitration 
Event  monitoring 
Multiple  condition  testing 
Timing  delays 
Control  signal  generation 


Later  we 
functions 


will  take  a  design  example  and  illustrate  how  these 
be  used  when  designing  a  state  machine. 


State  Machine  Theory 

Let  us  take  a  brief  look  at  the  underlying  theory  for  all  sequential 
logic  systems,  the  finite  state  machine  (FSM),  or  simply  state 
machine. 

Those  parts  of  digital  systems  whose  outputs  depend  on  their 
past  inputs  as  well  as  their  current  ones  can  be  modeled  as  finite 
state  machines.  The  "history"of  the  machine  is  summed  up  in  the 
value  of  its  internal  state.  When  a  new  input  is  presented  to  the 
FSM,  an  output  is  generated  which  depends  on  this  input  and  the 
present  state  of  the  FSM,  and  the  machine  is  caused  to  move  into 
a  new  state,  referred  to  as  the  next  state.  This  new  state  also 
depends  on  both  the  input  and  present  state.  The  structure  of  an 
FSM  is  shown  pictorially  in  Figure  2.  The  internal  state  is  stored 
in  a  block  labelled  "memory".  As  discussed  earlier,  two  combina- 
torial functions  are  required:  the  transition  function,  which  gener- 
ates the  value  of  the  next  state,  and  the  output  function,  which 
generates  the  state  machine  output. 

State  Diagram  Representation 

The  behavior  of  an  FSM  may  be  specified  in  graphical  form  as 
shown  in  Figure  4.  This  is  called  a  state  diagram,  or  state 
transition  diagram.  Each  bubble  represents  a  state,  and  each 
arrow  represents  atransition  between  states.  Inputs  which  cause 
the  transitions  are  shown  next  to  each  transition  arrow. 


OUTPUTS       (  STATE  El 

Figure  4.  State  Machine  Representation 


b.  Conditional  Control 
Sequencing 


Figure  5.  Control  Sequencing 

For  Moore  machines  the  output  generation  is  represented  by 
assigning  outputs  with  states  (bubbles)  as  shown  in  Figure  6. 
Similarly,  for  Mealy  machines  conditional  output  generation  is 
represented  by  assigning  outputs  to  transitions  (arrows),  as  was 
shown  in  Figure  4.  More  detail  on  Mealy  and  Moore  output 
generation  is  given  later. 


a.  Moore  Machine 


b.  Mealy  Machine 


407  07 


Figure  6.  Output  Generation 


For  this  notation,  there  is  a  specification  uncertainty  as  to  which 
signals  are  outputs  or  inputs,  as  they  both  occur  on  the  drawing 
next  to  the  arrow  in  which  they  are  active.  This  is  usually  resolved 
by  separating  the  input  and  output  signals  names  with  a  line 
(Figures  4  &  6).  Sometimes  an  auxiliary  pin  list  detailing  the  logic 
polarity  and  input  or  output  designations  is  also  used. 

State  transition  diagrams  can  be  made  more  compact  by  writing 
on  the  transitions  not  the  input  values  which  cause  the  transition, 
as  in  Figure  4,  but  a  Boolean  expression  defining  the  input 
combination  or  combinations  which  cause  this  transition.  For 
example,  in  Figure  7,  some  transitions  have  been  shown  for  a 
machine  with  inputs  "START",  "X1"  and  "X2".  In  the  transition 
between  states  1  and  2,  the  inputs  X1  and  X2  are  ignored  (that  is, 
they  are  "don't  cares")  and  thus  do  not  appear  on  the  diagram. 
This  saves  space  and  makes  the  function  more  obvious. 


Figure  7.  State  Transition  Diagram  with  Mnemonics 
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three  input  combinations,  {10,  11,  12,  13}  =  {1011},  {1101}  and 
{1111},  which  make  both  (/I07I2  +  13)  and  (10*11  +  I0*I2)  true. 
Since  a  transition  to  two  next  states  is  impossible,  this  is  an  error 
in  the  specification.  It  must  either  be  guaranteed  that  these  input 
combinations  never  occur,  or  the  transition  conditions  must  be 
modified.  In  this  example,  changing  (10*11  +  10*12)  to  (10*11  + 
I0*I2)7I3  would  solve  the  problem. 


ALL  OTHER 
COMBINATIONS 


Figure  8.  State  Diagram  with  Conflicting 
Branch  Conditions 

State  Transition  Table  Representation 

A  second  method  for  state  machine  representation  is  the  tabular 
form  known  as  the  state  transition  table,  which  has  the  format 
shown  in  Figure  9.  Along  the  top  are  listed  all  of  the  possible  input 
bit  combinations  and  internal  states.  Each  row  gives  the  next 
state  and  the  next  output;  the  table  thus  specifies  the  transition 
and  output  functions.  This  type  of  table,  however,  is  not  suitable 
for  specifying  practical  machines  in  which  there  is  a  large  number 
of  inputs,  since  each  input  combination  defines  a  row  of  the  table. 
With  10  inputs  for  example,  there  would  have  to  be  1024  rows! 
A  modified  version  of  this  table  is  often  used  directly  for  program- 
mable logic  sequencer  (PLS)  device  design. 
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Figure  9.  A  State  Transition  Table 
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particular  combination  or  set  of  combinations  of  the  input  vari- 
ables. A  path  does  not  have  to  include  all  input  variables;  thus  it 
accommodates  "don't  cares".  These  decision  trees  take  more 
space  than  the  expressions  would,  but  in  many  practical  cases, 
state  machine  controllers  only  test  a  small  subset  of  the  input 
variables  in  each  state  and  the  trees  are  quite  manageable.  Also, 
the  chain  of  decisions  often  mirrors  the  designer's  way  of  thinking 
about  the  actions  of  the  controller.  It  is  important  to  note  that  these 
tests  are  not  performed  sequentially  in  the  FSM;  all  are  performed 
in  parallel  by  the  FSM's  state  transition  logic. 

A  benefit  of  this  method  of  specifying  transitions  is  that  the 
problem  of  Figure  8  can  be  avoided.  Such  a  conflict  would  be 
impossible  as  one  path  cannot  diverge  to  define  paths  to  two 
states. 

When  there  is  no  danger  of  conflicts  due  to  multiple  next  states 
being  defined,  this  flowchart  notation  can  be  compacted  by 
allowing  more  complex  decisions.  Expressions  can  be  tested,  as 
shown  in  Figure  11a,  or  multiple  branches  can  extend  from  a 
decoding  box,  as  in  Figure  11b.  In  the  second  case  it  is 
convenient  to  group  the  set  of  binary  inputs  into  a  vector,  and 
branch  on  different  values  of  this  vector. 

The  three  methods  of  state  machine  representation — state  dia- 
grams, state  tables,  and  flowcharts — are  all  equivalent  and 
interchangeable,  since  they  all  describe  the  same  hardware 
structure.  Each  style  has  its  own  particular  advantages.  Although 
most  popular,  the  state  transition  diagrams  are  more  complex  for 
problems  where  state  transitions  depend  on  many  inputs,  since 
the  transition  conditions  are  written  directly  on  the  transition 
arrows.  Although  cumbersome,  the  state  tables  allow  the  de- 
signer tight  control  over  signal  logic.  Flowcharts  are  convenient 
for  small  problems  where  there  are  not  more  than  about  ten  states 
and  where  up  to  two  or  three  inputs  or  input  expressions  are 
tested  in  each  state.  For  larger  problems,  they  can  become 
ungainly. 

Once  a  state  machine  is  defined,  it  must  be  implemented  on  a 
device.  Software  packages  are  then  used  to  implement  the 
design  on  a  device.  The  task  is  to  convert  the  state  machine 
description  into  transition  and  output  functions.  Software  pack- 
ages also  account  for  device-specific  architectural  variations  and 
limitations,  to  provide  a  uniform  user  interface. 


Another  popular  notation  is  based  on  flowcharts.  In  this  notation, 
states  are  represented  by  rectangular  boxes,  and  alternative 
state  transitions  are  determined  by  strings  of  diamond-shaped 
boxes.  The  elements  may  have  multiple  entry  points,  but  in 
general  have  only  one  exit.  The  state  name  is  written  as  the  first 
entry  in  the  rectangular  state  box.  Any  Moore  outputs  present  are 
written  next  in  the  state  box,  with  a  caret  (""")  following  those  that 
are  unregistered.  The  state  code  assignment,  if  it  is  known,  is 
written  next  to  the  upper  right  corner  of  the  state  box.  Decision 
boxes  are  diamond  or  hexagonal  shaped  boxes  containing  either 
an  input  signal  or  a  logic  expression.  Two  exits  labelled  "0"  and 
"1"  lead  to  either  another  decision  box,  a  state  box,  or  a  Mealy 
output.  The  rounded  oval  is  used  for  Mealy  machine  outputs. 
Once  again,  a  caret  ("**)  follows  those  outputs  that  are  unregis- 
tered. All  of  the  boxes  may  need  to  be  expanded  to  accommodate 
a  number  of  output  signals  or  a  larger  expression. 
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Figure  10.  Flowchart  Notation 
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Some  software  packages  accept  all  of  the  three  different  state 
machine  representations  directly  as  design  inputs.  However,  the 
most  prevalent  design  methodology  is  to  convert  the  three  state 
machine  design  representations  to  a  simple  textual  representa- 
tion. Textual  representations  are  accepted  by  most  software 
packages  although  the  syntax  varies.  The  PALASM  2  software 
package  offers  one  such  simple  and  easy-to-use  state  machine 
textual  representation.  The  task  of  converting  from  a  state 
transition  diagram  and  flowchart  representation  to  PALASM  2 
software  state  machine  syntax  is  demonstrated  in  a  design 
tutorial  on  page  2-122. 

Since  the  most  common  of  all  state  machine  representations  is 
the  state  transition  diagram  representation,  we  will  use  it  in  all 
subsequen  discussions.  Transition  table  and  flowchart  repre- 
sentation implementations  will  be  very  similar. 


State  Machine  Types:  Mealy  &  Moore 

With  the  state  machine  representation  clarified,  we  can  now 
return  to  the  generic  sequencer  model  of  Figure  1,  which  has 
been  labelled  (Figure  12)  to  show  the  present  state  (PS),  next 
state  (NS)  and  output  (OB,  OA).  This  will  illustrate  how  Mealy  and 
Moore  machines  are  implemented  with  most  sequencer  devices 
which  provide  asingle  combinatorial  logic  array  for  both  next  state 
and  output  decode  functions.  There  are  four  ways  of  using  the  se- 
quencer, two  of  which  implement  Moore  machines  and  two  Mealy. 
First,  let  us  look  at  the  Mealy  forms. 

The  standard  Mealy  form  is  shown  in  Figure  13,  where  the  signals 
are  labelled  as  in  Figure  1 2  to  indicate  which  registers  and  outputs 
are  used.  The  register  outputs  PS  are  fed  back  into  the  array  and 
define  the  present  state.  The  combinatorial  logic  implements  the 
transition  function,  which  produces  the  next  state  flip-flop  inputs 
NS,  and  the  output  function,  which  produces  the  machine  output 
OB.  This  is  the  asynchronous  Mealy  form. 
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Figure  11.  Using  Flowcharts 
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Figure  12.  Generic  Model  of  an  FSM 
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Figure  13.  Asynchronous  Mealy  Form 
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An  alternative  Mealy  form  is  shown  in  Figure  1 4.  Here  the  outputs 
are  passed  through  an  extra  output  register  (OA)  and  thus  do  not 
respond  immediately  to  input  changes.  This  is  the  synchronous 
Mealy  form. 

The  standard  Moore  form  is  given  in  Figure  1 5.  Here  the  outputs 
OB  depend  only  on  the  present  state  PS.  This  is  the  asynchro- 
nous Moore  form.  The  synchronous  Moore  form  is  shown  in 
Figure  16.  In  this  case  the  combinatorial  logic  can  be  assumed 
to  be  the  unity  function.  The  outputs  (OB)  can  be  generated 
directly  along  with  the  present  state  (PS).  Although  these  forms 
have  been  described  separately,  a  single  sequencer  is  able  to 
realize  a  machine  which  combines  them,  provided  that  the 
required  paths  exist  in  the  device. 


In  the  synchronous  Moore  form,  the  outputs  occur  in  the  state  in 
which  they  are  named  in  the  state  transition  diagram.  Similarly  in 
the  asynchronous  Mealy  and  Moore  forms  the  outputs  occur  in 
the  state  in  which  they  are  named,  although  delayed  a  little  by  the 
propagation  delay  of  the  output  decoder.  This  is  becausethey  are 
combinatorial  functions  of  the  state  (and  inputs  in  the  Mealy 
case). 

However,  the  synchronous  Mealy  machine  is  different.  Here  an 
output  does  not  appear  in  the  state  in  which  it  is  named,  since  it 
goes  into  another  register  first.  It  appears  when  the  machine  is 
in  the  next  state,  and  is  thus  delayed  by  oneclock  cycle.  The  state 
diagram  in  Figure  1 7  illustrates  all  of  the  possibilities  on  a  state 
transition  diagram. 


TRANSITION 
FUNCTION 

NEXT 
STATE 
NS 

REGISTER 

OUTPUT 
FUNCTION 


REGISTER 
 .£>  


PRESENT  STATE 
PS 


■OA 
OUTPUTS 


Figure  14.  Synchronous  Mealy  Form 
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Figure  15.  Asynchronous  Moore  Form 
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Figure  17.  State  Diagram  Labelling  for 
Different  Output  Types 

As  a  matter  of  notation,  Moore  outputs  are  often  placed  within  the 
state  bubble  and  Mealy  outputs  are  placed  next  to  the  path  or 
arrow  which  activates  them. 

The  relationship  of  Mealy  and  Moore,  synchronous  and  asyn- 
chronous outputs  to  the  states  is  shown  in  Figure  18. 


Logic-based  devices  include  the  PAL  and  PLS  devices.  These 
devices  use  the  sum-of-products  logic  array  to  implement  the 
transition  and  output  functions.  The  memory-based  devices  like 
PROSE  implement  the  transition  and  output  functions  using  a 
PROM  or  RAM  array.  Conceptually  this  is  no  different  from  the 
logic-based  devices,  since  the  memory  can  be  viewed  as  special 
logic.  The  instruction-based  sequencers  (Am29PL141)  offer 
hardwired  instructions  and  fixed  logic  blocks  to  provide  the 
transition  function  logic  with  enhanced  capabilities  like  subrou- 
tines. Functionally  all  three  types  of  devices  work  similarly  per- 
forming two  basic  functions:  control  sequencing  and  output 
generation. 

There  are  three  major  criteria  for  selecting  the  correct  state 
machine  device  for  a  design: 

•  Number  of  inputs/outputs 

-  I/O  flexibility 

-  Number  of  output  registers 

•  Speed 

•  Intelligence/functionality 

-  Number  of  product  terms 

-  Type  of  flip-flops 

-  Number  of  state  registers 

-  Number  of  PROM  locations  (memory-based  sequencers) 
Number  of  l/Os 


Device  Selection  Considerations 

Architecturally,  the  state  machine  devices  can  be  divided  into 
three  categories: 

■  Logic-based  devices 

•  Memory-based  devices 

•  Instruction-based  devices 
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The  number  of  inputs,  outputs  and  I/O  pins  determines  the  signals 
which  can  be  sampled  or  generated  by  a  state  machine.  Figure 
1 9  lists  the  devices  offered  for  state  machine  designs,  and  shows 
the  number  of  inputs,  outputs  and  I/O  pins  available  on  each 
device. 

Timing  and  Speed 

The  timing  considerations  for  sequencer  design  are  similar  to 
those  for  registered  logic  design  (page  2-64).  A  system  clock 
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Figure  19.  Devices  for  State  Machine  Design  (Cont'd.) 


cycle  forms  the  basic  kernel  for  evaluating  control  function 
behavior.  For  the  most  part,  all  input  and  output  functions  are 
specified  in  relationship  to  the  positive  edge.  Registered  outputs 
are  available  after  a  period  of  time  tCLK,  the  clock-to-output 
propagation  delay.  Asynchronous  outputs  require  an  additional 
propagation  delay  (tPD)  before  they  are  valid. 

For  the  circuit  to  operate  reliably,  all  of  the  flip-flop  inputs  must  be 
stable  at  the  flip-flop  no  later  than  the  minimum  set-up  time  (tj  of 
the  flip-flops  before  the  next  active  clock  edge.  If  one  of  the  inputs 
changes  after  this  threshold,  then  the  next  state  or  synchronous 
output  could  be  stored  incorrectly;  the  circuit  may  even  malfunc- 
tion. To  avoid  this,  the  clock  period  (tp)  must  be  greater  than  the 
sum  of  the  set-uptime  of  the  flip-flops  and  the  clock  to  output  time 
('«. +  'cue)-  This  determines  the  minimum  clock  period  and  hence 
the  maximum  clock  frequency,  fU4X,  of  the  circuit.  Metastability 
and  erroneous  system  operation  may  occur  if  these  specifica- 
tions are  violated. 


two  I 


The  timing 
there  are 
steady,  and 
state  and  s  gnals 


relationships  are  shown  in  Figure  20.  In  each  cycle 
regions:  the  stable  region,  when  all  signals  are 
the  transition  region,  when  the  machine  is  changing 
are  unstable.  The  active  clock  edge  causes  the 
flip-flops  to  load  the  value  of  the  new  state  which  has  been  set  up 


attheir  inputs.  At  atime  afterthis,  the  present  state  and  output  flip- 
flop  outputs  will  start  to  change  to  their  new  values.  After  a  time 
has  elapsed,  the  slowest  flip-flop  output  will  be  stable  at  its  new 
value.  Ignoring  input  changes  for  the  moment,  the  changes  in  the 
state  register  cause  the  combinatorial  logic  to  start  generating 
new  values  for  the  asynchronous  outputs  and  the  inputstotheflip- 
flops.  If  the  propagation  delay  of  the  logic  is  tPD,  then  the  stable 
period  will  start  at  a  time  equal  to  the  sum  of  the  maximum  values 

°ftcu<andtPD- 

Figure  19  also  shows  the  maximum  operating  frequency  of  the 
devices.  ECL-based  PAL  sequencers  can  implement  simple 
state  machines  from  60  MHz  to  above  100  MHz.  Conventional 
TTL  PAL  sequencers  can  now  reach  speeds  of  55  MHz.  The  new 
PLS  devices  operate  at  37  MHz.  Finally,  PROSE  sequencers 
provide  25  MHz  operation,  and  the  Am29PL141  can  run  at  20 
MHz.  The  design  engineer  usually  selects  the  simplest  device 
that  provides  the  desired  level  of  performance. 

Asynchronous  Inputs 

The  timing  of  the  inputs  to  a  synchronous  state  machine  are  often 
beyond  the  control  of  the  designer  and  may  be  random,  such  as 
sensor  or  keyboard  inputs,  or  they  may  come  from  another 
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Figure  20.  Timing  Diagram  for  Maximum  Operating  Frequency 
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Figure  21.  Asynchronous  Input  Causing  Race 


synchronous  system  that  has  an  unrelated  clock.  In  either  case 
no  assumptions  can  be  made  about  the  times  when  inputs  can  or 
cannot  arrive.  This  fact  causes  reliability  problems  that  cannot  be 
completely  eliminated,  but  only  reduced  to  acceptable  levels. 

Figure  21  shows  two  possible  transitions  from  state  "S1"  (code 
00)  either  back  to  itself,  or  to  state  "S2"  (code  11).  Which  transition 
is  taken  depends  on  input  variable  "A"  which  is  asynchronous  to 
the  clock.  The  transition  function  logic  for  both  state  bits  B1  and 
B2  includes  this  input.  The  input  A  can  appear  in  any  part  of  the 
clock  cycle.  For  the  flip-flops  to  function  correctly,  the  logic  for  B1 
and  B2  must  stabilize  correctly  before  the  clock.  The  input  should 
be  stable  in  a  window  t^  (setup  time)  before  the  clock  and  th  (hold 
time)  after  the  clock.  If  the  input  changes  within  this  window,  both 
the  flip-flops  may  not  switch,  causing  the  sequence  to  jump  to 
states  01  or  1 0,  which  are  both  undefined  transitions.  This  type 
of  erroneous  behavior  is  called  an  input  race. 

A  solution  to  this  problem  is  to  change  the  state  assignment  so 
that  only  one  state  variable  depends  on  the  asynchronous  input. 
Thus  the  11  code  must  be  changed  to  01  or  10.  Now,  with  only 
one  unsynchronized  flip-flop  input,  either  the  input  occurs  in  time 
to  cause  the  transition,  or  it  does  not,  in  which  case  no  transition 
occurs.  In  the  case  of  a  late  input,  the  machine  will  respond  to  it 
one  cycle  later,  provided  that  the  input  is  of  sufficient  duration. 

There  is  still  the  possibility  of  an  input  change  violating  the  setup 
time  of  the  internal  flip-flop,  driving  it  into  a  metastable  state.  This 
can  produce  system  failures  which  can  be  minimized,  but  never 
eliminated  (see  Metastability,  page  3-164).  The  same  problem 
arises  when  outputs  depend  on  an  asynchronous  input. 

Very  little  can  be  done  to  handle  asynchronous  inputs  without 
severely  constraining  the  design  of  the  state  machine.  The  only 
way  to  have  complete  freedom  in  the  use  of  inputs  is  to  convert 
them  into  synchronous  inputs.  This  can  be  done  by  allocating  a 
flip-flop  to  each  input  as  shown  in  Figure  22.  These  synchronizing 
flip-flops  are  clocked  by  the  sequencer  clock,  and  may  even  be 
the  sequencer's  own  internal  flip-flops.  This  method  is  not 
foolproof,  but  significantly  reduces  the  chance  of  metastability 
occurring. 


Functionality 

The  functionality  of  different  devices  is  difficult  to  compare  since 
different  device  architectures  are  available.  The  number  of 
registers  in  a  device  determines  the  number  of  state  combina- 
tions possible.  However,  all  of  the  possible  state  combinations 
are  not  necessarily  usable,  since  other  device  constraints  may  be 
reached.  The  number  of  registers  do  give  an  idea  of  the 
functionality  achievable  in  adevice.  Otherfunctionality  measures 
include  the  number  of  product  terms,  type  of  flip-flop,  or  the 
number  of  PROM  locations  in  a  memory-based  sequencer.  One 
device  may  be  stronger  than  another  in  one  of  these  measures, 
but  overall  may  be  less  useful  due  to  other  shortcomings.  Choos- 
ing the  best  device  involves  both  skill  and  experience. 

A  designer  has  a  complete  spectrum  of  devices  with  different 
architectures  to  choose  from  for  a  state  machine  design.  These 
range  from  the  PAL1 6R8D  family  of  very  high  speed  devices  to 
the  new  PROSE  PMS14R21  and  instruction-based  Am29PL141 
high-functionality  devices.  The  spectrum  is  completed  by  mid- 
range  PAL  devices  including  the  PAL23S8,  PAL22RX8A  and 
PAL32VX10/A,  with  architectural  features  specifically  designed 
for  state  machine  designs,  and  the  PLS  devices  designed  exclu- 
sively for  state  machine  implementation. 

In  order  to  give  an  idea  of  device  functionality,  we  will  consider 
each  of  the  architecture  options  available  to  the  designer  and 
evaluate  its  functionality. 
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Figure  22.  Input  Synchronizing  Register 


2-110 


State  Machine  Design 


PAL  Devices  as  Sequencers 

A  vast  majority  of  state  machine  designs  are  implemented  with 
PAL  devices.  Early  versions  of  software  required  the  user  to 
manually  write  the  sum-of-products  Boolean  equations  for  using 
PAL  devices.  Second  generation  software  allows  one  to  specify 
the  design  in  "state  machine  syntax,"  and  handles  the  translation 
to  sum-of-products  logic  automatically.  PAL  devices  implement 
the  output  and  transition  functions  in  sum-of-products  form  via  a 
user-programmable  AND  array  and  a  fixed  OR  array. 

PAL  devices  deliver  the  fastest  speed  of  any  sequencer  and  are 
ideally  suited  for  simple  control  applications  characterized  by  few 
input  and  output  signals  interacting  within  a  dedicated  controller 
in  a  sequential  manner.  The  number  of  flip-flops  in  a  PAL  device 
range  from  8  to  12,  which  offer  potentially  more  than  one  thou- 
sand state  values.  Since  some  of  the  flip-flops  are  used  for 
outputs,  and  the  number  of  product  terms  is  limited,  the  usable 
number  of  states  is  reduced  drastically.  Generally,  up  to  about  35 
states  can  be  utilized. 

PAL  Device  Flip-flops 

PAL  device  based  sequencers  implement  small  state  machine 
designs,  which  have  a  relatively  large  number  of  output  transi- 
tions. Since  the  output  registers  change  with  most  state  transi- 
tions, they  can  be  used  simultaneously  as  state  registers,  once 
the  state  values  are  carefully  selected.  Most  PAL  devices  are 
used  for  small  state  machines,  and  efficiently  share  the  same 
register  for  output  and  state  functions.  High-functionality  PAL 
device  based  sequencers  provide  dedicated  buried  state  regis- 
ters when  sharing  is  difficult. 


As  a  state  machine  traverses  from  one  state  to  another,  every 
output  either  makes  a  transition  (changes  logic  level)  or  holds 
(stays  at  the  same  logic  level).  Small  state  machine  designs 
require  relatively  more  transitions  and  fewer  holds.  As  designs 
get  larger,  state  machines  statistically  require  relatively  fewer 
transitions  and  more  holds. 

Most  PAL  devices  provide  D-type  output  registers.  D-type  flip- 
flops  use  up  product  terms  only  for  active  transitions  from  logic 
LOW  to  HIGH  level,  and  for  holds  for  logic  HIGH  level  only.  J-K, 
S-R,  and  T-type  flip-flops  use  up  product  terms  for  both  LOW-to- 
HIGH  and  HIGH-to-LOW  transitions,  but  eliminate  hold  terms. 
Thus  D-type  flip-flops  are  more  efficient  for  small  state  machine 
designs.  Some  high-end  PAL  devices  offer  the  capability  of 
configuring  the  flip-flops  as  J-K,  S-R  or  T-types,  which  are  more 
efficient  for  large  state  machine  designs  since  they  require  no 
hold  terms. 

Many  examples  of  PAL-device-based  sequencers  can  be  found 
in  system  time  base  functions,  special  counters,  interrupt  control- 
lers, and  certain  types  of  video  display  hardware. 

PALdevices  are  produced  in  a  variety  of  technologies  for  multiple 
applications,  and  provide  a  broad  range  of  speed-power  options. 
PAL  devices  which  can  be  used  for  sequencer  designs  are  listed 
in  Figure  23.  We  will  consider  the  following  PALdevices  in  detail. 

PAL10H/10020EV7EG8 
PAL16R8D  family 
PAL23S8 
PAL22RX8 
PAL32VX1 0 
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Figure  23.  Table  of  PAL  Devices  for  Sequencer  Applications 
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Figure  23.  Table  of  PAL  Devices  for  Sequencer  Applications  (Cont'd.) 
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The  ECL  PAL10H/10020EV/EG8 

At  125  MHz,  the  ECL  PAL10H/10020EV/EG8  provides 
the  highest  speed  for  a  state  machine  design.  The 
PAL1  OH/1 0020EV/EG8  has  eight  outputs  and  20  inputs.  Half  of 
the  outputs  have  8  product  terms  and  half  have  1 2  product  terms. 
All  of  the  20EV8  outputs  use  D-type  flip-flops,  while  the  20EG8 
outputs  are  transparent  latches.  Two  architectural  fuses  per 
output  control  the  polarity  of  the  logic  and  bypass  the  flip-flop  for 
combinato  ial  asynchronous  outputs.  Two  global  product  terms 
are  present  to  Set  and  Reset  all  flip-flops.  Maximum  frequency 
of  operation  as  a  state  machine  is  125  MHz. 

Figure  24  details  the  macrocell  for  the  PAL1  OH/1 0020EV8. 

The  PAL16R8D  Family 

This  is  the  high-speed  end  (55  MHz)  of  the  TTL  state  machine 
design  solutions.  PAL1 6R8D  family  devices  are  available  in  three 
different  versions  (PAL1 6R8/6/4)  with  varying  numbers  of  regis- 
tered and  combinatorial  outputs  and  I/O  pins  for  convenient 
design  fitting.  The  PAL16R8  family  provides  64  product  terms 
(eight  per  output). 


The  software  design  options  available  for  the  PAL16R8D  family 
are  the  traditional  Boolean  equations  and  the  state  machine 
syntax  provided  by  PALASM  2  software. 

The  PAL23S8 

At  33.3  MHz  and  in  a  20-pin  DIP,  this  device  (Figure  25)  provides 
the  highest  performance-to-board-space  ratio.  Designed  for 
high-functionality  state  machines,  this  device  offers  eight  output 
and  six  dedicated  buried  registers.  It  also  offers  four  output 
macrocells  with  register  bypass  for  asynchronous  outputs.  It  has 
nine  dedicated  device  inputs  and  four  I/O  pins  giving  a  total  of  23 
array  inputs.  For  improved  state  machine  initialization,  it  also 
offers  dedicated  synchronous  Preset  and  asynchronous  Reset 
product  terms.  Pin-compatible  with  the  PAL1 6R8  family,  this 
device  offers  varied  distribution  of  145  product  terms  for  large 
designs  and  optimal  fit. 

The  software  design  options  include  Boolean  equations  and  state 
machine  syntax. 
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Figure  24.  The  PAL10H/10020EV8  Output  Macrocell 
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The  PAL22RX8A 


The  24-pin  PAL22RX8A  (Figure  26)  provides  eight  configurable 
input/output  macrocells  with  register  bypass  for  a  user-program- 
mable number  of  registered  or  combinatorial  outputs  and  I/O 
pins.  It  also  provides  dedicated  asynchronous  reset  and  preset 
product  terms  for  state  machine  initialization.  The  PAL22RX8A 
provides  a  product-term-controlled  XOR  gate  with  its  sum-of- 
products  logic.  This  allows  users  to  configure  individual  macro- 
cells  with  D,  J-K,  S-R  or  T-type  flip-flops. 

The  requirements  of  transition  and  hold  terms  are  dependent 
upon  state  selection,  which  varies  for  different  applications.  The 
PAL22RX8A  provides  both  D-type  and  J-K  type  flip-flops,  allow- 
ing the  designer  to  select  the  one  requiring  fewer  product  terms. 

The  PAL22V10 

The  PAL22V10  provides  output  macros  similar  to  the 
PAL22RX8A.  It  provides  both  programmable  polarity  and  regis- 
ter bypass.  In  addition,  it  has  varied  product  term  distribution. 

The  PAL22V10  has  an  advantage  of  a  large  number  of  product 
terms.  It  provides  a  varied  distribution  of  eight  to  sixteen  product 
terms  per  output.  It  also  provides  ten  output  macrocells,  allowing 
for  larger  designs. 

The  PAL32VX10/A 

The  PAL32VX10/A  (Figure  27)  provides  high  functionality  and 
density  for  PAL  device  designs.  The  PAL32VX1 0/A  has  config- 
urable flip-flops  and  initialization  product  terms  for  state  machine 
designs.  It  offers  unprecedented  design  density  because  of  its  1 0 
input/output  macros  and  a  large  number  (8-1 6/output)  of  product 
terms. 

The  PAL32VX1 0/A  also  provides  dual  feedback  paths  from  each 
output  macrocell.  This  allows  the  macrocell  register  to  be  used 
as  a  buried  state  register  in  case  the  I/O  pin  is  used  as  a  dedicated 
input.  The  PAL32VX10/A  is  designed  for  large  state  machine 
designs  which  have  relatively  few  output  transitions  and  require 
separate  output  and  buried  registers.  The  PAL32VX10/A  pro- 
vides up  to  10  buried  registers  for  state  machine  design  without 
any  I/O  penalty.  The  PAL32VX1 0/A  also  allows  trading-off  output 
registers  for  state  registers  and  vice-versa.  This  optimizes  the 
use  of  device  resources. 
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Figure  25.  Block  Diagram  of  PAL23S8 


2-114 


£l  Monolithic  KFJ]  Memories  £1 


State  Machine  Design 


ASYNCHRONOUS  RESET 
ASYNCHRONOUS  PRESET 
CLOCK 


Figure  26.  PAL22RX8A  Output  Macrocell 
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Figure  27.  PAL32VX1 0/A  Output  Macrocell 
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Figure  28.  Architecture  of  a  PLS105  Device 
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Programmable  Logic  Sequencers  (PLS) 

Another  alternative  for  control  logic  is  the  PLS  (Programmable 
Logic  Sequencer)  device  (Figure  28).  PLS  devices  have  both 
programmable  AND  and  programmable  OR  arrays.  Forty-eight 
product  terms  are  available,  which  can  be  shared  between 
transition  functions  and  output  functions  for  all  of  the  outputs,  as 
dictated  by  the  application  needs.  PLS  devices  also  offer  S-R  flip- 
flops  with  buried  registers  for  control  sequencing.  The  flip-flops 
have  a  common  clock  line.  The  S-R  type  flip-flops  are  similar  to 
the  J-K  type  flip-flops  discussed  earlier,  and  require  no  hold 
terms.  Consequently  S-R  typeflip-f  lops  are  very  efficient  for  large 
state  machine  designs.  PLS  devices  can  function  as  either  as 
Moore  machines  or  Mealy  machines  with  a  register  on  the  output 
path. 

With  the  improvements  in  design  tools,  it  is  possible  to  specify 
designs  directly  from  the  state  diagram.  However,  one  format 
which  has  been  used  often  in  the  past  is  the  PLS  table.  This  table 
is  primarily  used  where  the  design  is  entered  manually.  The  table 
is  also  useful  for  learning  about  and  experimenting  with  se- 
quencer designs.  Often,  it  is  possible  to  manipulate  sequencer 
programs  using  the  table  in  ways  which  are  not  possible  with 
some  of  the  more  rigid  CAD  languages. 

The  table  has  provision  for  input  and  output  signal  pin  allocation 
and  naming,  and  a  line  for  each  of  the  48  device  product  terms. 
The  entries  used  in  each  line  are  shown  in  Figure  29. 


VALUE 

AND  ARRAY  TERMS 

OR  ARRAY  TERMS 

(Input,  present  state) 

(next  state,  output) 

■  r-  I 

true  input/state  bit 
complement 
don't  care 

Set  Flip-flop 
Reset  Flip-flop 
no  change 

Figure  29.  Symbols  Used  In  a  PLS  Table 


The  table  definesthe  output  function  and  transition  function.  Both 
functions  depend  on  the  present  sfafeand  inputs.  Each  possible 
link  path  between  two  states  uses  a  product  term  and  becomes 
a  line  in  the  PLS  table.  The  collection  of  all  link  path  lines  is  a 
complete  definition  of  the  combinatorial  logic  required.  The 
following  example  illustrates  the  relationship  between  a  state 
diagram  and  the  PLS  table. 

Let  us  look  at  an  arbitrary  function  as  shown  in  Figure  30.  The 
inputs  are  A,B,C,  and  D;  the  state  names  are  SO,  S1 ,  S2  and  S3; 
and  the  outputs  are  W,  X,  Y  and  Z.  To  construct  the  PLS  table,  a 
state  assignment  is  necessary.  Selecting  an  arbitrary  binary  code 
for  each  state: 

SO- 00 

51  =01 

52  =  10 

53  -  11 

With  the  two  state  variables  P  and  Q,  the  PLS  table  can  be 
derived,  and  is  shown  in  Figure  31. 


PATH 

INPUTS 

PRESENT 

NEXT 

OUTPUTS 

# 

STATE 

STATE 

A  B 

C 

D 

P 

Q 

P 

Q 

W 

X  Y  Z 

1 

H  - 

L 

L 

H 

L 

L 

H  L  L 

2 

L  - 

L 

L 

L 

H 

H 

L   L  L 

3 

-  H 

H 

L 

H 

H 

H 

L 

L   L  L 

4 

-  L 

L 

H 

L 

L 

L 

L  L  L 

5 

-  H 

L 

L 

H 

L 

L 

L 

L  L  L 

6 

-  H 

H 

L 

H 

H 

L 

L   L  L 

7 

-  L 

H 

L 

L 

L 

L 

L  L  L 

8 

H 

H 

H 

L 

L 

L 

L  L  H 

9 

L 

H 

H 

L 

L 

L 

L  H  L 

Figure  31.  Example  PLS  Table 
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Figure  30.  Example  State  Machine 


Figure  32.  "HOLD"  Transitions 
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This  is  now  a  complete  description  of  the  combinational  logic  of 
the  state  machine.  Every  line  in  the  table  is  a  product  term  which 
is  active  when  conditions  for  a  transition  to  another  state  are 
satisfied.  For  this  reason,  the  product  terms  in  a  PLS  device  are 
also  known  as  transition  terms.  As  the  circuit  moves  from  state 
to  state,  new  transition  terms  become  active  as  others  become 
inactive. 

A  comment  may  be  appended  to  each  term  to  aid  documentation 
of  the  design. 

"Hold"  Terms 

The  flip-flops  used  in  PLS  devices  are  the  S-R  type.  As  discussed 
earlier,  S-R  flip-flops  do  not  require  any  product  terms  for  holds 
and  require  product  terms  only  for  state  transition  or  output 
generation  (Figure  32). 

This  may  be  seen  in  the  two  lines  of  the  PLS  table  corresponding 
to  the  two  link  paths  shown,  in  Figure  33. 


INPUT 

PRESENT 

NEXT 

X1 

STATE 

STATE 

L 

LHLH 

H 

LHLH 

-  -HL 

Figure  33.  Hold  Transitions  In  a  PLS  Table 

The  first  path,  corresponding  to  the  first  line  in  Figure  33,  is  a 
transition  back  to  the  same  state.  No  change  is  thus  required  in 
any  of  the  state  variables.  '-' entries  can  thus  be  used.  The  first 
line  has  only  '-'  entries  in  the  OR  Array  part  of  the  table;  the 
transition  term  is  thus  not  connected  and  this  line  can  be  elimi- 
nated altogether.  This  is  a  logic  minimization  which  can  be  done 
immediately  when  converting  the  flowchart  or  state  diagram  into 
a  PLS  table.  The  second  path  is  to  a  different  state,  "011 0".  This 
does  require  a  term  since  some  state  variables  have  to  change. 
Also,  note  that  "hold"  state  transitions  are  only  free  if  no  output 
changes  are  required. 

Complement  Array  Term 

An  important  addition  to  the  PLS  device's  programmable  AND- 
OR  array  is  the  complement  array  term  (Figure  34).  This  is  an 


extra  OR  line  which  is  first  complemented  and  then  fed  back  as 
an  additional  input  into  the  AND  array.  Figure  35  shows  the  three 
possible  configurations  of  the  array  at  each  transition  term.  The 
symbols  shown  are  used  in  an  extra  truth  table  column  for  config- 
uring the  complement  array. 


X  INDICATES  FUSE 


Figure  34.  Connections  of  the  Complement  Array  Term 

Proper  usage  of  the  complement  array  can  lead  to  considerable 
savings  in  product  terms.  This  can  be  seen  in  the  typical  use  of 
the  complement  array  in  the  implementation  of  default  or  "ELSE" 
transitions.  Figure  36  shows  a  set  of  four  transitions  from  state 
SA,  three  of  which  are  enabled  by  one  of  three  combinations  of 
inputs  A,  B,  and  C,  causing  transitions  to  states  SO,  S1  and  S2. 
All  other  combinations  cause  a  transition  to  state  SX. 


0 


0 


0 


GENERATE  PROPAGATE  TRANSPARENT 

•A- 

X  INDICATES  FUSE 
407  27 

Figure  35.  Complement  Array  Configurations 
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Figure  36.  Transitions  with  a  Default 
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Three  product  terms  are  required  for  transition  to  states  SO,  S1 
and  S2  which  encode  the  three  different  conditions.  For  the 
default  transition  to  state  SX,  the  remaining  five  condition  combi- 
nations of  A,  B  and  C  would  have  to  be  encoded,  requiring  five 
more  product  terms  (Figure  37).  But  these  terms  can  be  thought 
of  as  "not  the  transitions  to  SO  or  S1  or  S2".  By  using  the  comple- 
ment array  (which  provides  this  NOT  signal),  only  one  more 
product  term  is  required  for  the  default  transition  (Figure  38). 

The  lastfive  terms  have  been  compressed  into  one.  If  none  of  the 
three  defined  input  combinations  is  true  in  state  SA,  the  inputs  to 
the  complement  array  will  all  be  zeros,  and  the  complement  array 
output  will  be  a  one.  This  activates  the  fourth  transition  term, 
causing  a  transition  to  SX  (Figure  39). 

The  same  complement  array  term  can  be  used  in  other  states  too, 
since  the  propagated  signal  is  AN  Ded  with  the  present  state  code. 
Therefore  all  defaults  can  be  realized  with  one  term  per  state. 
PLS  devices  can  be  an  effective  solution  in  state-  and  branch- 
intensive  applications,  typically  found  in  a  variety  of  protocol  con- 
trollers, waveform  generators,  and  sequence  detectors.  In  such 
applications,  complex  arrangements  of  multiple  branches  may  be 
present.  Multi-step  control  functions  may  include  many  loops  of 
dozens  of  steps  to  count  packets  of  data  or  input  events  as  they 
occur. 

PLS  devices  can  be  characterized  by  the  number  of  inputs, 
number  of  product  terms,  number  of  flip-flops  and  number  of 
outputs.  The  MMI  PLS  family  currently  has  three  members;  these 
are  listed  n  Figure  40. 

Software  for  PLS  devices  can  usually  accept  input  descriptions  in 
threeformats:  truth  tables,  Boolean  equations  and  state  machine 
syntax.  The  first  two  require  the  user  to  work  out  the  explicit  logic 
assignment  for  all  terms  and  code  the  detailed  fuse  states 
programmed  or  unprogrammed.  The  new  state  machine  syntax 
allows  design  in  a  high-level  syntax,  and  is  easier  to  use. 
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Figure  37.  Product  Terms  Required  for  Transition  to 
State  SX 
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Figure  38.  Default  Transition  Using  Complement 
Array  Term 
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Figure  39.   Transitions  Using  Complement  Array  Term 
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PRODUCT 
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PLS167 

24 

14 

6 

Total  48 

33 

PLS168 

24 

12 

8 

Total  48 

33 

PLS105 

28 

16 

8 

Total  48 

37 

Figure  40.  Table  of  PLS  Devices 


PROSE  Sequencer  (PMS14R21) 

The  PROSE  (PMS14R21  PROgrammable  SEquencer)  is  a  revo- 
lutionary architecture  optimized  for  very  high  functionality  state 
machine  designs.  It  combines  a  PROM  and  a  PAL  array  on  a 
single  device,  utilizing  the  efficiencies  of  both  for  state  machine 
design. 

The  PROSE  is  a  high-speed,  1 4-input,  8-output  state  machine.  It 
consists  of  a  1 28x21  PROM  array  preceded  by  a  1 4H2  PAL  array 
(see  Figure  41).  The  PAL  array  is  efficient  for  performing  logic 
functions  on  a  large  number  of  input  conditions,  while  the  PROM 
array  is  optimal  for  implementing  a  large  number  of  product  terms 
(decision  branches)  and  states.  The  combination  allows  a  very 
efficient  implementation  of  a  state  machine. 


Figure  41.  PROSE  PMS14R21  Device  Architecture 
PROM  Operation 

The  PROM  consists  of  128, 21 -bit  word  locations.  Each  of  these 
locations  in  the  PROM  can  be  viewed  as  a  state.  Eight  bits 
Q(7-0)  at  each  location  define  the  outputs.  The  other  1 3  bits  are 
used  as  feedback:  six  as  condition  select  bits  CS(5-0)  to  the  PAL 
array,  and  seven  to  the  PROM  array  itself  for  determining  the  next 
location  to  be  addressed.  Five  of  these  seven  constitute  the  low- 
order  address  bits  A(4-0)  for  the  next  state.  The  remaining  2  bits 
are  XF(1-0),  inputs  to  the  exclusive-OR  gates  which  determine 
address  bits  A6  and  A5  in  conjunction  with  PAL  output  signals  X1 


and  X0.  Effectively,  the  seven-bit  address  of  every  PROM 
location  is  generated  by  seven  bits  of  the  PROM  data  and 
constitutes  the  present  state.  The  next  PROM  location  is  the  one 
whose  address  is  stored  as  data  in  the  present  location.  Eight  of 
the  PROM  data  bits  are  used  as  outputs. 

PAL  Array  Operation 

The  PAL14H2  has  14  complementary  inputs  and  2  active-HIGH 
outputs  (X1  and  X0).  Each  of  the  2  outputs  is  a  sum  of  eight 
product  terms.  Eight  of  the  inputs,  1(7—0),  are  from  an  external 
source,  and  are  used  for  encoding  conditions  for  branch  selec- 
tion. Six  inputs,  CS(5-0),  are  from  internal  feedback  from  the 
PROM,  and  are  used  for  selection  of  the  conditions  that  deter- 
mine branching  when  decisions  are  involved  in  state  transitions. 

Just  as  PROMs  are  efficient  for  direct  control  sequencing,  PAL 
arrays  are  very  efficient  for  encoding  input  conditions.  The 
function  of  the  PAL14H2  in  the  PROSE  device  is  to  encode  all 
eighttest  input  conditions  along  with  the  state  information.  Based 
upon  these  test  conditions  being  true  it  inverts  the  polarity  of  the 
two  most  significant  PROM  address  bits.  This  causes  the  PROM 
to  address  a  different  location  (state),  which  is  equivalent  to 
conditional  branching.  The  PALASM  2  design  software  selects 
these  branch  locations  automatically  and  isolates  the  designer 
from  low-level  details. 

With  128  PROM  locations,  the  PROSE  device  can  be  used  for 
large  state  machines.  It  is  very  effective  in  single-thread  control 
functions  characterized  by  long  sequences  of  events  generated 
by  one  "trigger"  event.  There  may  be  a  single  repetitive  decision 
evaluated  many  times  or  a  long  chain  of  decisions  evaluated  for 
sequential  inputs.  The  output  control  stream  may  be  quite  wide 
and  complex  depending  on  the  system.  Generally,  only  narrow 
branches  (two  or  four-way)  are  needed.  Typical  applications 
occur  in  controllers  for  error  detection,  numerical  processing,  and 
data  encryption  and  decryption.  The  PROM  output  structure  can 
hold  any  pattern  of  eight-bit  values  to  be  read  out  as  needed.  No 
trade-off  has  to  be  made  with  branching  complexity  to  provide  this 
output  flexibility.  Byte  or  word-oriented  expansion  can  be  made 
by  arranging  multiple  devices  in  parallel. 

PROSE  state  machines  are  fully  supported  by  PALASM  2  soft- 
ware. A  complete  design  example  of  a  PROSE-based  state 
machine  is  included  on  page  2-122. 
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Fuse  Programmable  Controller 
(Am29PL141) 

The  Am29PL141  is  a  high-performance,  single-chip,  fuse-pro- 
grammable controller  (FPC).  This  chip  is  designed  to  allow  the 
implementation  of  complex  state  machines  and  controllers  by 
programming  the  appropriate  sequence  of  microinstructions  in 
the  on-chip  microprogram  memory. 


Large  state  machines  require  the  capability  of  nested  routines. 
Nested  state  machines  are  control  sequences  that  may  be 
invoked  by  one  or  more  control  functions,  but  when  invoked, 
suspend  the  "calling"  control  sequence.  Following  completion  of 
the  low-level  routine,  the  higher-level  "calling"sequence  resumes 
where  it  left  off.  The  nested  task  may  have  modified  part  of  the 
machine  environment  or  the  data  processed  by  the  system.  The 
Am29PL141  provides  stacks  which  allow  nested  subroutines. 


CONDITION  CODE 
SELECT  LOGIC 


_0  u  u 

ZERO     SDI  DCLK 


'  '  '  MICROPROGRAM 

"I8  "I2  I6  MEMORY 


P(15:8)       P(7:6)  P(5:0) 
OUTPUTS 


Figure  42.  Architecture  of  Am29PL141 
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With  its  on-chip  intelligent  microprogram  address  sequencer 
(Figure  42),  high-speed  64  x  32-bit  PROM-based  microprogram 
memory,  on-chip  pipeline  register,  and  an  on-chip  diagnostics 
serial  shadow  (SSR)  register,  this  chip  offers  two  major  advan- 
tages: fast  operation,  and  testability.  This  device  is  available  with 
a  20-MHz  clock  rate  (50-ns  cycle  time)  in  a  28-pin  dual-in-line 
package. 

A  microprogram  address  sequencer  is  the  heart  of  the  FPC,  and 
performs  all  control  sequencing  functions.  The  Am29PL141  has 
29  high-level  microinstructions  which  include  jumps,  loops,  sub- 
routine calls,  and  multiway  branching.  These  microinstructions 
can  be  conditionally  executed  based  on  the  test  inputs.  The 
output  generation  function  is  similar  to  that  of  the  PROSE  device, 
where  output  data  is  stored  in  PROM  locations.  A  block  diagram 
of  the  FPC  is  shown  in  Figure  42. 

The  FPC  consists  of  five  main  logic  blocks:  the  microaddress 
control  logic,  condition  code  selection  logic,  branch  control  logic, 
microinstruction  decode  logic,  and  the  microprogram  memory. 

The  microaddress  control  logic  generates  the  proper  sequences 
of  addresses  for  the  microinstructions  in  the  microprogram 
memory.  A  PLA  microinstruction  decoder  is  used  to  decode  the 
opcode  from  the  microinstruction.  This  microinstruction  decoder 
also  generates  all  necessary  control  signals  for  executing  each 
microinstruction.  Depending  on  the  microinstruction,  the  mi- 
croaddress can  be  generated  from  a  number  of  different  sources: 
branch  control,  microprogram  counter  (PC+1 ),  subroutine  regis- 
ter, or  the  microprogram  counter  (PC). 

A  six -bit  stack  is  available  for  looping  and  subroutine  calls.  When 
the  loop  counter  is  not  being  used  for  counting  purposes,  it  can  be 
configured  with  the  stack  register  to  implement  a  two-deep  stack 
for  nested  loops  and  nested  subroutine  calls.  The  condition  code 
selection  logic  consists  of  an  8:1  test  multiplexer.  One  of  eight  test 
conditions  can  be  selected  on  which  to  base  the  conditional 
instructions.  The  polarity  of  the  selected  condition  code  input  is 
controlled  by  the  POL  bit  in  the  microword. 

The  branch  control  logic  is  used  for  generating  multiway  branch 
addresses  from  the  external  inputs  T(5-0).  This  logic  also  per- 
forms the  comparison  between  the  inputs  and  the  pipeline  data 
field  for  executing  the  COMPARE  instruction.  An  EQ  signal  is 
generated  as  an  output  of  the  COMPARE  instruction  and  can  be 
used  for  a  condition  code  test  in  the  next  clock  cycle. 

With  its  64-location  PROM,  the  Am29PL141  provides  up  to  64 
states  for  state  machine  designs.  It  offers  the  most  sophisticated 
control  sequencing  circuitry,  which  allows  multiple  branching  and 
nested  subroutines.  This  can  enhance  the  functionality  of  the 
device  even  beyond  64  states. 

The  Am29PL1 41  is  supported  by  an  assembler  (ASM1 4X)  which 
allows  the  user  to  write  instructions  in  a  high-level  language 
syntax.  The  assembler  then  converts  the  high-level  statements 
into  the  specific  device  microinstructions  and  PROM  data.  The 
JEDEC  device  output  file  is  also  generated  for  programming  the 
device  PROM. 


State  Machine  Design  Tutorial 

The  following  discussion  is  a  tutorial  on  state  machine  design 
methodology.  In  this  tutorial  we  will  use  the  PROSE  PMS14R21 
device  and  explain  the  state  machine  design  process.  The 
PROSE  PMS14R21  is  supported  by  the  state  machine  syntax  of 
PALASM  2  design  software.  This  software  syntax  is  identical  to 
and  can  be  used  for  programming  other  PAL  and  PLS  devices. 

There  are  four  stages  in  astate  machine  design.  The  first  requires 
conceptualizing  the  design  and  selecting  the  correct  device; 
second  converting  it  to  its  state  diagram  representation;  third 
converting  the  state  diagram  to  its  state  machine  syntax  text  file; 
and  fourth  assembling  the  file  and  programming  the  device.  To 
explain  all  the  stages  in  the  design  process  we  will  take  a  general 
design  example  and  then  go  through  the  stages  of  the  design 
process.  As  an  alternative  design  methodology,  we  will  also 
examine  the  flowchart  state  representation  in  the  design 
example. 

Conceptualizing  the  Design 

This  isthe  first  step  in  a  state  machine  design  process.  In  this  step 
a  complete  functional  description  of  the  design  is  formalized 
along  with  the  requisite  truth  tables  and/or  timing  diagrams.  We 
will  consider  a  simple  traffic  signal  controller  example  for  illustra- 
tive purposes. 

To  begin  with,  let  us  visualize  the  scene  of  a  traffic  intersection 
(Figure  43),  simplified  to  two  one-way  streets. 


RED  2      GRN  2 
YEL  2 


o  o  o 


o 

RED  1 

o 

YEL  1 

o 

GRN  1 

■  DIRECTION  2 


SEN  2 


SEN  1 


Figure  43.  A  Traffic  Intersection 
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The  traffic  intersection  shows  two  one-way  streets:  one  in  direc- 
tion 1  and  the  other  in  direction  2.  Each  direction  has  a  signal 
consisting  of  red,  yellow,  and  green  lamps.  These  lamps  are 
activated  with  appropriately-named  active-HIGH  signals  (RED,, 
YEL,,  GRN,,  RED2,  YEL2,  GRN2).  The  signals  are  generated  by 
the  state  machine  controller.  Also,  each  direction  has  a  sensor 
which  provides  an  active-HIGH  signal  (SEN,,  SEN2)  that  indi- 
cates the  presence  of  avehicle.  The  controller  has  to  managethis 
intersect  on,  with  the  sensors  as  inputs  and  the  lamps  as  outputs. 

The  assertion  of  SEN,  or  SEN2  signals  a  request  for  a  green  light 
for  traffic  in  the  corresponding  direction.  Once  SEN,  is  HIGH  and 
SEN2  is  LOW,  indicating  traffic  in  direction  I,  the  GRN,  light 
should  be  on  at  all  times.  Similarly,  when  SEN.,  is  HIGH  and  SEN, 
is  LOW,  the  GRN2  light  should  be  on  at  all  times.  When  SEN,  and 
SEN2  are  both  HIGH,  indicating  traffic  in  both  directions,  the  traffic 
controller  should  cycle,  allowing  equal  periods  of  green  signals 
(GRN,  and  GRN2)  for  both  directions.  This  cycling  is  also  done 
when  SEN,  and  SEN2  are  both  LOW,  indicating  no  traffic. 


SEN, 

SEN, 

L 
H 
H 
L 

H 
L 
H 
L 

Allow  traffic  in  direction  1 :  RED2,  GRN,  -  H 
Allow  traffic  in  direction  2:  RED,,  GRN2  -  H 
Cycle  with  equal  durations  in  both  directions 
Cycle  with  equal  durations  in  both  directions 

Whenever  the  signals  change  from  direction  1  to  direction  2,  the 
appropriate  yellow  light  (YEL,  or  YEL2)  is  turned  on  for  the 
transition  duration.  This  allows  time  for  the  traffic  in  the  intersec- 
tion to  pass,  before  allowing  traffic  in  the  other  direction.  The 
timing  diagram  is  shown  in  Figure  45.  When  both  sensors 
indicate  the  presence  of  traffic  (SEN,,  SEN2  =  H),  extra  time  is 
allowed  for  traffic  to  pass,  before  the  signals  change  the  direction. 
The  extra  time  is  also  allowed  when  there  is  no  traffic  in  both 
directions  (SEN,,  SEN2  =  L).  The  timing  diagram  forthis  is  shown 
in  Figure  46.  Finally,  the  traffic  signal  controller  shown  includes 
the  system  clock  (CLK),  and  an  initialize  or  reset  signal  (INIT). 
IN  IT  drives  the  controller  to  an  initial  state. 

Before  we  proceed  tothe  next  step,  we  must  select  a  device  which 
will  implement  this  design. 

In  the  previous  section  we  found  that  the  basic  selection  of  a  state 
machine  controller  device  depends  upon  the  speed  and  I/O 
considerations.  The  PROSE  PMS14R21  has  eight  inputs  and 
eight  outputs  in  addition  to  the  initialization  input  signal  PRESET, 
which  are  sufficient  forthis  design.  Another  selection  criterion  is 
the  intelligence  of  the  PROSE  device.  The  device  should  be  able 
to  implement  all  of  the  states  required  by  the  design.  The  PROSE 
device  is  the  only  24-pin  SKINNYDIP  device  which  has  the 
capacity  for  up  to  128  states.  Later  we  will  see  that  these  are 
sufficient  for  implementing  our  design. 


Figure  44.  Table  for  Traffic  Flow  Direction 


SEN 


RED 


RED 


ft 

II                                              y  / 

GRN 


YEL 


STATE         S0     S0     S3     S4  S4 

DIRECTION  1   *■  DIRECTION  2 


S4     S4     S7     S0  S0 
DIRECTION  2  *■  DIRECTION  1 


Figure  45.  Fast  Traffic  Flow  Transition 
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SEN 


SEN 


RED 


RED 


GRN 


STATE 


So  So  Si  S2  S3  &4  &4 
DIRECTION  1   »•  DIRECTION  2 


S4  S4  S6  S6  S7  S0  S0 
DIRECTION  2  ►  DIRECTION  1 


Figure  46.  Slow  Traffic  Flow  Transition 


State  Machine  Representation 

Once  the  design  problem  is  defined,  the  next  stage  is  to  convert 
it  to  its  state  machine  representation.  State  machines  can  be 
represented  by  bubble-and-arrow  state  diagrams  (Figure  4). 
Each  bubble  represents  a  state,  with  each  arrow  representing  a 
transition.  A  second  method  of  state  machine  representation  is 
with  flow  charts,  details  of  which  are  explained  on  page  2-104. 
State  diagrams  are  by  far  the  most  popular  representation. 
Flowcharts  are  included  for  completeness;  you  need  not  worry  if 
you  are  not  comfortable  with  them. 

The  transition  from  the  present  state  to  the  next  state  is  depend- 
ent upon  present  state  and  input  conditions.  Similarly,  outputs 
can  be  generated  based  upon  the  present  state  (called  a  Moore 
type  state  machine),  or  present  state  and  input  conditions  (called 
a  Mealy  type  state  machine).  Our  design  example  is  a  Moore  type 
state  machine.  The  PALASM  2  syntax  supports  both  Moore  and 
Mealy  type  state  machines. 

The  first  step  for  constructing  a  state  diagram  is  to  define  states. 
All  of  the  specific  events  in  a  design  are  assigned  a  state.  These 
include:  change  of  output  signals,  response  to  a  change  in  input 
signals, ortimedelays.  For  example,  in  Figure  45,  different  states 
(S0,  S3  and  S()  have  been  assigned  where  the  outputs  change. 
Often  states  with  the  same  outputs  can  be  merged  into  one 
unique  state,  as  shown  in  the  case  of  state  S0.  Once  the  states 
are  defined  as  bubbles,  the  transition  from  one  state  to  another 
(called  control  sequencing)  and  outputs  generated  for  each  state 
(output  generation)  are  defined.  The  task  is  to  use  the  timing 
diagram  along  with  the  functional  description  of  the  design  to 


develop  control  sequencing  and  output  generation  functions  for 
state  representations.  The  state  diagram  is  an  improvement  over 
thetiming  diagram  since  it  also  provide  additional  branch  decision 
information. 

Each  of  the  functions  performed  by  a  controller  can  be  viewed  as 
one  of  the  following  operations.  We  will  show  how  each  of  these 
operations  determine  the  control  sequencing  and  the  output 
generation  for  the  state  machine,  and  their  state  diagram  and 
flowchart  representations. 

•  Arbitration 

•  Multiple  condition  testing 

•  Event  monitoring 

•  Control  signal  generation 

•  Timing  delays 

Arbitration  is  the  decision-making  process.  It  is  inadequately 
represented  in  timing  diagrams  with  arrows  (Figures  45  and  46). 
The  table  in  Figure  44  represents  the  functional  requirement  for 
arbitration.  Based  upon  the  sensor  inputs,  the  state  machine 
stays  in  the  same  state  S0,  when  SEN,=H  and  SEN2=L  (for 
allowing  traffic  in  direction  1)  or  transitions  either  to  state  S3 
(SEN,-I_  SEN2=H),  or  to  state  S,  (SEN,,  SEN2  are  both  HIGH  or 
LOW).  This  is  represented  in  the  state  diagram  and  flowchart  as 
the  transition  from  S0  (Figure  47). 

S0  can  also  be  thought  of  as  an  event  monitoring  state  where  the 
controller  waits  for  the  command  from  the  sensors.  Based  upon 
the  two  bits  SEN0  and  SEN,,  the  state  machine  moves  to  different 
next  states.  This  is  also  an  example  of  multiple  condition  testing. 
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SEN  1  *  SEN  2 


a.  Slate  Diagram 


|  (SEN , ,  SENj )  

s, 

S3 

407  42a  b.  Flowchart 

Figure  47.  Conditional  State  Transitions 

An  example  of  control  signal  generation  (Figure  48)  is  signal 
RED.,  which  is  used  to  turn  on  the  red  light  for  direction  1 .  This 
control  signal  generation  example  requires  assigning  output 
signal  name(s)  (RED2)  to  the  state(s)  (S„,  S,,  S2  and  S3)  where  it 
is  asserted  (see  Figure  46).  For  simplicity,  we  are  only  consider- 
ing the  outputs  which  are  asserted.  Other  outputs  in  their  default 
states  must  be  accounted  for  in  the  final  state  machine  represen- 
tation. 

The  signal  GRN,  is  kept  asserted  for  two  extra  clock  cycles 
(Figure  46)  to  allow  the  traffic  to  pass  when  changing  the  traffic 
pattern  from  direction  2  to  direction  1 .  The  state  diagram  repre- 
sentation (Figure  48)  requires  assigning  unconditional  transition 
(control  sequencing)  from  one  state  to  another,  for  a  number  of 
states  (S  ,  S2  and  S3),  depending  upon  the  required  time  delay.  It 
also  requires  assigning  the  signal  name  (GRN,)  to  all  three  states, 


State  Diagram 


407  41a  b.  Flowchart 

Figure  48.  Output  Assignment  to  States 

which  results  in  signal  assertion  for  the  extra  two  clock  cycles. 

Once  the  design  has  been  converted  into  a  state  diagram  (Figure 
49)  or  flowchart  (Figure  50)  representing  all  of  its  timing  and  func- 
tional requirements,  the  next  stage  involves  conversion  to  state 
machine  syntax  for  its  textual  representation. 

State  Machine  Syntax 

The  PALASM  2  programming  software  design  file  (also  called  the 
PAL  device  Design  Specification  or  PDS)  allows  both  conven- 
tional Boolean  equations  and  state  machine  design  constructs.  It 
also  allows  both  Mealy  and  Moore  types  of  designs  with  extensive 
logic  minimization  capabilities  and  easy  menu-driven  operation. 
Other  programming  software  packages  which  provide  similar 
design  capabilities  are  also  available  from  various  vendors. 
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Figure  49.  State  Diagram— Traffic  Signal  Controller 

A  PALASM  2  state  machine  design  file  consists  of  four  sections: 

•  Declaration  section 

•  State  section 

•  Condition  section 

•  Simulation  section  (optional) 

Declaration  Section 

As  illustrated  in  Figure  51 ,  this  section  stores  the  basic  informa- 
tion about  the  device  type  and  pin  names.  It  is  also  used  to  store 
some  bookkeeping  information  such  as  the  company  name  and 
design  revision  numbers. 


s, 

GHN 

.HEDj, 

GRN 

,  RED2 

S3 

YEL, .  RED2 


RED,,  GRN 2 


2s  

% 

RED,.  GRN 2 

RED,,  GRN2 

3: 


S7 

YEL2,  RED, 


TITLE  TRAFFIC  CONTROLLER 

PATTERN  STATE  MACHINE 

REVISION  1 

AUTHOR  J.  ENGINEER 

COMPANY  MONOLITHIC  MEMORIES 

DATE  JANUARY  30,  1987 

CHIP       S_MACHINE  PMS14R21 

CLOCK  DCLOCK  SEN1  SEN2  12  13  14 
15  16  17  SDI  GND 
RESET  SDO  RED1  YEL1  GRN1  RED2 
YEL2  GRN2  01  00  MODE  VCC 


Figure  50.  Flowchart— Traffic  Signal  Controller 


State  Section 

The  state  section  contains  all  the  control  sequencing  and  output 
generation  information.  The  state  section  also  stores  all  of  the 
state  initialization  values  and  default  parameters.  As  illustrated  in 
Figure  52,  it  initially  defines  the  type  of  state  machine  being 
designed  with  keywords  MOORE_MACHINE  or 
MEALY_MACHINE.  For  the  PROSE  device  it  also  selects  the 
use  of  pin  13  as  either  MASTER_RESETor  OUTPUT_ENABLE. 


Figure  51.  Declaration  Section 
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It  also  defines  the  default  output  logic  values  as  HIGH  or  LOW 
with  the  keyword  DEFAULT_OUTPUT.  Once  an  output  is  as- 
signed a  default  logic  value,  it  is  explicitly  mentioned  only  when 
asserted;  otherwise  it  is  assumed  to  be  in  its  default  logic  value. 
This  usually  simplifies  the  task  of  drawing  a  state  diagram  (see 
Figure  49).  Similarly,  the  keyword  OUTPUTJHOLD  defines  the 
outputs  which  retain  their  previous  values  unless  explicitly  men- 
tioned in  a  new  state.  The  last  two  keywords  also  help  isolate  the 
user  from  the  implementation  details  of  J-K  or  D-type  flip-flops 
when  using  PAL-device-based  state  machines.  The  keyword 
OUTPUTJHOLD  is  not  used  in  this  design. 


STATE 

MOORE_MACHINE 
MASTER_RESET 
DEFAULT_OUTPUT  /RED1  /YEL1  /GRN1  /RED2  /YEL2  /GRN2 
POWER  UP  :=  VCC  ->  SO 

Figure  52.  State  Section 

Once  the  default  parameters  are  defined,  all  of  the  state  transi- 
tions are  represented.  Only  state  names  are  used,  since  the  state 
values  are  assigned  automatically  by  the  software.  Every  state 
transition  (Figures  53  and  54)  can  depend  upon  the  present  state 
for  direct  control  sequencing  (state_x  :=  VCC  ->  state_y),  or  the 
present  state  and  input  conditions  for  conditional  control  se- 
quencing (state_x  :=  Condition_name  ->  state_y).  The  syntax 
has  a  direct  relationship  to  the  state  diagrams.  For  conditional 
control  sequencing,  the  syntax  also  allows  a  default  state  when 
none  of  the  conditions  of  transitions  to  other  states  are  satisfied. 

The  output  generation  can  also  be  represented  easily.  Outputs 
are  specified  from  each  present  state.  These  outputs  can  be 
Moore  type  (state.OUTF  :=  output_name)  or  Mealy  type 
(state.OUTF  :=  Condition_name  ->  output_name),  depending 
upon  whether  or  not  conditions  are  used  to  generate  outputs. 


STATE_A>  Vcc  ->  STATE_B 
a.  Direct  Control  Sequencing 


b.  Conditional  Control  Sequencing 

(/state  _a) 


STATE_A.OUTF>  O, 


STATE_A.OUTF:-l,  ^O, 
d.  Mealy  Machine 


Figure  53.  Direct  Relationship  Between  State  Diagram 
and  State  Syntax 

Condition  Section 

This  is  the  third  section  of  the  PALASM  2  state  machine  file 
structure.  This  section  assigns  namesto  all  of  the  input  conditions 
used  by  the  state  section  for  both  control  sequencing  and  output 
generation. 

Design  Example  Syntax 

All  of  the  state  machine  tasks  represented  by  the  state  diagram 
can  be  converted  easily  to  textual  format.  The  transition  from 
state  S,  to  S2  and  S3  (Figure  55)  illustrates  how  outputs  RED2  and 
GRN,  are  generated.  These  represent  control  signal  generation 
and  timing  delay  tasks  of  a  state  machine.  Similarly,  the  arbitra- 
tion function  can  be  converted  to  textual  format,  showing  the 


STATE  _A 

HZ 


STATE_A:.Vcc->  STATE_B 
a.  Direct  Control  Sequencing 


STATE  _D 


STATE  _E 

STATE  _A 
O, 


STATE_C:=  I,  ->  STATE_D 
+1,  ->  STATE_E 

b.  Conditional  Control  Sequencing 


STATE  _B 


STATE_B 
O, 


STATE_A.OUTF:=li  ->  O, 
d.  Mealy  Machine 


STATE_A.OUTF:=0, 
c.  Moore  Machine 


Figure  54.  Direct  Relationship  Between  Flowchart  and  State/Condition  Syntax 
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transition  from  S0  to  various  other  states  (Figure  56).  This  is  also  a 
good  example  of  multiple  condition  testing  and  event  monitoring. 

SO.OUTF  :=  RED2  *  GRN1 

Sl.OUTF  RED 2  *  GRN1 

S2.0UTF  :=  RED 2  *  GRN1 

S3.0UTF  :-  RED2  *  YEL1 

Figure  55.  Output  Generation  Representation 

SO  :=        C3  ->  SI 
+  CO  ->  SI 
+  CI  ->  S3 
+->  SO 

Figure  56.  Control  Sequencing  Representation 


Simulation  Section 

The  simulation  section  is  similar  to  that  used  for  conventional 
Boolean  equation  simulation.  Please  see  page  4-1 37  for  details. 

Assembly  and  Programming 

Once  the  state  diagram  is  converted  to  its  textual  constructs,  the 
file  (Figure  57)  can  be  easily  assembled  by  using  PALASM  2 
software.  The  software  then  generates  the  device  programming 
JEDEC  format  file.  This  JEDEC  format  file  can  be  downloaded  to 
a  device  hardware  programmer  provided  by  various  vendors.  For 
detail  please  see  page  4-1 69  of  the  PALASM  2  software  section. 


Traffic  Signal  Controller 

TITLE        TRAFFIC  CONTROLLER 
PATTERN     STATE  MACHINE 
REVISION  1 

AUTHOR      J.  ENGINEER 
COMPANY    MONOLITHIC  MEMORIES 
DATE  JANUARY  30,  1987 

CHIP       S_MACHIN  PMS14R21 

CLOCK  DCLOCK  SEN1  SEN2  12  13  14 
15  16  17  SDI  GND 
RESET  SDO  RED1  YEL1  GRN1  RED2 
YEL2  GRN2  01  00  MODE  VCC 

STATE 

MOORE_MACH I NE  ;  Defined  as  a  Moore  Machine 

MASTER_RESET  ;  Initialization 

DEFAULT__OUTPUT  /RED1  /YEL1  /GRN1  /RED2  / YEL2 /GRN2 
POWER_UP : =VCC  ->  SO    ;  Power  up  state  defined 

;  Traffic  in  direction  1 

;  Slow  signal  change 

;  Fast  signal  change 

;  Otherwise  stay  in  state  SO 

;  Time  delay 

;  Time  delay 

;  Time  for  yellow 

;  Traffic  in  direction  2 

;  Slow  signal  change 

;  Fast  signal  change 

;  Otherwise  stay  in  state  SO 

;  Time  delay 

;  Time  delay 

;  Time  for  yellow 

Figure  57.  The  Design  File 


50  :=  C3  ->  SI 
+  CO  ->  SI 

+  CI  ->  S3 
+- >  SO 

51  :=  VCC  ->  S2 

52  VCC  ->  S3 

53  :-  VCC  ->  S4 

54  :-  C3  ->  S5 
+  CO  ->  S5 

+  C2  ->  S7 
+->  S4 

55  :-  VCC  ->  S6 

56  VCC  ->  S7 

57  :-  VCC  ->  SO 
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SO . OUTF 

GRN1 

* 

RED2 

Sl.OUTF 

GRN1 

* 

RED2 

S2 . OUTF 

GRN1 

* 

RED2 

S3 . OUTF 

YEL1 

* 

RED  2 

S 4. OUTF 

RED1 

* 

GRN2 

S5.0UTF 

RED1 

* 

GRN2 

S 6. OUTF 

RED1 

GRN2 

S7.0UTF 

RED1 

* 

YEL2 

Allow  direction  1  traffic 


Change  from  direction  1  to  2 
Allow  direction  2  traffic 


;  Change  from  direction  2  to  1 


CONDITIONS 


CO  =  /SEN1 
CI  =  /SEN1 
C2  =  SEN1 
C3  -  SEN1  *  SEN2 


/SEN2 
SEN2 
/SEN2 


Condition  for  no  traffic 

Condition  for  direction  2 

Condition  for  direction  1 

Condition  for  traffic  in  both  directions 


SIMULATION 

TRACE_ON  CLOCK  SEN1  SEN2  RED1  YEL1  GRN1  RED2  YEL2  GRN2 
SETF  RESET  /CLOCK 

CLOCKF  CLOCK  /STATE  TRANSITION  ONLY  ON  1ST  CLOCK 

CLOCKF  CLOCK  /LIGHTS  CHANGE  ON  2ND  CLOCK 

CHECK  /RED1  /YEL1  GRN1  /YEL2  /GRN2  RED2 

SETF  /SEN1  /SEN2 

CLOCKF  CLOCK 

CLOCKF  CLOCK 

CHECK  /RED1  /YEL1  GRN1  RED2  /YEL2  /GRN2 
CLOCKF  CLOCK 

CHECK  /RED1  YEL1  /GRN1  RED2  /YEL2  /GRN2 
CLOCKF  CLOCK 

CHECK  RED1  /YEL1  /GRN1  /RED2  /YEL2  GRN2 

CLOCKF  CLOCK 

CHECK  RED1  GRN2 

CLOCKF  CLOCK 

CHECK  RED1  YEL2 

CLOCKF  CLOCK 

CHECK  /RED1  /YEL1  GRN1  RED2  /YEL2  /GRN2 
SETF  /SEN1  SEN2 
CLOCKF  CLOCK 

CHECK  /RED1  /YEL1  GRN1  RED2  /YEL2  /GRN2 

CLOCKF  CLOCK 

CLOCKF  CLOCK 

SETF  SEN1  /SEN2 

CLOCKF  CLOCK 

CLOCKF  CLOCK 

CHECK  YEL2  RED1 

CLOCKF  CLOCK 

CHECK  GRN1  RED2 


TRACE_OFF 

Figure  57.   The  Design  File  (Cont'd.) 


£1  Monolithic  FJJ  Memories 
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Introduction 

The  microprocessor  has  become  the  standard  processing 
element  in  a  wide  variety  of  digital  systems.  Applications  ranging 
from  ignition  control  in  automobiles  to  high  performance  engi- 
neering workstations  utilize  microprocessors.  Usually  a  micro- 
processor with  its  associated  VLSI  components  (the  dynamic 
RAM,  EPROM  and  standard  microprocessor  peripherals)  can 
implement  most  application  problems.  The  microprocessor's 
general-purpose  nature,  ease  of  use,  and  cost-effectiveness 
makes  it  the  device  of  choice  in  all  but  the  most  performance- 
intensive  or  specialized  applications. 

All  the  designer  needs  is  the  necessary  "glue  logic"  to  connect 
these  VLSI  components  together.  A  typical  microprocessor 
based  design  is  illustrated  in  Figure  1 .  The  heart  of  the  design  is 
the  microprocessor.  Typically,  DRAMs  provide  the  data  storage 
and  EPROMs  provide  the  non-volatile  program  storage.  De- 
pending on  the  application  such  standard  microprocessor  periph- 
erals as  DMA  controllers,  serial  I/O  or  disk  controllers  are  usually 
present.  Glue  logic  is  needed  to  connect  these  devices  together 
to  create  a  working  system.  For  example,  memories  and  periph- 
erals need  to  be  selected  based  on  the  desired  address  range, 
af  unction  called  "address  decoding";  high-speed  address  strobe 
signals  RAS  and  CAS  must  be  generated  for  the  DRAMs;  wait 
states  must  be  generated  for  the  slower  memory  and  peripheral 
chips;  arbitration  and  prioritization  between  different  bus  masters 
(system  components  which  control  the  bus  transactions:  DMA 
and  CPU)  is  necessary  in  multi-master  systems. 
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Programmable  Logic  Devices  as 
Standard  Product  Glue 

The  typical  microprocessor  system  shown  in  Figure  1  highlights 
several  of  the  places  where  programmable  logic  is  used  to 
implementthe  "glue"  in  a  microprocessor  system.  These  simple 
random  logic  and  control-oriented  functions  are  necessary  to 
hold  the  standard  products  together  to  construct  a  complete 
system;  thus  the  name  "glue".  The  common  glue  functions  in  a 
microprocessorsystem  share  several  application  needs,  and  are 
best  satisfied  by  programmable  logic  devices. 

Microprocessor  glue  functions  compete  for  board  space  with  LSI 
devices  in  almost  all  cases.  If  the  glue  logic  uses  too  m  uch  board 
space,  the  design  will  need  to  sacrifice  features  (reducing  the 
design's  value  to  the  customer)  or  add  extra  boards  (increasing 
the  cost).  In  either  case  the  cost/value  ratio  increases,  which  is 
undesirable.  Programmable  logic  devices  can  reduce  the  board 
space  required  for  glue  functions  because  of  the  density  advan- 
tage the  devices  have  over  standard  SSI/MSI  devices.  This 
makes  more  board  space  available  for  standard  VLSI  devices  or 
customized  peripherals. 

Since  the  microprocessor  glue  functions  tend  to  be  control-  and 
timing-oriented,  they  are  subject  to  many  changes  as  the  board 
design  evolves,  processor  speeds  increase,  or  new  standard 
products  are  incorporated  in  the  design.  In  complex  systems, 
logicfunctionsare  commonly  "tuned"to  optimize  bus  band-width, 
peripheral  access  time,  or  memory  timing  only  after  the  system  is 
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Figure  1.  Typical  Microprocessor  System 
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running.  The  optimum  balance  is  usually  too  difficult  to  predict 
and  can  only  be  found  via  experimentation.  Because  of  their 
programmable  nature,  PLDs  offer  the  designer  the  capability  of 
changing  the  logic  at  any  time.  Thus  designs  can  quickly  adjust 
to  changing  requirements,  or  can  be  improved  without  adding 
components  or  changing  the  circuit  board. 

The  address  decoding,  timing,  and  simple  state  machines  are 
"shallow"  and  "wide"  logic  functions.  Shallow  logicfunctions  need 
only  a  few  levels  of  gating  at  a  high  speed.  The  standard  AND- 
OR  array  architecture  offers  3  to  5  levels  of  gating  (inverter,  AND- 
gate,  OR-gate,  flip/flop-master  and  flip/flop-slave)  at  very  high 
speed  (10  to  18ns).  Wide  logicfunctions  require  a  large  number 
of  inputs  on  each  gating  structure.  For  example,  address  decode 
functions  may  need  over  10  inputs  to  decode  a  microprocessor 
address  and  an  8-bit  refresh  counter  would  require  over  8  inputs 
for  the  most  significant  bit.  The  wide-input  structure  of  PLDs, 
where  practically  all  signals  are  available  (in  both  polarities)  at 
each  gating  structure,  is  a  good  match  for  this  application 
requirement. 

PLDs  offer  the  designer  several  important  charcteristics  which 
match  well  with  the  needs  of  glue  logic  functions.  Some  common 
application  areas  will  help  illustrate  the  advantages  of  PLDs  in 
microprocessor  glue  applications. 


GLUE  LOGIC  NEEDS: 

PROGRAMMABLE 
LOGIC 
ATTRIBUTES: 

•  Minimum  Board  Space 

•  Easy  and  Frequent 
Design  Changes 

•  High  Speed  Simple 
Logic  Functions 

•  High  Logic  Density 

•  Quick  and  Easy  Device 
Modifications 

•  Optimized  for  Fast 
Shallow  Logic 

Figure  2.  Table  for  Glue  Logic  Needs  and  Programmable 
Logic  Attributes 
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Address  Decoding 

The  single  most  common  application  of  combinatorial  PAL  de- 
vices is  address  decoding  (Figure  3).  In  order  for  a  microproces- 
sorto  communicate  with  a  peripheral  or  memory  chipachip  select 
signal  must  be  generated  by  the  glue  logic.  Usually  acombination 
of  address  signals  and  control  signals  are  required  to  generate 
the  chip  select  for  microprocessor  memory  and  peripheral  de- 
vices. For  example,  in  the  68000  microprocessor  some  of  the 
address  signals,  the  address  strobe  signal,  and  the  read/write 
signal  must  participate  in  the  address  decode  function.  The  logic 
needed  to  generate  the  chip  select  is  many  times  simply  a  NAND 
of  the  true  or  complement  of  the  necessary  address  and  control 
inputs. 

In  high-performance  microprocessors,  the  chip  select  must  be 
generated  fast  enough  to  allow  single  cycle  access  (no  wait 
states)  to  memory.  PLDs  provide  the  high-speed  wide-gating 
logic  functions  necessary  to  generate  these  chip  select  signals. 
Additionally,  the  addresses  for  peripherals  and  memory  devices 
are  often  not  known  at  the  beginning  of  the  design  or  may  change 
many  times  during  the  design.  PLDs  can  be  easily  changed  to 
adjust  to  such  modifications.  More  on  address  decoding  is  on 
page  2-35. 

Wait-State  Generation 

Wait-state  generation  is  in  many  ways  related  to  the  address 
decode  function  (Figure4).  Afterthe  addressed  chip  is  selected, 
the  CPU  must  be  notified  that  the  data  transfer  can  be  completed. 
The  signal  to  the  CPU  which  identifies  the  end  of  a  data  transfer 
may  need  to  be  extended  if  a  slow  device  is  being  accessed. 
DRAMs,  EPROMs,  EEPROMs,  and  microprocessor  peripherals 
may  all  need  different  delays  depending  on  their  access  time  bus 
bandwidth  requirements  and  frequency  of  access.  Usually  there 
is  at  least  afast  access  (no  wait-states)  a  slower  access  (one  wait- 
state)  and  a  longer  access  (several  wait-states).  The  wait  state 
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Figure  3.  Address  Decoding  Application  Example 
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generation  logic  must  create  a  variable  delay  depending  on  the 
device  being  presently  accessed.  Usuallythedelayonly  depends 
on  the  present  access  and  is  independent  of  previous  accesses. 

A  registered  PAL  device  has  the  capability  of  implementing  the 
simple  state  machine  needed  for  wait-state  generation.  Addition- 
ally, the  PLD  contains  enough  logic  to  provide  for  a  variety  of 
memory,  peripheral  and  processor  speeds,  without  requiring 
additional  board  space.  Thus  a  single  design  can  have  an 
extended  lifetime,  or  provide  a  quick  time  to  market  when  faster 
or  lower-cost  processors,  peripherals  and  memories  are 
available. 

DRAM  Control 

DRAM  control  logic  consists  of  several  distinct  functions:  address 
multiplex  ng,  RAS/MS/CAS  generation,  refresh  request  genera- 
tion, and  memory  request  arbitration.  Address  multiplexing,  as 
the  name  implies,  involves  multiplexing  the  upper  and  lower 
processor  addresses  to  the  DRAM  chips.  RAS/MS/CAS  genera- 
tion involves  the  generation  of  the  three  separate  DRAM  timing 
signals:  Row  Address  Strobe,  Column  Address  Strobe,  and  Mux 
Select  (on  the  address  multiplexer).  Tight  timing  relationships 
between  the  address  multiplexer  and  the  RAS/MS/CAS  logic  is 
necessary  in  high-performance  systems.  Refresh  request  gen- 
eration logic  is  used  to  time  the  interval  between  DRAM  refresh 
cycles.  The  logic  is  usually  a  wide  counter  using  a  slow  clock. 
Memory  request  arbitration  logic  decides  whether  the  refresh 
request  will  be  granted  or  a  memory  access  will  be  allowed. 

PLDs  have  a  variety  of  characteristics  which  make  them  well 
suited  for  DRAM  control  applications.  High  speed  and  minimum 
skew  specifications  are  important  considerations  for  address 
multiplexing,  RAS/MS/CAS  generation  and  refresh  arbitration  in 
high  performance  designs.  High-speed  PLDs  have  significantly 
tighter  skew  specifications  than  the  equivalent  multi-device  SSI/ 
MSI  solutons.  This  makes  PLD-based  DRAM  control  applica- 
tions easer  to  design  and  higher  performance  than  SSI/MSI 
based  designs.  Detail  on  DRAM  designs  is  provided  on 
page  2-179. 


Microprocessor  Peripheral  Interface 

Microprocessors  are  usually  connected  to  a  number  of  VLSI 
peripherals  performing  various  ancilliary  functions.  In  many 
cases  a  design  calls  for  the  use  of  "incompatible"  microprocessor 
and  VLSI  peripherals.  Cost,  performance,  availability,  backwards 
compatibility,  or  other  constraints  sometimes  dictate  that  different 
families  of  devices  need  to  coexist  in  the  same  design.  Glue  logic 
is  necessary  to  allow  these  incompatible  devices  to  communi- 
cate. Sometimes  logic  signals  need  to  be  translated.  The  most 
obvious  case  is  probably  the  different  way  a  processor  read  or 
write  cycle  is  encoded.  One  popular  processor  uses  two  separate 
read  and  write  signals  (/R  or  /W)  with  the  active  state  indicating 
the  presence  of  a  valid  address.  Another  popular  processor  uses 
an  "address  valid"  signal  to  indicate  the  presence  of  an  address 
and  a  single  read/write  signal  to  indicate  the  type  of  access. 
Peripherals  hooking  onto  the  other  processor's  bus  will  need 
some  simple  logicto  translate  the  meaning  of  these  signals.  Often 
signal  timing  is  also  different  between  families.  This  requires 
signals  to  be  delayed  or  qualified  by  glue  logic. 

The  two  most  popular  processor  families  for  high  performance 
designs  are  the  8086/801 86/80286  family  and  the  60000/6801 0/ 
68020  family.  Each  processor  has  its  own  peculiarities  which  the 
designer  must  take  into  account  when  hooking  up  a  peripheral 
device.  Subsequent  discussion  covers  several  applications 
examples  for  each  family.  Finally,  PLDs  can  also  be  used  to  make 
custom  microprocessor  peripherals.  An  application  example  of  a 
PLD-based  interrupt  controller  is  shown  on  page  2-172. 

Conclusion 

PLDs  offer  the  designer  several  important  characteristics  which 
match  well  with  the  needs  of  glue  logic  applications.  Thefollowing 
application  examples  will  illustrate  the  PLD's  versatility  at  imple- 
menting microprocessor  glue  functions. 


Figure  4.  Wait  State  Generator  Application  Example 


2-133 


Interfacing  to  the  8086/801 86/80286 


Overview 

The  8086  is  a  general-purpose  16-bit  microprocessor  CPU. 
The  CPU  has  a  16-bit  data  bus  multiplexed  with  16  address 
outputs.  There  are  4  additional  address  lines  (segment  ad- 
dresses which  are  multiplexed  with  STATUS)  that  increase  the 
memory  range  to  1  Mbyte.  8086  addresses  are  specified  as 
bytes.  In  a  16  bit  word,  the  least  significant  byte  has  the  lower 
address  and  the  most  significant  byte  has  the  higher  address. 
This  is  compatible  with  8080,  8085,  Z80  and  PDP1 1  address- 
ing schemes  but  differs  from  the  Z8000  and  68000 
addressing. 

The  data  bus  is  "asynchronous",  i.e.,  the  CPU  machine  cycle 
can  be  stretched  without  clock  manipulation  by  inserting  Wait 
states  between  t2  and  13  of  a  read  or  write  cycle  to  accom- 
modate slower  memory  or  peripherals.  Unlike  the  68000,  the 
8086  has  separate  address  space  for  I/O  (64  kBytes). 

The  8086  can  operate  in  MIN.  or  MAX.  mode.  Maximum  mode 
offloads  certain  bus  control  functions  to  a  peripheral  device 
and  allows  the  CPU  to  operate  efficiently  in  a  co-processor 
environment.  A  brief  discussion  on  both  the  MIN.  and  MAX. 
modes  are  as  follows: 

MIN.  mode:  I/O  addressing  is  defined  by  a  HIGH  or 
the  IO/M  output,  and  activated  by  the  RD 
output  for  reading  from  memory  or  I/O,  or 
activated  by  the  WR  output  for  writing  to 
memory  or  I/O. 

DMA:  The  Bus  is  requested  by  activating  the  HOLD  input  to 
the  8086  Bus  Grant  is  confirmed  by  the  HLDA  output 
from  the  8086. 


MAX.  mode:        I/O  operation  is  controlled  by  two  outputs 

(3086  plus  from  the  8288. 

8288)  IORC:     active  during  Read  from  I/O 

IOWC:    active  during  Write  to  I/O 
MRDC :   active  during  Read  from  memory 
MWTC:  active  during  Write  to  memory 

DMA:  The  Bus  is  requested  and  Bus  Grant  is  acknowledged 
on  the  same  pin  (RQ/GT0  OR  RQ/GTT)  through  a 
pulsed  handshake. 

Interrupts  in  Min.  and  Max.  Modes: 

Interrupt  is  requested  by  activating  the  INTR  or  NMI  inputs  to 
the  8086. 

Interrupt  is  acknowledged  by  the  INTA  pin  on  a  MIN.  mode 
6086  or  by  the  INTA  pin  on  the  8288  in  MAX.  mode. 

Note:  There  is  no  R~D  or  IORC  during  the  interrupt- 
acknowledge  sequence. 
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8086  and  Am7990  LANCE 

The  LANCE,  Am7990,  has  been  designed  to  be  interfaced 
easily  with  the  popular  16-bit  microprocessors  (8086/80186, 
68000,  Z8000,  LSI-1 1 ).  Most  of  the  interface  logic  is  embedded 
inside  the  chip  and  is  program  selectable. 

Although  the  LANCE  itself  has  a  multiplexed  bus,  it  can  easily 
be  interfaced  to  demultiplexed  buses  with  a  minimal  amount  of 
effort.  The  following  designs  assume  that  the  processor  and 
the  LANCE  reside  on  the  same  board.  Address  buffers  and 
data  transceivers  are  set  up  to  be  shared  between  the  proces- 
sor and  the  LANCE.  All  of  these  designs  use  PAL  devices  to 
reduce  the  parts  count. 

The  8086  to  LANCE  interface  requires  a  different  Bus  Request 
handshake,  depending  on  whether  the  8086  is  configured  in 


MAX.  or  MIN.  mode.  The  8086  has  a  bidirectional  signal  for 
both  Bus  Request  and  Bus  Grant  (RQ/GT).  Both  Bus  Re- 
quest (RQ),  and  Bus  Grant  (GT)  to/from  8086  are  one  CPU 
clock  wide,  and  are  synchronous  to  the  CPU  clock.  Figure  1 
shows  a  PAL  device  design  for  the  conversions  in  MAX. 
mode.  This  PAL  device  is  utilized  to  include  other  external 
logic  requirements  for  interfacing  the  LANCE  to  8086.  The 
interface  diagram  is  similar  to  the  one  for  the  80186  to 
LANCE  interface  except  for  the  changes  made  in  program- 
ming the  PAL  device.  The  interface  timing  diagram  is  shown 
in  Figure  3.  The  PAL  device  design  file  is  shown  in  Figure  2. 

Figure  4  shows  a  block  diagram  of  the  8086  to  Am7990 
interface,  in  MIN.  mode.  The  interface  also  employs  a  PAL 
device  to  minimize  parts  count.  The  PAL  device  equations 
are  given  in  Figure  5. 


Figure  1.  8086  RG/GT,  Am7990  HOLD/HOLDA  Conversion  PAL  Device 
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TITLE         808  6  RQ/GT  TO  LANCE  HOLD/HLDA  CONVERTOR 
PATTERN  PAT001 
REVISION  01 

AUTHOR       RASOUL  OSKOUY 

COMPANY     ADVANCED  MICRO  DEVICES 

DATE  11/06/87 

CHIP  CONVERT  PAL16R4 

CLK  NC  HOLD  /GT  NC  NC  NC  NC  NC  GND 
NC     NC  NC       /HLDA  /D2   /R2   /Rl  NC  NC  VCC 

EQUATIONS 

Rl    :=  HOLD       ;8086  RQ/GT  conversion  to  LANCE  HOLD/HLDA 
D2    :=  /Rl         ;both  REQ  and  GRANT  are  one  clock  wide  and 
;are  synchronous  to  CPU  clock 

R2    :=  Rl*/D2 
+  /R1*D2 
HLDA   :=  GT*/R2 

+  HLDA*/D2 

Figure  2.  8086  RG/GT,  Am7990  HOLD/HLDA  Conversion  PAL  Device  Design  File 


TITLE         8086  MIN  MODE  TO  LANCE  INTERFACE 
PATTERN  PAT 
REVISION  01 

AUTHOR       KHUYNH  NGUYEN 

COMPANY     ADVANCED  MICRO  DEVICES 

DATE  11/06/87 

CHIP  LANC_INT  PAL16L8 

ALE   /AS     DTR     NC     NC     DEN     NC     /READY     HLDA  GND 
NC     CPURDY     READ      /R     /T     /DAS      /WR     /RD     LE  VCC 

EQUATIONS 

/LE  =  /ALE 
+  /AS 

/CPURDY  =  /READY  ;   CPU    (86)    is  a  bus  master 

DAS  =  RD  +  WR 
DAS.TRST  =  /HLDA 
/READ  =  DTR 
/READ . TRST  =  /HLDA 
T  =  DTR 

T . TRST  =  /HLDA 
R  =   /DTR  *  DEN 

R.TRST  =  /HLDA       ;   LANCE  is  a  bus  master 

RD   =  READ   *  DAS 
RD . TRST  =  HLDA 
WR  =  /READ  *  DAS 
WR . TRST  =  HLDA 


Figure  5.  Source  Listing  for  Example  of  Figure  4 
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Figure  3.  AmPAL16R4,  8086  (Max.  Mode)  LANCE  Interface  Timing  Diagram 
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Figure  4.  Am7990  to  8086  Interface 
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Am9516  in  MIN.  Mode 

Figure  1  illustrates  the  interface  of  the  Am951 6  to  the  8086  in 
the  MIN.  mode  configuration.  Figure  3  illustrates  the  interface 
in  MAX.  mode.  The  interfaces  could  be  accomplished  by 
using  rather  complex  implementation  of  standard  SSI/MSI 
logics.  Examples  here  replace  the  logic  portion  with  a  PAL 
device.  The  MIN.  Mode  uses  the  PAL  device  16L8.  This  is  a 
good  example  of  "garbage  collection".  It  reduces  the  amount 
of  real  estate,  interconnections,  parts,  and  part  types. 


Both  interface  examples  accomplish  two  major  functions.  First, 
when  the  Am9516  is  bus  master,  it  converts  RD  and  WR  into 
R/W  and  DS,  and  vice  versa  when  the  Am9516  is  not  the  bus 
master.  Secondly  the  buffer  controls,  TBEN  and  REBN,  are 
generated  from  DEN  and  DT/R  . 

The  two  examples  show  different  types  of  latches  and 
transceivers  and  there  are  many  more  to  choose  from.  The 
designer  selects  those  that  best  meet  system  requirements 
while  trying  to  minimize  the  number  of  different  parts  that  must 
be  stocked.  Figure  2  shows  the  PAL  device  equations  for  this 
example. 
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Figure  1.  The  Am9516  UDC  to  8086  CPU  Interface  (Minimum  Mode) 
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TITLE         Am9516  TO  8086  MIN  MODE  INTERFACE 

PATTERN  01-3.66 

REVISION  01 

AUTHOR        JOE  ENGINEER 

COMPANY     ADVANCED  MICRO  DEVICES 

DATE  11/05/87 

CHIP  DMA_INTFCE  PAL16L8 

;Am9516  to  8086  min  mode  interface  chip 


NC  ALED  ALEP  HLDA  BW  ADO  DT  /DEN  /SEL  GND 
NC  /RBEN  /RD  ALE  AO  /RW  /DS  /WR  /TBEN  VCC 


EQUATIONS 

DS  =  RD  +  WR 
DS.TRST  =  /HLDA 

RW   =  DT 

RW.TRST  =  /HLDA 

TBEN   =   /DT   *   /SEL   *  DEN 
TBEN.TRST   =  /HLDA 

RBEN   =   DT   *   /SEL   *  DEN 
RBEN.TRST   =  /HLDA 

RD   =   /RW   *  DS 
RD.TRST  =  /HLDA 

WR  =  RW  *  DS 
WR.TRST  =  /HLDA 

ALE  =  /ALEP   *  /ALED 

AO  =     /ADO   *  /BW  *  HLDA  *  ALED 
+  ADO   *   BW  *  HLDA  *  ALED 
+  /ADO   *  /HLDA  *  ALEP 
+  AO    *   /ALEP  +  AO   *  /ALED 

DESCRIPTION 

;THIS   PAL  DEVICE  CONVERTS   THE  CONTROL  SIGNALS   TO  INTERFACE   THE  flOfib 
;IN  MIN  MODE  TO  THE  AmlBlfc  DMA  CONTROLLER.      ANOTHER  EXAMPLE  SHOWS 
;HOW  THIS  IS  DONE  IN  MAX  MODE. 

;    SIMULATION  NOT  INCLUDED 


Figure  2.  Source  Listing  for  Example  of  Figure  1 
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Am9516  in  MAX.  Mode 

This  MAX.  mode  interface  between  the  8086  and  Am9516 
(Figure  3)  also  uses  PAL  devices  to  reduce  component 
count. 

The  example  makes  several  assumptions  which  result  in  a 
slightly  more  complex  design  than  absolutely  necessary.  The 
8086  is  assumed  to  be  in  MAX.  Mode,  and  the  design  has  to  be 
compatible  with  a  MULTIBUS  or  similar  interface;  the  16R4 
and  8288  could  be  eliminated  if  this  is  not  the  case. 
The  16R6  (Figure  4)  is  a  PAL  device  that  performs  a  function 
similar  to  the  8288,  that  is,  it  converts  the  processor's  status 
signals_and  clock  into  control  signals.  In  this  case,  the  signals 
are  R/W,  Data  Strobe  (DS),  Interrupt  Acknowledge  (INTA) 
and  Peripheral  Acknowledge  (PACK).  This  PAL  device 
basically  generates  ZBUS  signals  for  Zilog  peripherals. 
In  the  example,  it  connects  to  the  Am9516  (UDC).  The  DMA 
Controller  is  very  similar  to  the  AmZ801 6  except  its  bus  inter- 
face has  been  modified  to  interface  to  non-multiplexed 
buses.  This  was  changed  from  the  previous  design  using  a 
16R8  to  eliminate  problems  due  to  skew  between  OSC 
and  CLK. 

The  1 6R4,  Figure  5,  has  two  functions.  The  first  is  connecting 
the  MIN.  Mode  protocol  of  the  Am9516  or  similar  device  to 


the  MAX.  Mode  protocol  for  bus  exchange.  The  74LS03S  are 
used  to  aid  in  this  function.  The  timing  waveform  illustrates 
what  happens  in  detail.  The  basic  philosophy  is  that  a  rising 
edge  on  the  HOLD  input  generates  a  pulse  that  is  one  clock 
wide.  The  CPU  samples  this  pulse  and,  in  response,  issues  a 
one-clock-wide  pulse.  The  1 6R4  uses  this  response  pulse  to 
generate  HLDA.  When  the  Am9516  has  completed  the  nec- 
essary transfer,  HOLD  transitions  HIGH  to  LOW.  This  gener- 
ates another  pulse  to  the  CPU  signalling  that  the  Am9516  is 
done  and  that  the  CPU  may  continue. 

The  second  function  of  the  16R4  is  the  conversion  of  RW, 
DS  and  M/IO  into  the  MULTIBUS-compatible  signals 
/MRDC,  /MWTC,  /IORC,  and  IOWC,  when  HLDA  is  HIGH.  It  is 
possible  to  collapse  the  74LS03S  into  the  PAL  device,  thus 
reducing  the  external  logic  required  to  only  one  open  collector 
inverter.  The  disadvantage,  however,  is  that  it  adds  two  addi- 
tional clocks  for  the  bus  exchange  overhead,  one  during  ac- 
quisition of  the  bus  and  one  on  bus  release.  For  block 
transfers,  this  is  not  significant  but  it  may  be  undesirable  when 
performing  single  transfers,  or  short  burst,  or  when  in  Demand 
Mode.  To  eliminate  the  74LS03,  change  the  equation  for  R2  to 
/R1  -  /D2  +  R1  •  D2, 

and  then  drive  the  /[RQ/GT]  line  with  a  74LS05  from  the  R2 
output. 
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TITLE         8086  TO  85XX  PERIPHERAL  INTERFACE 

PATTERN  01-3.68 

REVISION  01 

AUTHOR       JOE  BRCICH 

COMPANY     ADVANCED  MICRO  DEVICES 

DATE  11/06/87 

CHIP  DMA_INTRR  PAL16R6 

CLOCK  RESET  CLK  /SO  /SI  /S2  /LOCK  NC  NC  GND 
/OE         /AS         /PI       /RW       /DS     /PO     /IACK     /RDY     CLKD  VCC 

EQUATIONS 


/RESET 

* 

SO 

* 

/PO  * 

/PI 

+ 

/RESET 

SI 

/PO  * 

/PI 

+ 

/RESET 

S2 

/PO  * 

/PI 

+ 

/RESET 

* 

SO 

PI 

+ 

/RESET 

SI 

* 

PI 

+ 

/RESET 

* 

S2 

★ 

PI 

/RESET 

* 

/PO 

*  /PI 

+ 

/RESET 

PI 

* 

so 

+ 

/RESET 

* 

PI 

SI 

+ 

/RESET 

* 

PI 

S2 

DS   :-  /IACK  *   /PO  *  PI  *     SO  *  /SI  *  S2 

+  /IACK  *  /PO  *  PI  *  /SO  *     SI  *  S2 

+     IACK  *     SO  *  SI  *  S2  *  PO  *  /PI  *  LOCK 

+     DS  *  SO     *  SI  *  S2 

+     DS  *  SO     *  /SI  *  S2 

+     DS  *  /SO  *  SI  *  S2 

RW   :-  SO  *  /SI  *  S2 

IACK   :=   /RESET  *  SO   *   SI   *   S2   *   /PO   *   /PI   *  /LOCK 

+  /RESET  *  IACK  *  SO  *  SI   *  S2   *  PO  *  /PI  *  /LOCK 

+  /RESET  *  IACK  *  LOCK  *  /DS 

+   /RESET  *  IACK  *   /LOCK  *   DS   *   /PO   *  PI 

RDY   :=  /RESET  *     SO     *  /SI  *  S2   *  PO  *  PI 
+  /RESET  *  /SO     *     SI   *  S2   *  PO  *  PI 
+  /RESET  *  RDY     *     SO  *  /SI  *  S2 
+  /RESET  *  RDY     *   /SO  *  SI   *  S2 
+  /RESET  «  IACK  *     SO  *  SI  *  S2   *  DS 
+  /RESET  *  RDY     *     SO  *  SI  *  S2 

/CLKD  -  CLK 

AS  =  /CLKD  *  PO  *  /PI  *  /IACK  *  CLK 
.■DESCRIPTION 

;THIS  PAL  DEVICE  TRANSLATES  8086  BUS  SIGNALS  INTO  COMPATIBLE  SIGNALS  FOR 
; THE  9516.     IT  IS  ALSO  APPLICABLE  TO  85XX  PERIPHERALS  BY  ALTERING  /RW  AND 
;/DS  TO  /RD  AND  /WR .     ONE  FLIP-FLOP  IS  AVAILABLE  TO  GIVE  THE  NECESSARY 
; DELAY  TO  THE  FALLING  EDGE  OF  /WR.     THE  DATA  STROBE  TIMING  FOR  A  WRITE 
,-CYCLE  IS  DELAYED  UNTIL  THE  FALLING  EDGE  OF  T2  TO  MEET  THE  REQUIREMENTS 
;OF  THE  85XX  PARTS.     THIS  DESIGN  ASSERTS  RDY  TO  DEMAND  ONE  WAIT  STATE 
,-FROM  THE  8086.     THIS  WAIT  STATE  IS  NOT  LONG  ENOUGH  FOR  DESIGNS  WHICH  USE 
; AN  8  MHz  8086.     THEREFORE,   WITH  AN  8  MHz  CPU,    85XX  PERIPHERALS  SHOULD  BE 
,-USED.     AS  AN  ALTERNATIVE,   THREE  WAIT  STATES  CAN  BE  USED  BY  ALTERING  THE 
; RDY  EQUATION.     THIS  PAL  DEVICE  ALSO  TRANSFORMS  THE  8086  TWO-CYCLE 
.•INTERRUPT  ACKNOWLEDGE  INTO  A  SINGLE  CYCLE  OF  THE  TYPE  NECESSARY  FOR  85XX 
,-PARTS.     THIS  IS  MADE  POSSIBLE  BY  SAMPLING  THE  LOCK  STATUS,   PO,   PI,  AND 
;IACK  SIGNALS . 

Figure  4.  Source  Listing  for  Example  of  Figure  3 
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TITLE         Am9  516  TO  8086  INTERFACE 

PATTERN  01-3.67 

REVISION  01 

AUTHOR        JOE  ENGINEER 

COMPANY     ADVANCED  MICRO  DEVICES 

DATE  11/06/87 

CHIP  DMA_INTFCE2  PAL16R4 

,-8086  to  Am9516  interface 

CLK  /RQGT  HOLD  NC  NC  NC  /RW  /DS  MIO  GND 
/OE  /MWTC  /MRDC  HLDA  /D2  /R2     /Rl     /IOWC  /IORC  VCC 

EQUATIONS 

IORC  =  /MIO  *   DS  *  /RW 
IORC.TRST  =  HLDA 

IOWC  =  /MIO  *  DS   *  RW 
IOWC.TRST  =  HLDA 

MRDC   =  MIO   *   DS   *  /RW 
MRDC.TRST  =  HLDA 

MWTC  =  MIO  *   DS  *  RW 
MWTC.TRST  =  HLDA 

Rl      :=  HOLD 

R2      :=  /Rl 

D2      :=  Rl 

/HLDA   :=  /Rl   +  /D2   *  /HLDA  + 
/RQGT  *  /HLDA 

.•DESCRIPTION 

;THIS  DEVICE  CONVERTS  THE  MIN  MODE  SIGNALS  HOLD  AND  HLDA  TO  THE 
;MAX  MODE  RQGT  PROTOCOL.     ADDITIONALLY  IT  GENERATES  THE  flEflfl 
; EQUIVALENT  CONTROL  OUTPUTS  MRDC,   MWTC,   IORC,   AND  IOWC.  THIS 
;PAL  DEVICE  WAS   USED  TO  CONNECT  THE  AmqSlb  TO  THE   ADflb   IN   MAX  MODE. 

;    SIMULATION  NOT  INCLUDED 


Figure  5.  Source  Listing  for  Example  of  Figure  3 
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80286  to  Am9568  Data  Ciphering 
Processor  Interface 

This  interface  is  designed  for  an  8-MHz  CPU  where  the  Data 
Ciphering  Processor  (DCP)  is  synchronously  operating  at  the 
maximum  clock  rate  of  4  MHz.  A  block  diagram  for  the  inter- 
face is  shown  in  Figure  1 .  The  Am9568  requires  a  narrower 
width  of  address  strobe  than  the  Am9518.  This  works  comfor- 
tably with  the  60-ns  address  strobe  width  of  an  8-MHz  CPU. 

The  MULTIBUS  Mode  Select  input  of  the  Bus  Controller  82288 
is  tied  LOW  to  optimize  the,  command  and  control  signals  for 
short  bus  cycles.  The  Command  Delay  (CMDLY)  becomes 
active-HIGH  for  one  16-MHz  clock  cycle  whenever  the  DCP  is 
selected  to  delay  the  Read  and  Write  strobes  by  125  ns.  This 
satisfies  the  timing  requirement  of  the  minimum  delay  between 
ALE  inactive  and  Read  or  Write  strobe  active  of  the  DCP.  An 
open-collector  gate  must  be  added  to  allow  other  peripherals  to 
drive  this  input. 

The  ALE,  IORC  and  IOWC  outputs  of  the  82288  are  wired 
directly  to  the  DCP.  ALE  strobes  a  D-Flip-Flop  to  store  the  state 
of  Chip  Select  for  the  entire  cycle. 

Q  3  and  the  latched  Chip  Select  CSL  are  ANDed  externally  to 
generate  the  Synchronous  Ready  for  the  82284.  The  82284 
samples  the  line  at  the  falling  edge  of  the  clock.  The  registered 


output  Q  3  is  clocked  with  the  rising  edge  of  the  same  clock, 
thus  satisfying  the  setup  and  hold  time  requirements  of  the 
82284.  Two  Wait  States  are  inserted. 

Half  of  the  PAL  device  operates  as  a  bidirectional  Ad- 
dress/Data Multiplexer.  During  the  Address  Latch  Enable  ac- 
tive phase,  the  state  of  A-|  and  A2  is  transferred  to  the  AD-|  and 
AD2  pins  of  the  PAL  device.  The  DCP  latches  this  two  bit- 
address  with  the  falling  edge  of  ALE. 

When  IORC  and  CSL  are  active,  the  states  of  AD,  and  AD2 
are  passed  to  D1  and  D2  respectively.  The  DCP  Register  can 
be  read.  If  IOWC  and  CSL  are  active,  the  data  path  is  turned 
around:  D,  and  D2  are  inputs,  AD1  and  AD2  are  outputs. 

The  Address  Hold  Time  of  the  PAL  device  is  sufficient  because 
the  address  information  is  passed  to  AD,  and  AD2  whenever 
IORCCSL  or  lOWC'CSL  are  not  true,  i.e.  whenever  data  is 
not  transferred  between  the  CPU  and  the  DCP. 

The  Read  Data  Hold  Time  requirement  of  5  ns  of  the  Am9568 
is  satisfied  by  the  propagation  delay  of  the  PAL  device. 

The  Read  Data  Hold  Time  requirement  of  5  ns  of  the  iAPX286 
is  also  satisfied  by  the  PAL  device. 

The  Master  Port  Chip  Select  (MCS)  input  of  the  DCP  is  con- 
nected to  the  unlatched  address  decoder  output. 

Figure  3  shows  the  source  listing  and  the  pin  descriptions. 
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Figure  1.  80286  to  Am9568  Interface 
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The  DCP  Clock 

The  PAL  device  synchronizes  the  DCP  clock  to  the  Data 
Strobes  IORC  and  IOWC  (Figure  2).  It  also  divides  the 
16  MHz  system  clock  (8  MHz  CPU  clock)  down  to  the  maxi- 
mum DCP  clock  rate  of  4  MHz.  At  this  clock  rate,  the  Data 
Strobe  Delay  to  the  DCP  clock  must  be  0-30  ns.  The  Bus 
Controller  is  specified  to  generate  a  Data  Strobe  timing  of 
3-15  ns  to  the  falling  edge  of  CLK  (16  MHz).  Because  of  the 
higher  propagation  delay  of  a  standard  PAL  device,  the  regis- 
tered outputs  are  toggled  at  the  rising  edge  of  CLK  before  the 
Data  Strobes  become  inactive.  This  gives  an  additional 
32.5  ns  for  the  DCP  clock  signal  path. 

Qi  to  Q3  are  three  outputs  of  the  PAL  device  state  machine. 
The  registered  outputs  are  clocked  with  thejising  edge  of  the 
16-MHz  82284  clock.  Whenever  ALE  and  CS  are  active,  Qi  to 
Q3  are  set  to  the  initial  state.  Qi  to  Q3  are  outputs  of  a  3-bit 
down  counter,  with  Q3  as  the  most  significant  bit. 

Q3  is  used  to  generate  the  SRDY  signal  for  the  82284  as 
mentioned  above. 

Q2  is  the  DCP  clock.  This  design  must  guarantee  that  the 
minimum  DCP  clock  HIGH  or  LOW  time  is  at  least  115  ns  or 
two  16-MHz  clock  cycles.  This  is  done  by  toggling  Q2  only 


during  phase  2  cycles  of  the  CPU.  The  CPU  design  guarantees 
that  there  is  always  a  phase  1  cycle  between  two  phase  2 
cycles. 

Assuming  a  typical  PAL  device  propagation  delay  of  25  ns, 
timing  parameter  tCDS  (Time  Clock  Data  Strobe)  is  10.5  to 
22.5  ns  ( 3  +  32.5  -  25  ns  to  1 5  +  32.5  -  25  ns).  This  satisfies 
the  0  to  30  ns  requirement. 

The  AmPAL16R4  has  active-LOW  outputs.  But  one  output,  Q2, 
should  be  active-HIGH.  The  equation  for  Q2  was  derived  to  be 

Q2  =  ALE  *  CS  +  Q,  *  Q2  +  Q,  *  Q2 

To  compensate  for  the  inversion  in  the  PAL  device,  the 
PALASM  software  minimizer  (or  16RP4)  can  be  used  to 
convert  it  to  the  form  shown  in  the  PAL  device  Design  Speci- 
fication. 

Improvements 

The  DCP  needs  two  Wait  States  only  when  the  Control  Reg- 
isters are  read.  Data  Register  read  or  writes  and  Control 
Register  reads  can  be  executed  with  only  one  Wait  State, 
which  improves  the  Data  Ciphering  speed  of  this  interface. 
The  more  sophisticated  Wait  control  logic  and  the  two  exter- 
nal TTL  gates  can  be  integrated  into  one  PAL22V10  device. 
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80286  to  Am9568  Data  Ciphering  Processor  Interface 


TITLE         80286  TO  Am9568    (DCP)  INTERFACE 
PATTERN     DCP0  4  3 
REVISION  01 

AUTHOR       JUERGEN  STELBRINK 
COMPANY     ADVANCED  MICRO  DEVICES 
DATE  11/06/87 

HIP  DCP_INTFCE  PAL16R4 

CLK  /CS  CSL  ALE  /IORC  /IOWC  Al  A2  NC  GND 
/OE     Dl       D2       /Ql     Q2  /Q3         CMDLY  AD1     AD 2  VCC 

EQUATIONS 

Ql         :=  ALE*CS 
+  /Ql 

/Q2  :=  Q1*/Q2*/ALE 
+  Q1*/Q2*/CS 
+  Q1*Q2*/ALE 
+  Q1*Q2*/CS 

Q3  :=  ALE*CS 

+  Q1*Q2*Q3 
+  /Q1*Q2*Q3 
+  Q1*/Q2*Q3 
+  /Ql*/Q2*/Q3 

/CMDLY : =  /ALE 
+  /CS 

/Dl   =  /AD1 

Dl.TRST  =  CSL  *  IORC 
/D2   =  /AD2 

D2.TRST  =  CSL  *  IORC 

/AD1  =  /A1*ALE  +  /D1*/ALE 
AD1.TRST  =  CSL  *  IORC 

/ AD2   =  /A2*ALE  +  /D2*/ALE 
AD2.TRST  =  CSL  *  IORC 

;    SIMULATION  NOT  INCLUDED 


Figure  3.  Design  File  for  the  Example  of  Figure  1 
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80286  to  Am8530  Interface 

The  Am8530  is  a  high-speed,  dual-channel  serial  communi- 
cations controller  that  supports  a  variety  of  advanced 
communication  protocols.  It  supports  data  rates  up  to 
1.5M  bps. 

This  design  note  shows  an  80286  to  an  Am8530  CPU  inter- 
face when  no  interrupts  are  used  (Figures  1  and  2).  The 
design  is  for  a  CPU  running  at  5  MHz  with  a  100  ns  system 
clock  cycle  time.  This  clock  is  used  to  generate  the  clock  for 
the  6  MHz  Am8530  SCC. 

The  chip  select  generated  by  the  external  decoder  in  an 
80286  system  that  is  enabled  by  the  M/IO  line  will  go  inactive 
with  the  falling  edge  of  TCLK-phase  1 .  However,  the  chip 
enable  for  the  SCC  should  stay  valid  for  the  entire  read  or 


write  cycle.  The  PAL  device  generates  the  chip  enable  signal 
for  the  SCC  using  a  structure  equivalent  to  two  cross-coupled 
NOR  gates. 

The  Am8530  requires  a  minimum  pulse  width  of  250  ns  for  RD 
and  WR  control  signals.  For  a  100-ns  cycle  time  80286  with  no 
wait  state,  these  controls  will  be  active  for  200  ns  only.  SRDY 
control  is  generated  for  the  82284  bus  controller  to  guarantee 
minimum  RD  and  WR  pulse  widths  for  the  Am8530. 

CMDLY  is  not  used  in  this  design  because  the  address  to  read 
or  write  control  setup  time  required  is  only  80  ns  for  the 
Am8530;  with  a  100  ns  system  cycle  time,  the  time  between 
ALE  valid  and  /IORC  or  /IOWC  valid  is  100  ns.  Even  after 
accounting  for  the  address  latch  delay.  80  ns  timing  will  still 
be  met. 
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"  THIS  PLPL  FILE  IS  FOR  A  16R4  THAT  IMPLEMENTS  THE  LOGIC  "  COUNT  DOWN  SPECIFICATION  FOR  THE  COUNTER.  " 

NECESSARY  TO  INTERFACE  AN  Am8530  (SCO  TO  AN  80286  SYSTEM.  »      «  Q2  OF  THE  COUNTER  IS  THE  CLOCK  INPUT  OF  THE  Am8530.  » 


DEVICE  Am8530_to_80286  (PAL16R4) 
PIN 


CLK 

1 

VCC 

20 

/CS 

2 

m 

19 

ALE 

3 

/INTSO 

18 

/IORC 

4 

/Q[1] 

17 

/I  owe 

5 

/Q  [2] 

16 

NC1 

6 

/Q  [3] 

15 

NC2 

7 

NC5 

14 

NC3 

8 

/SRDY 

13 

NC4 

9 

NC6 

12 

GND 

10 

NC7 

= 

11  ; 

BEGIN 

"  CHIP  ENABLE  FOR  THE  Am8530  IS  DERIVED  FROM  ALE  AND 
THE  EXTERNAL  DECODER  CHIP  SELECT  OUTPUT.  " 

CE  =  CS  *  ALE  +  INTSO  ; 

INTSO  =  /(  IORC  *  IOWC)  +  CE  ; 

11  SYNCHRONIZE  THE  COUNTER  WITH  ALE.  " 

IF  (  CS  *  ALE  )  THEN  Q[3:1]  =  7  ; 


CASE  (  Q[3:1]  ) 


BEGIN 

0  )  Q[3:1] 

7 

1  )  Q[3:1] 

0 

2  )  Q[3:1] 

1 

3  )  Q[3:1] 

2 

4  )  Q[3:1] 

3 

5  )  Q[3:1] 

4 

6  )  Q[3:1] 

5 

7  )  Q[3:1] 

6 

END  ; 

"  SRDY  IS  GENERATED  FOR  WAIT  STATE  GENERATION,  IT  IS  SENT 
TO  THE  82284  CLOCK  GENERATOR.  " 

SRDY  =  /(Q[3]  *  CE)  ; 

END. 


Figure  2.  Source  Listing  for  AmPAL16R4  for  the  Example  of  Figure  1 


Interfacing  to  the  68000/68020 


The  68000  has  an  asynchronous,  16-bit,  bidirectional  data 
bus.  Data  types  supported  by  the  68000  are:  bit  data,  integer 
data  of  8,  16,  or  32  bits,  32-bit  addresses  and  binary-coded 
decimal  data.  It  can  transfer  and  accept  data  in  either  words 
or  bytes.  The  DTACK  input  indicates  the  completion  of  a  data 
transfer.  When  the  processor  recognizes  DTACK  during  a 
read  cycle,  the  data  is  latched  and  the  bus  cycle  terminates. 
When  DTACK  is  recognized  during  a  write  cycle,  the  bus 
cycle  also  terminates.  An  active  transition  of  DTACK  indi- 
cates the  termination  of  a  data  transfer  on  the  bus.  All  control 
and  data  lines  are  sampled  during  the  68000's  clock  HIGH 
time.  The  clock  is  internally  buffered,  which  results  in  some 
slight  differences  in  the  sampling  and  recognition  of  various 
signals.  The  68000  mask  sets  prior  to  CC1  and  allows 
DTACK  to  be  recognized  as  early  as  S2,  and  all  devices 
allow  BERR  or  DTACK  to  be  recognized  in  S4,  S6,  etc., 
which  terminates  the  cycle.  If  the  required  setup  time  is  met 
during  S4,  DTACK  will  be  recognized  during  S5  and  S6,  and 
data  will  be  captured  during  S6.  DTACK  signal  is  internally 
synchronized  to  allow  for  valid  operation  in  an  asynchronous 
system.  If  an  asynchronous  control  signal  does  not  meet  the 
required  setup  time,  it  is  possible  that  it  may  not  be 
recognized  during  that  cycle.  Because  of  this,  synchronous 
systems  must  not  allow  DTACK  to  precede  data  by  more 
than  40  to  240  nanoseconds,  depending  on  the  speed  of  the 
particular  processor.  I/O  is  memory-mapped,  i.e.,  there  are 
no  special  I/O  control  signals,  any  peripheral  is  treated  as  a 
memory  location. 


DMA:  This  Bus  is  requested  by  activating  the  BR  input  of  the 
68000.  Bus  Arbitration  is  started  by  the  BG  output  going  active. 
The  Bus  is  available  when  AS  becomes  inactive.  The  request- 
ing  device  must  acknowledge  bus  mastership  by  activating  the 
BGACK  input  to  the  CPU. 

The  23-bit  address  (A-|  ...A23)  is  on  a  unidirectional,  three-state 
bus,  and  can  address  8  M  words  '16  M  bytes)  of  memory  or 
I/O.  It  provides  the  address  for  bus  operation  during  all  cycles, 
except  the  interrupt  cycles.  During  interrupt  cycles,  address 
lines  A1,  A2  and  A3  provide  information  about  the  level  of 
interrupt  being  serviced.  Instead  of  Ao  and  BYTE/WORD , 
there  are  two  separate  data  strobe  lines  for  the  two  bytes  in  a 
word.  A  note  of  caution  here,  the  68000  treats  the  MSB  of  the 
lower  byte  as  an  even  byte,  or  word  address.  The  same  goes 
with  processors  such  as  the  Z8000.  Processors  such  as  the 
8086  treats  the  lower  byte  as  the  odd  byte. 

Interrupt  is  requested  by  activating  any  combination  of  the  in- 
terrupt inputs  to  the  68000  (IPL0...2),  indicating  the  encoded 
priority  level  of  the  interrupt  requester  (inputs  at  or  below  the 
current  processor  priority  are  ignored).  The  68000  automati- 
cally saves  the  status  register,  switches  to  supervisor  mode, 
fetches  a  vector  number  from  the  interrupting  device,  and  dis- 
plays the  interrupt  level  on  the  address  bus.  For  interfacing 
with  old  68000  peripherals,  the  68000  issues  an  Enable  signal 
at  one-tenth  of  the  processor  clock  frequency.  There  are  a 
number  of  AMD  proprietary  third  generation  peripherals  that 
can  be  interfaced  to  the  68000  CPU,  to  improve  system  perfor- 
mance. This  section  deals  mainly  with  the  interfacing  of  the 
68000  and  some  of  the  AMD  proprietary  peripherals. 
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The  68000  to  Am8530  Interface 
with  Interrupts 


— 


This  example  shows  how  a  PAL  device  simplifies  the  task  of 
interrupt  generation  compared  to  the  MSI  implementation. 
The  block  diagram  for  the  interface  via  a  PAL  device  is  shown 
in  Figure  1 .  The  timing  diagram  (Figure  2)  illustrates  the  Inter- 
rupt  Acknowledge  cycle.  As  in  the  other  designs,  RD  is 
generated  during  Interrupt  Acknowledge  to  place  the  vector 
on  the  bus. 


The  timing  during  register  programming  is  not  shown.  The  PAL 
device  allows  selection  of  one  or  two  Wait  States  by  making  W0 
HIGH  or  LOW,  respectively.  The  table  below  shows  the  appro- 
priate number  of  Wait  States  as  a  function  of  CPU  speed. 


Part 

CPU  Speed 

Wait  States 

85XX 

4  MHz 

1 

85XX 

6  MHz 

2 

85XXA 

8  MHz 

2 

85XXA 

10  MHz 

2 

Figure  1.  68000  to  Am8530  Connection  Using  a  PAL  Device 
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The  68000  to  Am8530  Interface  with  Interrupts 
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Figure  2.  Timing  Diagram  for  68000  to  Am8530  Interface 
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The  68000  to  Am8530  Interface  with  Interrupts 


TITLE         68000  TO  8500  OR  9500  PERIPHERALS 

PATTERN     PAT00  2 

REVISION  01 

AUTHOR       JOE  BRCICH 

COMPANY     ADVANCED  MICRO  DEVICES 

DATE  11/06/87 

CHIP  PERI_INTFC  PAL16R4 

CLOCK     /CS     RW     /LDS     /W0     /AS     FCO     FC1     FC2  GND 
/OE       /INTA     /ACK       /C     /B     /A     /DLDS     /RD     /WR  VCC 

EQUATIONS 

=  A*/B 
+  B  *C 
+  /AS 

=  A*/C 
+  /A*C 
+  /AS 

=  /A*/B*AS 
+  B*C*AS 

DLDS    :=  LDS 

5  =  LDS * DLDS *RW*/ INTA 
+  A*C*INTA*AS 
+  A*/B*INTA*AS 


WR 


LDS  *  /RW 


INTA  =  FC0*FC1*FC2*AS 

ACK  =  /INTA*/A*/B*/C*/WO 
+  /INTA*/A*/B*C*WO 
+  INTA*/B*A 
+     ACK   *  LDS 

; DESCRIPTION 

;THIS  PAL  DEVICE  INTERFACES  85XX  TYPE  PERIPHERALS  TO  THE  68000 
; MICRO  PROCESSOR.     IT  INSERTS  1  OR  2  WAIT  STATES  AS  SELECTED  BY 
;/W0=0  IS  ONE  AND  /W0=1  IS  TWO  WAIT  STATES.     FOUR  WAIT  STATES  ARE 
; INSERTED  DURING  INTERRUPT  ACKNOWLEDGE  CYCLES.     ALSO  THE  RD  OUTPUT 
; GENERATED  DURING  INTA  IS  A  FUNCTION  OF  THE  INTERNAL  STATE 
; MACHINE  AND  NOT  A  FUNCTION  OF  LDS.     OE  CAN  BE  LEFT  OPEN  SINCE 
; THE  FLIP  FLOP  OUTPUTS  ARE  NOT  USED  DIRECTLY.     THE  FALLING  EDGE 
;OF  RD  IS  DELAYED  IN  ORDER  TO  GUARANTEE  THE  CS  TO  RD  SETUP  TIME 
REQUIREMENTS. 

;    SIMULATION  NOT  INCLUDED 


Figure  3.  Source  Listing  for  the  Example  of  Figure  1 
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68000  and  Am7990  LANCE  Interface 


The  design  on  the  LANCE  has  made  it  easier  for  the  user  to 
interface  the  device  with  demultiplexed  buses.  The  example 
shown  here  is  an  interface  to  be  compatible  with  an  8  MHz  or 


faster  68000  (Figure  1 ).  The  two  flip-flops  are  needed  to  adapt 
the  LANCE  bus  request  handshake  to  the  68000. 
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Figure  1 .  Am7990  to  68000  Interface 
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68000  and  Am7990  LANCE  Interface 


Autovectoring  is  used  since  the  Am7990  does  not  return  a 
vector  during  interrupt  acknowledge  cycles.  The  BYTE  and 
DAS  signals  of  LANCE  are  used  to  generate  the  UPS  and  LDS 
when  LANCE  is  in  Bus  Master  mode;  the  UDS  and  LDS  is  used 
to  generate  the  DAS  when  LANCE  is  in  Bus  Slave  mode.  It 


takes  two  latches  to  demultiplex  the  LANCE  address/data 
lines  to  adapt  to  the  68000  address  bus.  The  flip-flops  can  be 
replaced  by  a  PAL22V10  to  minimize  parts  count.  Equations 
for  the  PAL16L8  are  shown  in  Figure  2. 


TITLE         68000  TO  LANCE  INTERFACE 

PATTERN  PAT002 

REVISION  01 

AUTHOR       JOE  BRCICH 

COMPANY     ADVANCED  MICRO  DEVICES 

DATE  11/06/87 

CHIP   LANCE   IN  PAL16L8 

/AS     RW     BYTE     /HOLD     NC     /BG     AO     NC     /BGACK  GND 
/CS     /TB     /UDS     /DAS     /CLR1     BR     /CLR2     /LDS     /RB  VCC 

EQUATIONS 

RB  =  CS*RW*UDS 
+  CS*RW*LDS 
RB.TRST  =  /BGACK 

TB   =  CS*/RW 
TB.TRST  =  /BGACK 

UDS   =   DAS  */ AO  * BYTE 

+  BYTE*DAS 
UDS.TRST  =  BGACK 

LDS   =   DAS*A0  *BYTE 

+  /BYTE*DAS 
LDS.TRST  =  BGACK 

DAS   =  UDS*LDS 
DAS.TRST  =  /BGACK 

CLR1   =  /AS*BG 

CLR2  =  BGACK  ; DELAY 

/BR  =  /HOLD 

; DESCRIPTION 

; THE  GOAL  OF  THIS  INTERFACE  WAS  TO  BE  COMPATIBLE  WITH  8  MHZ 

; AND  FASTER  68000 'S  WHILE  MINIMIZING  PARTS  COUNT.  THE 

;Am22V10  COULD  BE  USED  TO  ELIMINATE  THE  TWO  FLIP-FLOPS 

; SHOWN.      AUTOVECTORFING  IS  USED  SINCE  THE  7990  DOES  NOT 

; RETURN  A  VECTOR  DURING  INTERRUPT  ACKNOWLEDGE  CYCLES. 

;  NOTE  PROGRAM  BSWP,   BCON  TO  1,   AND  ACON  TO  0   IN  CSR3  REG. 

;    SIMULATION  NOT  INCLUDED 


Figure  2.  Source  Listing  for  the  Example  of  Figure  1 
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68000  to  AmZ8068  Data  Ciphering 
Processor  Interface 


Figures  1  and  2  show  the  68000-DCP  interface  and  the  inter- 
face timing.  This  interface  provides  a  two-chip  solution  to  add 
high-speed  data  ciphering  to  a  68000-based  system.  About 
500  kbyte/sec  are  possible  in  a  CPU-controlled  transfer.  The 
ciphering  rate  can  be  increased  with  a  sophisticated  DMA 
controller,  or  with  several  DCPs  operating  in  parallel.  The 
CPU  operates  at  8  MHz  and  the  DCP  operates  synchro- 
nously at  4  MHz.  The  Interface  Controller,  a  PAL  device,  gen- 
erates the  Address  and  Data  Strobes  for  the  DCP  and  the 
Data  Acknowledge  for  the  CPU.  It  also  divides  the  CPU  clock 
by  two  and  synchronizes  it  to  the  Data  Strobes. 

The  main  features  of  this  interface  are: 

—  Multiplexed  Control  Mode 

—  Demultiplexed  address  and  data  bus 

—  Two-Cycle  Operation 


—  Clock  Synchronization  with  two  Low  Cycles  after  the  Data 
Strobes 

—  About  500  kbyte  sec  ciphering  speed 

Data  transfers  between  the  CPU  and  the  DCP  are  accom- 
plished by  a  two-cycle  operation.  First  the  address  of  an  inter- 
nal register  is  latched  in.  then  the  data  is  transferred.  This 
causes  a  small  overhead  in  the  initialization  phase,  but  im- 
proves the  ciphering  rate  in  a  high-speed  data  ciphering  ses- 
sion. The  rate  of  500  kbyte.sec  can  be  reached  only  if  a 
high-speed  peripheral  device  is  connected  to  the  Slave  Port 
and  the  DCP  is  programmed  for  dual  port  configuration. 

The  PAL  device  is  programmed  to  allow  only  DCP  transfers  to 
the  DCP.  The  PAL  device  equations  are  shown  in  Figure  5. 
A0  must  be  odd  to  make  the  CPU  transfer  the  data  on  the  Low 
byte  of  the  data  bus.  A  "0"  on  A,  indicates  an  Address  Latch 
Cycle,  whereas  a  "1 "  on  A,  indicates  a  Data  Transfer  Cycle.'  A0 
must  be  *1"  in  both  cycles. 
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Figure  1.  AmZ8068  to  68000  Interface 


68000  to  Am  Z8068  Data  Ciphering  Processor  Interface 
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Figure  2.  68000-AmZ8068  Address  Latch  Cycle  (A1  =  Low) 


An  address  decoder  generates  the  Chip  Select  for  the  DCP. 
The  Address  Strobe  indicates  a  valid  address.  The  PAL  device 
is  only  activated  if  the  Lower  Data  Strobe  becomes  active  while 
the  Upper  Data  Strobe  stays  inactive.  This  means  that  data  is 
transferred  in  MOVE.B  instructions  with  an  odd  peripheral 
address. 

The  PAL  device  provides  two  Data  Acknowledge  outputs. 
DTACK ,  is  an  active  Low  TTL  output.  DTACK  2  has  the  same 
timing  as  DTACK,  ,  but  is  an  Open  Collector  output.  (The 
Open  Collector  output  is  realized  by  a  three-state  output  which 
has  only  two  states,  Low  or  Floating.) 

Address  Latch  Cycle 

In  this  cycle  only  a  Master  Port  Address  Strobe  (MAS)  is 
generated.  Master  Port  Chip  Select  (MCS)  is  tied  to  Low.  LDS 
is  sent  to  the  MAS  output.  The  minimum  pulse  width  of  LDS  is 
115  ns;  80  ns  are  required  for  the  AmZ8068. 


DTACK  is  activated  with  the  falling  edge  of  the  CPU  clock  after 
cycle  S2.  The  CPU  inserts  no  Wait  states.  DTACK  is  deac- 
tivated with  the  first  edge  of  CLK  after  AS  becomes  inactive. 

Data  Read  Cycle:  (Figure  3) 

The  generation  of  MDS  in  a  Data  Read  Cycle  is  similar  to  the 
Data  Write  Cycle.  Because  the  CPU  activates  LDS  one  cycle 
earlier,  there  is  no  need  for  a  Wait  State.  The  minimum  pulse 
width  of  LDS  is  240  ns;  the  DCP  requires  200  ns  for  a  Status 


Register  read.  DTACK  is  activated  using  the  same  logical  con- 
dition as  in  the  Data  Write  cycle.  Because  of  the  earlier  activa- 
tion of  LDS.  DTACK  becomes  active  earlier  and  the  CPU 
inserts  no  Wait  states. 

Data  Write  Cycle:  (Figure  4) 

A  Data  Write  Cycle  is  performed  with  A0  is  HIGH,  AS,  CS  and 
LDS  are  LOW.  The  minimum  pulse  width  of  LDS  is  not  suffi- 
cient for  the  DCP  which  requires  at  least  125  ns.  One  Wait 
state  or  a  slower  system  clock  will  satisfy  this  parameter.  In  this 
interface,  one  Wait  state  is  inserted  by  activating  DTACK  at  the 
end  of  S„. 

The  DCP  clock  is  synchronized  in  Data  Read  or  Write  Cycles 
by  forcing  it  Low  when  DTACK  becomes  active.  This  guaran- 
tees that  the  DCP  clock  has  a  falling  edge  just  before  LDS 
(MDS)  rises.  The  delay  of  the  DCP  clock  to  CLK  is  typically 
8  ns  for  a  PAL  device.  The  delay  of  LDS  to  MDS  is  typically 
1 2  ns.  The  delay  of  LDS  to  the  system  clock  is  0-70  ns  for  the 
8  MHz  version.  This  results  in  a  delay  of  4-74  ns  of  MDS  to 
the  DCP  clock.  The  DCP  requires  0-50  ns  when  operating  at 
the  maximum  clock  rate. 

This  problem  is  solved  by  stretching  the  clock  for  one  cycle. 
The  DCP  clock  stays  LOW  for  two  cycles  in  the  end  of  a 
transfer  cycle.  This  is  done  automatically  by  the  PAL  device 
(see  Figure  4). 
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68000  to  AmZ8068  Data  Ciphering  Processor  Interface 


So        Si        S2        S3        S4        S5        S6        S7        So        Si  S2 
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Figure  3.  68000-AmZ8068  Data  Read  Cycle  (A1  =  HIGH) 
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Figure  4.  68000-AmZ8068  Data  Write  Cycle  (&i  =  HIGH) 
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68000  to  AmZ8068  Data  Ciphering  Processor  Interface 


TITLE  68000 
PATTERN  DCP044 
REVISION  01 

AUTHOR       JUERGEN  STELBRINK 
COMPANY     ADVANCED  MICRO  DEVICES 
DATE  11/06/87 

CHIP  AmZ_INTFCE  PAL16R4 

CLK2     /CS       /AS       /UDS  /LDS 
/OE       /MAS     /MDS     DCPCLK  NC 


AmZ8068    ( DCP )    INTERFACE  DEVICE 


RW     Al  CLK       NC  GND 

NC     /DTACK1     CLK1     /DTACK2  VCC 


EQUATIONS 
/CLK1        =  CLK 


; INVERT  CLOCK  TO  TRIGGER  THE  REGISTERED 
; OUTPUTS  WITH  THE  FALLING  EDGE  OF  CLK 


MAS  =  AS*LDS*/UDS*/RW*/A1*CS 

MDS  =  AS*LDS*/UDS*A1*CS 

/DCPCLK    :=  DCPCLK 

+  /DTACK1*CS*AS*LDS*/UDS 
+  DTACK1*/AS*/LDS*/UDS 


DTACK1      :=  AS*LDS*/UDS*A1 *CS 
+  AS*/RW*/A1*CS 


DIVIDE  BY  TWO 

TWO  CLOCKS  LOW  IN 

THE  END  OF  A  DATA  CYCLE 

DATA  TRANSFER  CYCLE 
ADDRESS  LATCH  CYCLE 


DTACK2  =  DTACK1 
[■ACK2.TRST  =  DTACK1*AS*CS 
SIMULATION  NOT  INCLUDED 


DTf 


Figure  5.  Source  Listing  for  Example  of  Figure  1 


68000  and  Dual  Am951 6  DMA 
Controllers  Interface 


There  has  been  interest  shown  in  connecting  two  Am9516 
DMA  Controllers  to  obtain  four  channels.  The  example  here 
shows  that  such  a  system  can  be  built  by  incorporating  one 
PAL  device,  AMD's  22V10  (Figure  1).  Address  and  data 
buses  are  not  shown  as  they  are  straightforward  and  require 
no  explanation.  The  PAL  device,  designated  68K16D2,  con- 
verts the  two  DREQs  into  the  68000  three-wire  handshake, 
prioritizes  the  request,  and  converts  the  control  signals 


appropriately.  Equations  for  the  PAL  device  are  shown  in 
Figure  2. 

The  key  parameters  are:  1 )  data  hold  with  respect  to  the  rising 
edge  of  DS  during  a  write,  and  2)  DTACK  setup  time.  Control 
for  a  data  bus  transceiver  is  shown  because  it  will  be  required 
in  most  systems.  The  PAL  device  provides  these  signals  when 
the  CPU  is  bus  master;  the  Am9516  generates  these  control 
signals  directly  when  it  is  bus  master. 
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Figure  1.  Dual  Am9516  UDCs  to  68000  CPU  Interface 


68000  and  Dual  Am951 6  DMA  Controllers  Interface 


The  Am9516s  are  shown  with  independent  clocks.  The  clocks 
may  be  divided  from  the  CPU  clock  or  may  be  generated  inde- 
pendently of  the  CPU.  Because  WAIT  must  meet  the  set-up 
and  hold  times,  DTACK  may  need  synchronization  by  the  use 
of  a  flip-flop,  as  shown.  This  flip-flop  may  be  eliminated  in 
synchronous  systems. 

The  clock  is  inverted  to  the  PAL  device  to  meet  DTACK  set-up 
time  in  systems  of  10  MHz  or  faster.  This  inverter  may  be 
deleted  in  slower  systems,  if  appropriate  changes  to  the  PAL 
device  equations  are  made.  This  PAL  device  implements  fixed 
priority  between  the  Am9516s.  BREQi  is  the  highest  priority. 
Rotating  priority  can  be  implemented  by  adding  another  PAL 
device. 

EOPs  are  pulled  up  separately,  but  could  be  tied  together 
since  they  affect  a  channel  only  if  it  is  active.  The  bus-error 
function  can  be  supported  by  connecting  BERR  and  EOP. 


If  a  bus  error  occurs,  EOP  will  stop  the  current  transfer  and 
interrupt  the  CPU.  The  Interrupt  Service  routine  can  read  the 
status  to  determine  if  EOP  caused  the  interrupt  or  if  termina- 
tion was  normal.  If  EOP  caused  the  interrupt,  the  Address 
Register  can  be  read  to  determine  where  the  bus  error  oc- 
curred. After  the  problem  is  corrected,  the  CPU  can  program 
the  Am9516  to  complete  the  transfer  or  do  an  alternate  trans- 
fer, as  appropriate. 

When  operating  the  DMA  in  interleave  mode,  an  external  EOP 
should  be  gated  with  DACK  to  prevent  affecting  the  wrong 
channel.  This  is  unnecessary  if  interleave  is  not  used,  since  the 
UDC  releases  the  bus. 

This  arbiter' design  supports  both  serial  and  parallel-expansion 
techniques  and  is  therefore  compatible  with  VME  bus  protocol. 
Bus  grant  out  was  implemented  with  an  external  gate  due  to  a 
shortage  of  pins.  The  VME/BCLR  function  was  not  imple- 
mented because  the  Am9516  does  not  support  pre-emption. 
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68000  and  Dual  Am9516  DMA  Controllers  Interface 


TITLE         68  000  TO  DUAL  9516  INTERFACE 

PATTERN  PAT001 

REVISION  01 

AUTHOR       JOE  BRCICH 

COMPANY     ADVANCED  MICRO  DEVICES 

DATE  11/06/87 


CHIP  DUAL  9516  PAL22V10 


CLK  RW  AO  BREQ1  BREQ2  /BG  NC  ALE1  /BW  ALE 2  /BGACK  GND 
/CS     /LDS       /UDS     /DS     /C       /AS     /BR     BACK 2     BACK1     /TB     /RB  VCC 

EQUATIONS 

BR  =  BREQ 1  * /BGACK 
+  BREQ1 *BG 
+  BREQ2  */BGACK 
+  BREQ2*BG 

BACK1   =  BREQ1 *BG*/AS 
+  /BG* BGACK 

BACK 2  =  BREQ2*/BREQ1 *BG*/AS 
+  /BG*BGACK 


RB  =  CS*RW*UDS 
+  CS*RW*LDS 
RB.TRST  =  /BGACK 

TB  =  CS*/RW 
TB.TRST  =  /BGACK 

DS  =  AS*/C*/RW 

+  AS*RW 
DS.TRST  =  /BGACK*AS 

AS  =  ALE1 
+  ALE2 
AS.TRST  =  BGACK 


UDS  =  DS*/A0*/BW 

+  BW*DS 
UDS.TRST  =  BGACK 

LDS  =  DS*A0*/BW 

+  BW*DS 
LDS.TRST  =  BGACK 

C   :=  UDS* BGACK 
+  LDS* BGACK 
C.TRST  =  AS 


Figure  2.  Source  Listing  for  the  Example  of  Figure  1 
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m8530  to  68020  Interface 


This  design  shows  the  logic  to  interface  a  6  MHz  Am8530 
Serial  Communications  Controller  to  the  68020  CPU  (Fig- 
ure 1)  running  at  10  MHz  with  a  system  clock  cycle  time  of 
100  ns.  The  Am8530  is  a  high-performance,  dual-channel 
SCC  that  supports  data  rates  up  to  1 .5M  bps  and  a  variety  of 
communication  protocols. 

The  PAL  device  generates  the  RD  and  /WR  control  signals  for 
the  Am8530  from  the  68020  IDS  and  R/W-  control  signals.  It 
also  generates  the  clock  for  the  SCC  by  dividing  the  68020 
system  clock.  This  meets  the  165-ns  minimum  cycle  time  for 
the  6-MHz  SCC  clock. 


Also,  a  state  machine  is  implemented  to  perform  the  hand- 
shake necessary  for  byte  transfers  on  the  68020  bus.  The 
state  machine  transition  diagram  is  shown  in  Figure  2.  Nor- 
mally, DSACK[1 :0]  is  inactive.  When  this  device  is  selected,  a 
wait  state  is  inserted  and  then  data  transfer  is  acknowledged 
by  asserting  DSACK[1 :0]  for  byte  transfer.  When  the  CPU 
deasserts  the  address  strobe,  DSACK[1:0]  is  negated. 
DSACK[1 :0]  will  be  driven  by  other  devices  on  the  68020 
system  bus.  The  PAL  device  enables  DSACK[1 :0]  only  when 
the  CPU  performs  a  read  or  write  cycle  for  the  SCC;  at  other 
times  DSACK[1 :0]  lines  are  three-stated. 

Since  the  Am8530  is  a  byte-wide  peripheral,  it  is  connected  to 
the  upper  byte  of  the  32-bit  wide  68020  data  bus. 


D[24:31] 


A  BUS 


68020 
CPU 


R/W- 

DSACK[1] 
OSACk[0) 


3 


I/O 

DECODER 


o 


ms 

/AS 
IDS 
RW 
PAL16R4 

CLK 


0(0:7] 

ICE 

8530 

SCC 

/WR 

/RD 

8530CLK 

Figure  1.  Am8530  to  68020  Interface 
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Figure  2.  68020  Byte  Port  DSACK  Handshake  State  Diagram 
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Am8530  to  68020  Interface 


»  THIS  PLPL  FILE  IS  FOR  A  16R4  THAT  IMPLEMENTS  THE  LOGIC 
NECESSARY  TO  INTERFACE  AN  Am8530  (SCO  TO  A  68020  SYSTEM. 


DEVICE  Am8530_TO_68020  (PAL16R4) 


PIN 


11  THE  FOLLOWING  CODE  IMPLEMENTS  THE  STATE  MACHINE  TO 
PERFORM  DSACK  OPERATION.   IT  INSERTS  WAIT  STATES, 
ASSERTS  DSACK  FOR  BYTE  TRANSFER  AND  REMOVES  DSACK 
WHEN  AS  IS  NEGATED.  « 

IF  (  /CS  )  ENABLE  (  DSACKM:0]  ) 


CLK 

=  1 

VCC         •  20 

/AS 

=  2 

DSACKM]  =  19 

IF  (  ST  [1:0]  =  #B10  ) 

/DS 

=  3 

DSACK tO]  =  18 

THEN  DSACK[1:0]  =  #B10; 

RW 

=  4 

ST[1]      =  17 

ELSE  DSACK  [1:0]  =  #811; 

/CS 

=  5 

STCO]      =  16 

NC1 

=  6 

0  =15 

CASE  (  ST [1:0]  ) 

NC2 

=  7 

NC5         =  H 

BEGIN 

NC3 

=  8 

WR           =  13 

#800)  BEGIN 

NC4 

=  9 

RD           ■  12 

IF  (  /CS  ) 

GND 

=  10 

NC6         =  11  J 

THEN  ST [1:0]  := 

"  READ  AND  WRITE  CONTROL  SIGNALS  ARE  DERIVED  FROM 
R/W-  AND  DS- .  « 

WR  =  /(DS  *  /RW)  J 
RD  =  /(DS  *  RW)  j 

"  THE  INCOMING  CLOCK  IS  DIVIDED  BY  2  TO  GENERATE  THE 
CLOCK  FOR  THE  Am8530.  » 


ELSE  ST [1:0]  : 
END  ; 

#B01)  ST [1:0]  :=  #B10; 

#B10)  BEGIN 

IF  (  /AS  ) 
THEN  ST [1:0]  : 
ELSE  ST[1:0]  : 
END  ; 

END  ; 


#B00  ; 
#B01 ; 


#B00 
#B10; 


CASE  (  Q  ) 

BEGIN 

0)  Q  :=  1; 

1)  Q  :=  0; 
END; 


Figure  3.  PLPL  Specification  for  the  8530  to  a  68020  Interface 
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Interfacing  to  the  8088 


Overview 

The  8088  CPU  is  an  8-bit  processor  designed  around  the  8086 
internal  structure.  Most  functions  of  the  8088  are  identical  to 
the  equivalent  8086.  The  8088  fetches  and  writes  16-bit  words 
in  two  consecutive  bus  cycles.  Both  the  8086  and  the  8088 
handle  the  external  bus  the  same  way  but  the  8088  handles 
only  8-bits  at  a  time:  and  both  appear  identical  to  the  software 
engineer,  with  the  exception  of  execution  time. 

The  hardware  interface  of  the  8088  contains  the  major  differ- 
ences between  the  two  CPUs.  The  pin  assignments  are  nearly 
identical,  however,  with  the  following  functional  changes: 

A8_A1S  These  pins  are  only  address  outputs  on  the 

8088.  They  are  latched  internally  and  remain 
valid  throughout  a  bus  cycle  in  a  manner 
similar  to  the  8085  upper  address  lines. 

SS0  Provides  the  SO  status  information  in  the  mini- 

mum mode.  This  output  occurs  on  pin  34  in 
minimum  mode  only. 

DT/R  ,  IO/M ,    and  SS0  provide  the  complete  bus  status  in 
minimum  mode. 

IO/M  has  been  inverted  to  be  compatible  with  the 

MCS-85  bus  structure. 

ALE  is  delayed  by  one  clock  cycle  in  the  minimum  mode  when 
entering  HALT,  to  allow  the  status  to  be  latched  with  ALE. 


The  8088  and  AMD  Proprietary 
Peripherals 

The  evolution  of  chip  design  has  taken  the  8-bit  environment 
into  the  16-bit  environment.  While  the  new  generation  of  pe- 
ripheral devices  are  often  16  bits  wide,  the  older,  established 
8-bit  orientation  of  CPUs  and  peripherals  are  still  significant. 
Interfacing  a  16-bit  peripheral  with  an  8-bit  CPU  often  encoun- 
ters data  path  incompatibility  and  involves  bus  control 
manipulation.  This  type  of  integration  mainly  involves  separat- 
ing the  control  and  data  paths  from  the  new  peripheral  and  the 
system. 

The  ability  to  mix  different  data  path  widths  can  improve  sys- 
tem functionality,  performance,  and  cost.  It  is  less  expensive 
to  use  an  8-bit  bus  in  a  new  design  because  the  memory 
requirements  are  generally  cheaper.  A  designer  can  use  this 
data  path  mixing  to  upgrade  the  existing  system  until  a  new 
system  design  is  warranted,  or  the  designer  can  simply  im- 
prove on  the  existing  design  as  new  peripherals  become  avail- 
able. 
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088  to  Am951 6  UDC  Interface 


Figure  1  shows  the  Data  Bus  and  Control  Interface  between 
an  8088  microprocessor  and  an  Am9516.  This  interface  is 
accomplished  by  using  an  AmPAL22V10,  a  74LS161 
counter,  an  Am2952  bidirectional  I/O  port,  and  an  Am2949 
bidirectional  transceiver.  Figures  2  and  3  show  the  PAL 
device  timing  for  both  the  Bus  Master  and  the  Bus  Slave 
cases. 

In  this  design,  certain  simplifying  constraints  have  been  made. 
Word  operations  are  only  allowed  during  command  chaining 
operations  when  the  Am9516  is  Bus  Master.  During  Slave 
Write  operations,  the  first  byte  output  to  the  Am951 6  must  have 
an  odd  address,  and  the  following  second  byte  an  even  ad- 
dress. Conversely,  during  a  Slave  Read  cycle,  the  first  byte 
read  from  the  Am9515  must  be  at  an  even  address  and  the 
second  at  the  next  higher  odd  address.  Furthermore,  for  both 


slave  and  master  operations,  the  system  must  use  the 
latched  A0  (LAo)  from  the  PAL22V10  as  its  sole  A0.  That  is, 
this  LAo  must  circumvent  any  address  latching  to  memory 
and  so  is  to  be  used  directly. 

As  can  be  seen  from  the  figures,  the  AmPAL22V10  manip- 
ulates the  control  signals  to  the  transceivers  and  latches  so 
that  bytes  can  be  tunneled  into  words  during  Am9516  Com- 
mand Chaining  operations  and  Slave  Writes,  and  words  can  be 
tunneled  into  bytes  during  Am9516  Slave  Reads.  Also,  LAO  is 
alternately  toggled  in  order  to  provide  the  dual  address  (16- 
bits)  during  chaining  operations. 

Figure  4  shows  the  PLPL  equations  and  resulting  sum-of- 
products  equations  for  the  design  (PLPL  is  a  higher  level 
"C-like"  language  for  PAL  devices).  For  those  not  familiar 
with  "C"  or  Pascal,  the  intermediate  sum-of-products  form 
shows  the  equations  in  Boolean  form. 


0*  Qb  Qc 
„  74LS161 
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CLOCK 
CLEAR         <  * 
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Figure  1.  The  Am9516  UDC  to  8088  CPU  Data  Bus  and  Control  Interface 
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8088  to  Am951 6  UDC  Interface 


8088  to  Am951 6  UDC  Interface 


DEVICE  _8088_9516  (pal22V10) 

"This  is  a  PLPL  file  to  implement 
8088  and  the  16-bit  Am9516.  This 
is  programmed  for  byte  operatio 
words  during  chaining. 


an  interface  between  the  8-bit 
design  assumes  that  the  Am9516 
and  so  only  accesses  16-bit 

-James  Williamson" 


PIN 


CK 

1 

/RD 

23 

S[0:2] 

2:4 

/WR 

22 

AO 

5 

LAO 

21 

/SEL 

6 

/DS 

20 

ALE 

7 

/RW 

19 

HLDA 

8 

/WAIT 

18 

/BW 

9 

/A 

17 

READY 

10 

/B 

16 

RESET 

11 

It 

15 

/D 

14; 

BEGIN 

IF    (RESET)    THEN  ARESET ( ) ; 

"=  =  =  =  =  =  :=:=:  =  =:  —  —==============——========—  —  =  =  =  :=  =  =  =  —  —  —  z=  =  =;=:  =  =i  =  =:=:=  =  =  =  =" 

"This  section  defines  the  wiggles  when  the  Am9516  is  Bus  Master' 

IF    (HLDA)      THEN  ENABLE () ; 
IF    (/S[2]    *   HLDA)    THEN  BEGIN 
IF    (/S[l]   *   /S[0])  THEN 


ELSE 


LAO  = 


LAO  = 


/CK  *  BW  +  /BW  *  AO  *  ALE  + 
/BW     *  LAO   *  /ALE 

BW     +   /BW   *   AO   *   ALE  + 
/BW   *   LAO   *  /ALE 


END; 

IF    (HLDA)  THEN 

CASE  (S[2:0]) 
BEGIN 
1)  BEGIN 


END; 


RD 

/RW 

* 

DS 

A 

/BW 

* 

/RW  *  /CK 

WR 

/BW 

+ 

RW     *  DS 

C 

/BW 

* 

RW 

WAIT 

1 

EGIN 

RD 

/RW 

* 

DS 

B 

BW 

A 

/BW 

* 

/RW 

WR 

/BW 

* 

RW  *  DS 

C 

/BW 

RW 

WAIT 

BW 

END; 


Figure  4.  PLPL  Specification  for  8088  to  Am9516  Interface 


2-168 


£1  MonollthloftSR Memories  £1 


8088  to  Am951 6  UDC  Interface 


3)  BEGIN 

RD        =   /RW   *   DS   *  B 

B  =   BW   *  CK 

A  =   /BW  *  RD 

WR       =  /BW  *  RW  *  DS 

C         =  /BW  *  RW 

WAIT  =  BW 

END ; 

5)  BEGIN 

RD        =   /RW   *  DS 
A         =  /BW  *  /CK 
WAIT  =  BW 

END; 

6)  BEGIN 

RD       =  /RW  *  DS 
A         =  BW 

END; 

7)  BEGIN 

RD        =   /RW  *  DS 
A  =  /RD 

END; 


END; 

"This     section     defines  the  wiggles  when  the  8088  is  Bus  Master" 


BEGIN 


LAO   =  AO   *   ALE   *   SEL  + 
LAO  *   /ALE   *  SEL 
B       =  LAO  *  WR  *  SEL 
A       =   /LAO   *  WR  *  SEL 
DS     =  A  + 

/LAO  *  RD  *  SEL 
C  =  /LAO  *  RD  *  SEL 
D       =  LAO  *   RD  *  SEL 


END; 


END. 


Figure  4.  PLPL  Specification  for  8088  to  Am9516  Interface  (Continued) 
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80186  to  Am9516  Universal  DMA 
Controller  Interface 


The  addition  of  the  Am9516  to  an  80186  design  is  a  natural 
choice  in  systems  requiring  four  channels  of  DMA.  Figure  1 
shows  the  interface  between  the  801 86  and  the  Am951 6  with 
a  PAL  device.  PCS5  is  programmed  to  provide  a  latched  A, 
signal. 


This  interface  accomplishes  two  major  control  transformations. 
First,  it  converts  R D  and  WR  into  R/W  and  DS  when  the  801 86 
is  Bus  Master,  and  vice  versa  when  the  Am9516  is  Bus  Master. 
Secondly,  the  transceiver  control  signals,  TBEN  and  RBEN, 
are  generated  from  DEN  and  DT/R.  This  example  shows  only 
one  possible  configuration.  Other  configurations  can  be  made 
as  dictated  by  system  requirements.  A  PAL  device  is  used  here 
to  reduce  board  space. 


16  MHz 

HDh 


PCS5 

PCSo 
HOLD 
HLDA 


80186 


DT/fi 
BEN 
ALE 


RTJ 
WR 


CLKOUT 
AD15-AD0 


I 


6 


DT 
/DEN 
ALE  P 


PAL16L8 


ADo 

(RD 

/WR 
ALE 


/TBEN 
/RBEN 
ALED 
/SEL 
Ao 
BW 
IDS 
/RW 


CP  Q 

74LS74 
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IE 


D  Q 
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CP 


Vcc  Vcc 
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P/D 
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C5 
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DS 

R/W 
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Figure  1.  Am9516  to  80186  Interface 
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80186  to  Am9516  Universal  DMA  Controller  Interface 


TITLE         Am9516  TO  80186  INTERFACE 

PATTERN  PAT001 

REVISION  01 

AUTHOR       JOE  ENGINEER 

COMPANY     ADVANCED  MICRO  DEVICES 

DATE  11/06/87 

CHIP  DMA_INT  PAL16L8 

NC  ALED  ALEP  HLDA  BW  ADO  DT  /DEN  /SEL  GND 
NC  /RBEN  /RD  ALE  AO  /RW  /DS  /WR  /TBEN  VCC 

EQUATIONS 

DS  =  RD  +  WR 
DS.TRST  =  /HLDA 

RW  =  DT 

RW.TRST  =  /HLDA 

TBEN  =  /DT   *  /SEL  *  DEN 
TBEN.TRST  =  /HLDA 

RBEN  =  DT  *  /SEL  *  DEN 
RBEN.TRST  =  /HLDA 

RD  =  /RW  *  DS 
RD.TRST  =  HLDA 

WR  =  RW  *  DS 
WR.TRST  =  HLDA 

ALE  =  /ALEP  *  /ALED 

AO   =  /ADO    *   /BW  *   HLDA  *  ALED 

+  ADO   *  BW  *  HLDA  *  ALED 

+  /ADO   *  /HLDA  *  ALEP 

+  AO   *  /ALEP 

+  AO   *  /ALED 

;  DESCRIPTION 

;  THIS  PAL  DEVICE  CONVERTS  THE  CONTROL  SIGNALS  TO  INTERFACE  THE  80186 
;   TO  THE  Am9516  DMA  CONTROLLER. 

;    SIMULATION  NOT  INCLUDED 


Figure  2.  Source  Listing  for  Example  of  Figure  1 
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Introduction 

Commercial  and  industrial  microprocessor-based  systems  typi- 
cally consist  of  a  processor  interfaced  with  many  peripherals 
which  randomly  require  service.  To  fully  utilize  the  processor's 
computing  potential,  sources  of  hardware  interrupts  must  be 
used  to  free  the  processor  from  software  routines.  The  68000 
16-bit  microprocessor  has  256  different  exception-processing 
vectors  which  point  to  predetermined  locations  in  the  program 
memory  space.  There  are  1 92  external  user  interrupt  vectors  and 
seven  autovector  interrupts.  Table  1  shows  the  exception  vector 
assignments.  The  interrupt  structure  of  the  68000  will  be  dis- 
cussed in  more  detail  along  with  two  methods  of  designing  an 


N 

Vector 
umber(s) 

Address 

Assignment 

Dec 

Hex 

Space 

0 

0 

000 

SP 

Reset:  Initial  SSP 

4 

004 

SP 

Reset:  Initial  PC 

2 

8 

008 

SD 

Bus  Error 

3 

12 

OOC 

SD 

Address  Error 

4 

16 

010 

SD 

Illegal  Instruction 

5 

20 

014 

SD 

Zero  Divide 

6 

24 

018 

SD 

CHK  Instruction 

7 

28 

01c 

SD 

TRAPV  Instruction 

8 

32 

020 

SD 

Privilege  Violation 

9 

36 

024 

SD 

Trace 

10 

40 

028 

SD 

Line  1010  Emulator 

1  - 

44 

02C 

SD 

Line  1111  Emulator 

12" 

48 

030 

SD 

(Unassigned,  reserved) 

13* 

52 

034 

SD 

(Unassigned.  reserved) 

14* 

56 

038 

SD 

(Unassigned,  reserved) 

15 

60 

03C 

SD 

Uninitialized  Interrupt  Vector 

16-23" 

64 

04C 

SD 

(Unassigned.  reserved) 

95 

05F 

24 

96 

060 

SD 

Spurious  Interrupt 

25 

100 

064 

SD 

Level  1  Interrupt  Autovector 

26 

104 

068 

SD 

Level  2  Interrupt  Autovector 

27 

108 

06C 

SD 

Level  3  Interrupt  Autovector 

28 

112 

070 

SD 

Level  4  Interrupt  Autovector 

29 

116 

074 

SD 

Level  5  Interrupt  Autovector 

30 

120 

078 

SD 

Level  6  Interrupt  Autovector 

31 

124 

07C 

SD 

Level  7  Interrupt  Autovector 

32-47 

128 

080 

SD 

TRAP  Instruction  Vectors 

191 

OBF 

48-63* 

192 

occ 

SD 

(Unassigned,  reserved) 

255 

OFF 

34-255 

256 

100 

SD 

User  Interrupt  Vectors 

1023 

3FF 

*  Vector  numbers  12. 1 3, 1 4. 16  through  23  and  48  through  63  are  reserved  for 
future  enhancements.  No  user  peripheral  devices  should  be  assigned  these 
numbers. 

Table  1.  Exception  Vector  Assignment 


interrupt  controller  using  PAL  (Programmable  Array  Logic) 
devices. 

68000  Exception  Processing  and 
Pin  Description 

The  interrupt  structure  of  the  68000  can  grant  up  to  256  types  of 
interrupt  requests.  These  interrupts,  or  exceptions,  are  generated 
either  by  external  or  internal  causes  and  are  serviced  by  directing 
the  flow  of  program  to  an  exception  processing  routine.  Table  1 
lists  these  exceptions  and  their  respective  address  in  memory. 
Exception  vectors  are  locations  in  memory  where  the  processor 
fetches  the  address  of  a  routine  or  subprogram  which  wi  1 1  service 
that  exception.  The  exception  vector  is  either  internally  or  exter- 
nally generated  depending  on  the  cause  of  the  interrupt.  The 
externally  generated  exceptions  are  interrupts  which  are  placed 
on  the  interrupt  control  pins  (IPL2,  IPL1  and  IPLO),  bus  errors, 
and  reset  requests.  The  interrupts  placed  on  control  pins  IPL2- 
I PLO  are  requests  from  peripheral  devices.  The  internally-gener- 
ated exceptions  come  from  instructions,  address  errors,  or 
tracing.  These  different  types  of  exceptions  and  their  priorities 
are  shown  in  Table  2.  We  will  focus  on  group  1  interrupt 
exceptions. 


Group 

Exception 

Processing 

0 

Reset 
Bus  Error 
Address  Error 

Exception  processing  begins 
within   two  clock  cycles. 

1 

Trace 
Interrupt 
Illegal  Privilege 

Exception  processing  begins 
before  the  next  instruction. 

2 

TRAP.  TRAPV. 
CHK, 
Zero  Divide 

Exception  processing  is  started 
by  normal  instruction  execution. 

Table  2.  Exception  Grouping  and  Priority 


The  pinout  of  the  68000  is  shown  in  Figure  1.  The  three 
interrupt  control  pins  (IPL2,  IPL1,  IPLO)  are  asynchronous 
active  low  inputs  which  indicate  the  encoded  priority  level  of 
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the  device  requesting  an  interrupt.  The  least  significant  bit  is 
IPLO  and  the  most  significant  bit  is  IPL2.  Level  seven,  (IPL2, 
IPL1.  IPLO)  =  (000).  has  the  highest  priority,  while  level  zero. 
(111).  indicates  that  no  interrupts  are  requested.  All  lower  or 
equal  priority  interrupts  are  masked  during  the  interrupt 
service  routine  of  the  present  interrupting  device.  There  are 
two  ways  by  which  a  peripheral  device  can  interrupt  the 
normal  flow  of  program:  autovectoring  or  external  vector 
number  generation.  The  function  code  pins  (FC2.  FC1,  and 
FCO)  all  become  high  during  an  interrupt  acknowledge  cycle. 
The  interrupt  acknowledge  cycle  always  follows  an  interrupt 
request  only  after  the  present  instruction  cycle  is  completed. 
Thus  interrupt  acknowlege  cycles  come  in  between  the 
instruction  cycles  and  no  information  is  lost. 

The  signals  which  are  used  during  an  external  interrupt 
request  are  listed  below  with  a  description  of  their  behavior 
and  function. 

ADDRESS  BUS:  the  24-bit  address  bus  holds  the  address  of  the 
data  to  be  accessed.  During  an  interrupt  acknowledge  cycle  the 
lower  three  bits  (A3,  A2,  A1 )  hold  the  encoded  level  of  the  inter- 
rupt being  serviced.  If  a  level  5  interrupt  is  being  serviced  then 
(A3,  A2,  A1)  will  be  (101)  respectively. 

DATA  BUS:  The  lower  eight  bits  of  this  16-bit  data  bus  must 
contain  the  vector  number  during  a  user  interrupt  acknowledge 
cycle.  If  an  autovector  routine  is  in  process  then  the  data  bus  is 
ignored. 

AS:  The  processor  asserts  address  strobe  anytime  there  is  valid 
data  on  the  address  bus.  It  remains  asserted  for  as  long  as  the 
address  is  valid. 

R/W:  This  signal  defines  the  data  bus  transfer  as  a  read  or  a  write 
cycle.  During  an  interrupt  acknowledge  cycle  the  processor  is  in 
a  read  mode. 

UDS,  LDS:  The  upper  and  lower  data  strobes  are  asserted  when 
the  processor  is  in  a  read  or  write  instruction  cycle.  Upper  strobe 
enables  the  most  significant  byte  of  data  while  the  lower  strobe 

INTERRUPT  ACKNOWLEDGE  SEQUENCE 
FLOW  CHART 

PROCESSOR  INTERRUPTING  DEVICE 


enables  the  least  significant. 


DTACK:  Data  transfer  acknowledge  is  an  externally-generated 
signal  which  tells  the  processor  that  valid  data  is  present  on  the 
data  bus.  If  DTACK  is  not  asserted  before  the  falling  edge  of  S4 
(S4  is  the  fourth  CPU  clock  state  in  a  seven-state  instruction  cycle) 
then  wait  states  are  introduced. 

IPL2-IPL0:  The  three  interrupt  control  pins  are  inputs  which 
contain  the  encoded  priority  level  of  the  external  interrupting 
device.  Note  that  these  are  active  low  pins  where  (000)  is  level 
seven  encoded. 

FC2-FC0:  Function  code  output  pins  indicate  the  processor's 
status.  When  they  are  all  high  the  processor  is  in  an  interrupt 
acknowledge  cycle. 

E,  VMA,  VPA:  Enable,  valid  memory  address,  and  valid  peripheral 
address  are  the  standard  6800  family  signals.  VPA  is  also  used 
when  autovectoring. 

The  following  two  design  examples  use  PAL  units  as  a  simple 
and  cost-effective  interrupt  controller  interface  to  the  68000.  The 
first  introduces  external  vector  generation  while  the  second 
shows  how  autovectoring  can  be  done  on  the  68000. 


Prioritized  Individually ' 
Interrupt  (Method  1 ) 

As  shown  in  the  interrupt  acknowledge  sequence  flow  chart  and 
timing  diagram  (Figure  2),  a  peripheral  device  must  interrupt  the 
processor  by  placing  the  encoded  level  of  priority  on  the 
interrupt  control  pins  (IPL2-IPL0).  The  processor  then  grants 
the  interrupt  after  the  completion  of  the  current  instruction  cycle. 
The  encoded  priority  level  of  the  interrupt  is  placed  on  address 
bus  bits  A3-A1  during  the  interrupt  acknowledge  cycle.  The 
status  code  lines  (FC2-FC0)  becomehigh,  indicating  an  interrupt 
acknowledge  cycle.  Once  the  lower  data  strobe  (LDS)  is  asserted 
the  external  device  must  place  an  8-bit  vector  on  the  least  signif- 
icant byte  of  the  data  bus.  This  data  is  latched  in  state  S7. 

INTERRUPT  ACKNOWLEDGE  SEQUENCE 
TIMING  DIAGRAM 


REQUEST  INTERRUPT 


GRANT  INTERRUPT 

1)  COMPARE  INTERRUPT  LEVEL  IN  STATUS  REGISTER 
AND  WAIT  FOR  CURRENT  INSTRUCTION  TO  COMPLETE 

2)  PLACE  JNTERRUPT  LEVEL  ON  A1,  A2,  A3 

3)  SET  R/W  TO  READ 

4)  SET  FUNCTION  CODE  TO  INTERRUPT  ACKNOWLEDGE 

5)  ASSERT  ADDRESS  STROBE  (AS)  

6)  ASSERT  LOWER  DATA  STROBE  (LDS) 


<  J-ljnJTJ-ljnJlTJTJTJTJ^^ 


A4-A23~_X 


 1 

PROVIDE  VECTOR  NUMBER 

1)  PLACE  VECTOR  NUMBER  ON  D0-D7 

2)  ASSERT  DATA  TRANSFER  ACKNOWLEDGE  (DTACK) 


ACQUIRE  VECTOR  NUMBER 

1)  LATCH  VECTOR  NUMBER 

2)  NEGATE  LDS 

3)  NEGATE  AS 

I  


 » 

RELEASE 
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1)  NEGATE  DTACK 
 I 
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Figure  2. 
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As  shown  on  Table  1  vectors  64-255  are  assigned  to  the  user 
interrupt  vector  numbers.  These  vectors  must  be  generated 
externally  and  placed  on  the  data  bus  during  an  interrupt 
acknowledge  cycle.  In  our  interrupt  controller  we  arbitrarily 
choose  vectors  249-255  as  the  vectors  assigned  to  the  inter- 
rupting peripheral  devices.  Table  3a  shows  this  assignment 
and  how  the  vector  can  be  decoded  from  the  address 
bits  A1-A3. 


DTACK 


Priority 
level 


Device  # 


1  (low  priority) 

2 
3 
4 
5 
6 

7  (high  priority) 


Vector 
assigned 


Decimal  # 


249 
250 
251 
252 
253 
254 
255 


Data 


D7  D6  D5  D4  D3  D2  D1  DO 


1  1 

1  1 

1  1 

1  1 
1 

1  1 


0 
0 
0 

1 

1  1 


11111 


A3A2A1 


0  0  1 


0  1 

0  1 

1  0 
1  0 

1  1 

1  1 


Table  3a. 

The  two  PAL  devices  used  on  this  design  example  generate  the 
interrupt  vectors  and  all  the  necessary  control  signals.  The  var- 
ious signals  and  their  implementation  on  the  PAL  devices  are 
explained  below. 

INT7-INT0:  Any  of  the  seven  peripheral  devices  can 
request  an  interrupt  by  asserting  one  of  these  inputs.  The 
interrupt  must  remain  asserted  until  acknowledged  by  the 
CPU. 

FC2-FC0  and  AS:  Function  code  or  processor  status  code 
become  logical  high  during  an  interrupt  acknowledge  cycle 
Address  strobe  is  asserted  anytime  valid  address  is  on  the 
bus.  DTACK  and  Data  output  control  are  decoded  from  these 
four  outputs  of  the  68000. 

A1-A3:  The  three  least  significant  bits  of  the  address  bus 
contain  the  encoded  level  of  the  interrupting  device.  These 
signals  are  used  in  generating  the  vector  number  which  is 
to  be  put  on  the  data  bus. 

RESET:  Reset  is  an  input  which  clears  all  outputs,  and  is 
used  for  system  initialization. 

IPL2-IPL0:  These  PAL  device  outputs  are  active  low  synchro- 
nous signals  which  interface  directly  to  the  CPU.  They  represent 
the  encoded  level  of  the  highest  priority  interrupt  that  is 
requested.  Table  3b  shows  the  truth  table  of  our  priority  encoder 
implemented  on  a  PAL  device. 


Interrupt  request  input  from  peripheral 

Encoded  int.  level 

1 

NT7  INT6 

INT5 

INT4 

INT3 

INT2 

INT1 

IPL2 

IPL1  IPLO 

0  X 

X 

X 

X 

X 

X 

0 

0  0 

1  0 

X 

X 

X 

X 

X 

0 

0  1 

1  1 

0 

X 

X 

X 

X 

0 

1  0 

1  1 

1 

0 

X 

X 

X 

0 

1  1 

1  1 

1 

1 

0 

X 

X 

1 

0  0 

1  1 

1 

1 

1 

0 

X 

1 

0  1 

1  1 

1 

1 

1 

1 

0 

1 

1  0 

1  1 

1 

1 

1 

1 

1 

1 

1  1 

Table  3b. 


DTACK:  Data  transfer  acknowledge  must  be  asserted  by  outside 
circuitry  during  a  data  transfer  operation.  The  logic  diagram 
shown  below  illustrates  how  DTACK  is  derived  from  address 
strobe  and  processor  status  signals. 


D7-D0:  The  three  least  significant  bits  of  the  data  output  can  be 
decoded  straight  from  the  address  bits  A3,  A2  and  A1.  That  is 
D2=A3,  D1  =A2  and  D0=A1 .  The  other  five  bits  of  data  can  be  held 
high  with  pull-up  resistors.  Outputs  of  the  three  data  bits  become 
enabled  by  using  the  same  scheme  which  enables  the  DTACK 
output. 


INTACK7-INTACK1:  Only  one  of  these  signals  will  be 
asserted  during  the  interrupt  acknowledge  cycle.  This  signal 
feeds  back  into  the  interrupting  device  to  tell  that  device 
that  its  interrupt  has  been  acknowledged.  We  can  use  the 
3-bit  addresses  to  decode  these  signals  as  shown  in  Table  3c. 


A3 

A2 

A1 

7 

6 

5 

INTACK 
4  3 

2  1 

0 

0 

0 

1 

1 

1  1 

1  1 

0 

0 

1 

1 

1 

1  1 

1  0 

0 

1 

0 

1 

1 

1  1 

0  1 

0 

1 

1 

1 

1 

1  0 

1  1 

1 

0 

0 

1 

1 

0  1 

1  1 

1 

0 

1 

1 

0 

1  1 

1  1 

1 

1 

0 

0 

1 

1  1 

1  1 

1 

1 

1 

0 

1 

1 

1  1 

1  1 

Table  3c. 


The  logic  diagram  of  the  controller  is  shown  in  Figure  3.  The 
control  ler  can  operate  without  any  wait  states  at  the  fastest  CPU 
clock  rate  of  12.5  MHz  as  shown  in  the  timing  diagram  of  Figure  4. 


Auto-vectored  Interrupts  (Method  2) 

The  seven  autovector  interrupts  are  assigned  vector  numbers  25 
through  31.  Interrupts  are  requested  by  placing  the  encoded 
level  of  the  request  on  the  interrupt  control  pins  (IPL2-IPL0).  The 
processor  responds  to  this  request  by  placing  the  requested  level 
in  the  processor's  status  register  and  by  inhibiting  all  requests  of 
lower  priority.  When  the  current  instruction  cycle  is  completed 
an  interrupt  acknowledge  cycle  takes  place.  If  Valid  Peripheral 
Address  (VPA),  is  asserted  before  the  falling  edge  of  S4  then  an 
autovector  routine  takes  place.  The  data  or  the  vector  number  is 
generated  internally  depending  on  the  priority  level  of  the  inter- 
rupt request;  the  vector  assigned  is  shown  on  the  table  at  the 
beginning  of  this  report.  The  autovector  timing  diagram  and  a 
very  simple  and  practical  interrupt  controller  implemented  on  a 
PAL  device  is  shown  in  Figure  5.  The  PAL  design  specifications 
are  included  in  the  appendix.  Note  that  VPA  is  generated_by 
enabling  the  PAL  output  when  FC2-FC1  are  high  and  AS  is 
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Interrupt  Controller  Logic  Diagram 
External  Vector  Generation 


AS> 


FC2> 
FC1> 


FC0> 


PAL20L10 


Figure ; 


SO         S1         S2         S3         S4         S5         S6  S7 


-*  12  *\ 

13 

u  h- 

WRITE  MACHINE  CYCLE 


INTERRUPT  ACKNOWLEDGE  CYCLE 


1  CLOCK  PERIOD  (80  ns) 

2  CLOCK  LOW  TO  ADDRESS  (55  ns) 

3  CLOCK  HIGH  TO  ADDRESS  Hi-Z  (60  ns) 
i  CLOCK  HIGH  TO  AS  LOW  (55  ns) 

5  CLOCK  LOW  TO  AS_HIGH  (50  ns) 

6  CLOCK  HIGH  TOJ3S  LOW  (55  ns) 

7  CLOCK  LOW  TO  DS  HIGH  (50  ns) 


S  CLOCK  HIGH  TO  R/W  HIGH  (60  ns) 
9  CLOCK  HIGH  TO  R/W  LOW  (60  ns) 

10  CLOCK  HIGH  TO  FC  VALID  (55  ns) 

11  CLOCK  HIGH  TO  DATA  Hl-Z  (60  ns) 

12  AS  HIGH  TO  DTACK  HIGH  (70  ns) 

13  PAL  CLOCK  TO  OUT  (25  ns) 

14  MINIMUM  SETUP  TIME  (20  ns) 


15  DATA  VALID  TO  55  LOW  (15  ns) 

16  CLOCK  HIGH  TO  AS,  51  LOW  (55  ns) 

17  PAL  INPUT  TO  Hl-Z  (25  ns) 

18  PAL  INPUT  TO  Hl-Z  (25  ns) 

19  PAL  INPUT  TO  Hl-Z  (25  ns) 


Figure  4. 
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AUTOVECTOR  OPERATION  TIMING  DIAGRAM 
SO  S2  S4  S6  SO  S2  S4  w    w    w    w   w   w   w    w   w    w   S6  SO  S2 


A1-A3)-( 


PAL16R4 


INT7> 
iNT6> 


'"""NORMAL*"'"*'  AUTOVECTOR  OPERATION  *^ 


ypT  (TIE  TO  PULLUP 
RESISTOR) 


Figure  5. 


Conclusions 

We  have  just  seen  how  to  implement  an  interrupt  controller 
circuit  using  one  or  two  PAL  devices.  This  circuit  can  operate  at 
the  maximum  operating  frequency  of  the  68000  which  is  12.5 
MHz.  Since  the  most  critical  timing  of  the  circuit  is  the  PAL 
device  unit's  input  to  Hi-Z,  we  see  that  we  can  operate  the  circuit 
with  a  wide  spectrum  of  frequencies  and  with  slower  PAL  devi- 
ces. To  guarantee  operation,  timing  spec.  No.  16  and  No.  19  on 
Figure  4  must  add  up  to  assert  DTACK  20  ns  prior  to  falling  edge 
of  S4.  Thus  55  ns  +  (PAL  device  input  delay  to  Hi-Z)  should  be 
less  than,  or  equal  to,  100  ns  for  a  12.5-MHz  clock.  We  see  that 
this  qualifies  fast,  regular,  half-power  and  quarter-power  PAL 
devices  for  this  circuit. 
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TITLE  INTERRUPT  C0NTR0LLER1 

PATTERN  INTA.DAT 
REVISION  01 

AUTHOR       DANESH  TAVANA 

COMPANY     MONOLITHIC  MEMORIES,  INC. 

DATE  11/06/87 


CHIP   INTA  PAL16R4 


CLK     /INT7  /INT6     /INT5  /INT4  /INT3  /INT2  /INT1     /AS  GND 
/OC     FCO       FC1         NC         /IPLO  /IPL1  /IPL2  /DTACK  FC2  VCC 


EQUATIONS 


DTACK  =  VCC 

DTACK. TRST  =  FC2*FC1*FC0*AS 


; ASSERT  IF  OUTPUT  ENABLE 


IPL2 


IPL1 


IPLO 


:=  INT7 
+  /INT7*  INT6 
+  /INT7*/INT6*  INT5 
+  /INT7*/INT6*/INT5*  INT4 

:=  INT7 
+  /INT7*  INT6 

+  /INT7*/INT6*/INT5*/INT4*  INT3 

+  /INT7*/INT6*/INT5*/INT4*/INT3*  INT2 


; PRIORITY  ENCODED  BIT 


:=  INT7 
+  /INT7*  INT6 

+  /INT7*/INT6*   INT5*/INT4*  INT3 

+  /INT7*/INT6*/INT5*/INT4*/INT3*/INT2*  INT1 


; PRIORITY  ENCODED  BIT 


.•PRIORITY  ENCODED  BIT 


;    SIMULATION  NOT  INCLUDED 
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TITLE  INTERRUPT  CONTROLLER 2 

PATTERN      INTC . DAT 
REVISION  01 

AUTHOR       DANESH  TAVANA 

COMPANY     MONOLITHIC  MEMORIES,  INC. 

DATE  11/06/87 


CHIP   INTC  PAL20L10 


/AS         A3         A2  Al  FC2  FC1  FCO  NC         NC  NC  NC  GND 

NC     /INTACK1  /INTACK2  /INTACK3  /INTACK4  /INTACK5  /INTACK6  /INTACK7  DO 
Dl  D2  VCC 

EQUATIONS 

/D2   =  /A3 

/D2.TRST  =  FC2*FC1*FC0*AS 

/Dl   =  /A2 

/Dl.TRST  =  FC2*FC1*FC0*AS 

/DO   =  /Al 

/DO.TRST  =  FC2*FC1*FC0*AS 

INTACK7   =  A3*  A2*  Al 

INTACK7 . TRST  =  FC2*FC1*FC0*AS 

INTACK6   =  A3*  A2*/A1 

INTACK6 . TRST  =  FC2*FC1*FC0*AS 


INTACK5  =  A3*/A2*  Al 

INTACK5 . TRST  =  FC2*FC1*FC0*AS 

INTACK4  =  A3*/A2*/A1 

INTACK4 . TRST  =  FC2*FC1*FC0*AS 


INTACK3   =/A3*   A2*  Al 

INTACK3 . TRST  =  FC2*FC1*FC0*AS 


INTACK2  =/A3*  A2*/A1 

INTACK2 . TRST  =  FC2*FC1*FC0*AS 


INTACK1  =/A3*/A2*  Al 

INTACK1 . TRST  =  FC2*FC1*FC0*AS 

SIMULATION  NOT  INCLUDED 
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Memories  are  an  integral  component  of  any  computer  system. 
PROMs  (Programmable  Read  Only  Memory),  SRAMs  (Static 
Random  Access  Memory),  and  DRAMs  (Dynamic  Random 
Access  Memory)  are  used  most  often  to  store  programs  and  data. 
PROMs  are  used  to  store  programs  which  can  only  be  read  by  the 
computer.  The  SRAMs  and  DRAMs  are  used  for  reading  and 
writing  data  and  programs.  Figure  1  shows  a  system  processor 
interfaced  to  PROMs,  SRAMs  and  DRAMs. 

DRAMs  are  cheaper  bulk  memory,  and  are  used  instead  of 
SRAMs  in  many  systems.  DRAMs  are  available  in  speeds 
ranging  from  70  ns  to  250  ns  access  times,  enough  to  keep  up 
with  the  new  generation  processor  speeds.  This  performance  is 
also  sufficient  to  match  the  usually  high  speed  SRAMs  in  many 
existing  and  new  designs.  DRAM  densities  have  also  progressed 
from  64K-bit  in  1983  to  1  M-bit  in  1986,  and  4M-bit  and  16M-bit 
DRAMs  are  being  developed.  DRAMs  are  also  available  in  space 
efficient  SIP,  SIMM  and  ZIP  (Zig-zag  Inline  Package)  packages 
with  modules  that  can  carry  up  to  eight  or  nine  DRAM  devices.  In 
this  discussion  we  will  focus  on  the  design  of  PROM,  SRAM  and 
DRAM  control  circuitry  using  PLDs. 


Large  memory  requirements  result  in  the  use  of  many  DRAMs 
which  cannot  be  conveniently  located  on  the  same  board  with  the 
CPU.  DRAM  devices  are  placed  on  separate  boards  and  con- 
nected to  the  processor  board  via  a  bus.  Off-board,  DRAM 
devices  mandate  the  use  of  specialized  outputs  to  drive  the 
address  and  control  traces  of  the  DRAM  array.  In  systems  with 
many  off-board  DRAMs  the  soft  error  rate  is  high  and  sometimes 
calls  for  the  use  of  error  detection  and  correction  circuits.  A 
detailed  discussion  of  EDC  (Error  Detection  and  Correction) 
circuit  design  with  PLDs  is  also  included  here. 

The  proximity  of  the  DRAMs  to  the  processing  units  depends  on 
the  hierarchy  of  the  DRAM  memory  in  the  system.  This  hierarchy 
depends  heavily  on  the  speed  of  the  DRAM  system  versus  the 
processor  speed,  and  on  the  required  addressing  space.  When 
the  processor  speed  can  be  accommodated  by  the  DRAM 
devices,  it  is  efficient  to  place  a  DRAM  subsystem  as  close  to  the 
processor  as  possible  and  to  use  it  as  the  immediate  storage 
space.  When  DRAMs  are  not  fast  enough  for  the  processor,  they 
must  be  used  as  remote,  main-storage,  while  static-RAM-based 
caches  are  used  in  the  processor's  immediate  proximity.  A 
review  of  such  cache  systems  design  using  PLDs  is  also  included 
here. 
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Figure  1 .  Processor  Interface  to  PROMs,  SRAMs  and  DRAMs 
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PROM  and  SRAM  Control 

The  control  of  static  RAM  or  PROM  devices  is  very  simple.  Both 
of  these  devices  provide  achip  select  input  signal  which  activates 
the  devices  to  read  or  write  data.  The  chip  select  control  of  these 
devices  is  enabled  by  the  decoded  processor  address  signals. 
The  address  bits  selected  depend  upon  the  range  of  addresses 
covered  by  each  SRAM  or  PROM.  More  detail  on  address 
decoders  can  be  found  on  page  2-35. 

To  optimize  the  number  of  product  terms  used,  the  address  range 
is  generally  selected  so  that  the  boundary  addresses  are  power- 
of-two  numbers.  From  page  2-53,  for  an  address  decoder  we 
have  the  following  equations: 

For  an  n-bit  address  X  in  the  range  from  B+1  to  A-1  (A>X>B) 

A=  0010000-00  (a  power-of-two  number) 
n  p  1 

B=  0000011—11  (a  power-of-two  number  minus  one) 
n      q  1 

CS  -  /Xn  *  /Xn-1  *  *  /Xp  * 

(Xp-1  +  Xp-2  +  +  Xq) 

Typical  SRAMs  also  include  a  read/write  input  signal  which 
selects  between  a  read  and  a  write  operation.  This  signal  is 
usually  directly  connected  to  the  processor  read/write  signal. 
PROMs  are  "read  only",  and  thus  do  not  require  this  signal. 

PROM  chip  select  signals  should  only  be  asserted  for  read 
cycles.  Attempting  a  write  to  a  PROM  can  cause  bus  contention 
problems  where  the  memory  and  processor  data  signals  attempt 
to  drive  the  same  bus  lines  simultaneously.  This  can  be  easily 
avoided  by  ensuring  that  the  chip  selects  are  not  asserted  for  a 
processor  write  cycle.  The  PROM  chip  select  signals  are  gated 
with  the  read  signal  (R)  to  insure  a  chip  select  only  when  R  is  low. 

/CS_PROM  -   (X)    *   /R  ; (X  is  a 

combination  of  address  signals) 

Most  of  the  "second  generation"  SRAMs  have  output  enable  (OE) 
and  write  enable  (WE)  pins  in  addition  to  the  chip  select  (CS)  pins. 
A  separate  and  independent  OE  line  eliminates  the  data  bus 
contention  problem  entirely.  The  OE  line  is  disabled  before 
writing  to  the  SRAM,  allowing  the  data  pins  to  be  driven  by  the 
external  bus.  The  OE  line  is  enabled  only  during  the  read  cycles. 
Most  microprocessors  provide  a  data  signal  which  can  be  directly 
connected  to  the  OE  lines,  providing  the  proper  timing  to  avoid 
bus  contention. 
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Some  memories  so  not  provide  a  separate  OE  pin  (Figure  2). 
Generating  CS  for  these  memories  can  cause  serious  bus 
contention  problems.  Often  this  problem  is  solved  by  gating  the 
CS  with  read  (R)  or  write  (W)  signals  as  follows: 

/cs  =  (X)  *  /R 
+  (X)   *  /w 

/WE  =  /W 

This  approach  allows  usually  sufficient  time  for  other  buffers  on 
the  data  bus  to  be  disabled.  This  works  well  for  the  read  cycles 
in  which  the  memory  is  expected  to  be  the  only  bus  driver. 
However,  for  the  write  cycles  where  the  processor  is  usually  the 
data  bus  driver,  this  does  not  quite  solve  the  problem  completely. 

In  the  above  equation,  the  CS  is  generated  either  from  the  read 
or  the  write  signal.  If  the  CS  line  is  generated  before  the  WE  is 
asserted  low,  the  memory  will  drive  the  data  lines,  which  are 
already  being  driven  by  the  processor,  causing  bus  contention. 
There  is  still  no  way  to  ensure  that  CS  is  generated  only  after  the 
WE  signal  to  the  memory  is  asserted.  Some  delay  can  be  added 
by  controlling  the  CS  line  from  the  WE  signal,  by  gating  the  W 
signal  with  the  WE  signal  (for  CS)  (Figure  3)  as  follows: 
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Figure  3.  Waveform  for  SRAM  Control  Without  OE 

/CS  -    (X)    *   /R  +    (X)    *   /W  *  /WE 

This  ensures  that  the  WE  signal  of  the  memory  is  low  before  the 
CS  is  asserted.  This  solves  only  part  of  the  problem.  On  the  other 
edge,  WE  should  be  removed  only  after  CS  is  removed.  This  can 
be  accomplished  by  controlling  WE  from  the  CS  by  gating  CS 
along  with  the  WE  signal: 

/WE  =  /W  *  /CS 

This  ensures  the  assertion  of  WE  as  long  as  the  CS  is  asserted 
and  W  is  low.  As  soon  as  the  CS  is  unasserted,  the  WE  of  the 
memory  will  be  unasserted.  These  signals  CS  and  WE  used  for 
controlling  each  other  are  simply  the  feedback  from  the  appropri- 
ate I/O  pins.  The  feedback  path  provides  the  extra  delay. 


Figure  2.  Controlling  an  SRAM  Without  OE 
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Access  Time  Considerations 

PROMs  and  SRAMs  with  data  access  time  which  match  the 
processor  speeds  are  commercially  available.  Sometimes, 
however,  there  is  a  speed  mismatch.  Systems  with  different 
types  of  memories  encounter  this  problem  the  most.  For  such 
systems,  different  memories  generate  their  own  data  available 
(DAV)  and  write  complete  signals  and  inform  the  processor  of 
cycle  completion.  This  allows  processors  to  prolong  their  cycles 
and  add  wait  states,  to  match  the  memory  speeds.  These  signals 
are  generated  by  using  PAL  devices  which  can  generate  timing 
delays.  Registered  PAL  devices  are  ideal  candidates  for  such 
timers.  The  application  note  on  page  2-184  illustrates  one  such 
memory  design  where  a  PAL  device  generates  the  memory 
control  timing  signals.  This  design  also  implements  the  memory- 
to-processor  interface  handshake  logic. 

DRAM  Systems 

In  any  system  that  uses  dynamic  RAMs,  special  circuitry  in  the 
address  and  control  lines  must  be  introduced  to  drive  the  memory 
chips  and  link  them  to  the  CPU  (see  Figure  4).  Acircuit  known  as 
a  DRAM  controller  performs  several  functions: 

•  Multiplexes  the  row  and  column  addresses 

•  Provides  access  timing 

•  Provides  refresh  timing  and  addresses 

•  Arbitrates  between  access  and  refresh 

•  Drives  the  DRAMs'  capacitive  load  inputs 
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Figure  4.  Functions  of  a  DRAM  Controller 


Multiplexing  processor  addresses  into  row  and  column  ad- 
dresses is  required  because  the  DRAM  architecture  divides  the 
addresses  into  rows  and  columns.  This  reduces  the  number  of 
pins  required  for  addressing  the  DRAMs  and  consequently  the 
package  size.  The  external  DRAM  controller  multiplexes  the 
addresses  and  also  generates  the  RAS  (Row  Address  Strobe) 
and  CAS  (Column  Address  Strobe)  signals  to  the  DRAMs  to 
distinguish  between  these  two  addresses.  Every  DRAM  access, 
for  reading  or  writing  data,  requires  a  predetermined  timing 
sequence  of  RAS,  CAS  and  WE  (Write  Enable)  signals.  A  DRAM 


controller's  main  task  is  to  generate  these  read  and  write  cycle 
timings.  In  addition  to  the  regular  read  and  write  cycles,  many 
DRAMs  are  available  which  provide  higher  data  bandwidth  read 
and  write  cycle.  Such  nibble  mode,  ripple  mode  or  page  mode 
cycles  require  different  sequences  of  RAS  and  CAS  signals. 
Depending  upon  the  data  requirements,  these  schemes  are  used 
for  high  speed  data  throughput  in  various  systems. 

DRAMs  also  require  intermittent  refresh  cycles  to  charge  the 
capacitors  for  every  row  of  data.  This  refresh  is  provided  by  a 
specific  sequence  of  RAS  and  CAS  signals.  There  are  three 
different  refresh  cycles,  one  of  which  is  selected  according  to  the 
system  requirements.  These  are: 

•  Regular  RAS  only  refresh  cycle 

•  CAS  before  RAS  refresh  cycle 

•  Hidden  refresh  cycle. 

The  regular  RAS-only  refresh  cycle  requires  an  external  refresh 
address  generator  which  sequences  through  all  the  row  ad- 
dresses of  a  DRAM.  The  CAS  before  RAS  refresh  cycle  is  only 
provided  by  certain  DRAMs.  These  DRAMs  are  usually  costlier, 
since  they  require  internal  circuitry  for  row  address  generation. 
The  hidden  refresh  cycle  DRAMs  retain  the  last  read/write  cycle 
data  at  the  outputs  during  the  refresh  cycles,  which  is  helpful  in 
certain  designs. 

In  addition  to  the  type  of  refresh  cycle,  the  frequency  of  refresh 
cycles  also  varies  in  different  systems.  Some  systems  provide 
intermittent  refresh  when  refresh  cycles  are  executed  at  fixed 
time  intervals.  The  DRAM  accesses  are  sometimes  delayed 
pending  an  intermittent  refresh  cycle.  With  burst  mode  refresh, 
a  number  of  refresh  cycles  are  completed  simultaneously,  allow- 
ing the  processor  an  uninterrupted  access  for  read  and  write 
cycles  later.  Time  critical  systems  usually  find  burst  mode  more 
useful. 

DRAM  Controllers  and  PLDs 

Many  DRAM  systems  make  use  of  a  VLSI  DRAM  controller. 
Most  VLSI  DRAM  controllers  perform  the  DRAM  control  func- 
tions which  do  not  change  greatly  form  one  system  to  another. 
Excellent  representatives  of  this  approach  are  the  2964,  2968 
and  the  673103  Dynamic  RAM  Controller/Drivers  (Figure  5). 

Most  VLSI  DRAM  controllers  do  not  support  all  of  the  various 
types  of  read/write  cycles,  refresh  cycles  and  refresh  timings. 
Custom  DRAM  controllers  are  required  forthe  following  reasons: 

•  Higher  throughput  bandwidth 

•  Different  memory  access  times 

•  Different  refresh  cycles  and  mechanisms 

•  Processor  related  requirements. 

Depending  upon  the  specific  system  requirements,  DRAM  con- 
trollers can  be  made  from  PLDs  which  support  the  specific  cycles 
required  by  a  design. 

Fast  registered  PLDs  are  used  as  DRAM  controllers  in  many 
designs.  Functions  of  a  DRAM  controller  are  usually  split  into  a 
number  of  PLDs.  The  application  note  on  page  2-187  shows  a 
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Figure  5.  A  VLSI  DRAM  Controller 
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DRAM  controller  for  a  68020  microprocessor  which  implements 
a  custom  data  sizing  and  alignment  function  required  by  the 
microprocessor. 

VLSI  DRAM  controllers  use  PALdevices  for  a  variety  of  functions. 
These  functions  depend  upon  the  capabilities  of  individual  VLSI 
controllers.  Three  detailed  application  notes  are  included  in  the 
subsequent  sections  which  illustrate  the  interface  of  these  VLSI 
DRAM  controllers  to  the  various  microprocessors.  The  applica- 
tion note  on  page  2-202  shows  an  AMD  8088  microprocessor 
interfaced  to  a  2968  DRAM  controller  using  a  PAL1 6R4.  The  ap- 
plication note  on  page  2-210  shows  a  68000  microprocessor 
interface  tothe  2968.  The  interface  between  the  DRAM  controller 
and  the  CPU  accommodates  the  specific  access  protocols  and 
speeds  of  the  CPU  as  well  as  the  access  and  refresh  timing  of  the 
dynamic  RAMs.  This  interface  between  the  controller  and  the 
rest  of  the  system  is  best  implemented  using  fast  programmable 
logic  devices  such  as  the  PAL16R8D  family  devices. 

Error  Detection  and  Correction 

Compared  to  PROMs  and  SRAMs,  the  physical  dimensions, 
signal  levels,  and  the  stored  charge  of  the  DRAM  cells  are  greatly 
reduced  to  allow  higher  density.  As  the  stored  charge  in  the 
DRAM  cells  decreases,  the  device  is  more  susceptible  to  soft 
errors  caused  by  alpha  particles  as  well  as  by  environmental 
noise. 

Soft  errors  are  temporary,  random,  and  they  can  be  corrected  by 
rewriting  into  the  erroneous  cell.  Therefore,  if  the  system  has  the 
ability  to  locate  the  bit  in  error,  the  soft  error  can  be  corrected.  One 
method  to  locate  the  bit  in  error  is  to  append  a  fixed  number  of 
check-bits  to  the  data  word  and  store  these  check-bits  along  with 
data.  This  increases  the  data  width  of  the  DRAM  system.  Upon 
reading,  both  the  data  and  check-bits  are  read  into  the  EDAC. 
The  check-bits  are  regenerated  from  the  read  data,  and  these 
newly  generated  check-bits  are  compared  with  the  read  check- 
bits.  The  comparison  produces  the  syndrome  bits.  The  syn- 
drome bits  constitute  a  binary  number  that  points  tothe  bit  in  error. 

This  encoding  scheme  of  generating  the  check-bits  and  the 
syndrome  bits  is  called  a  Hamming  code,  after  its  inventor.  The 
modified  Hamming  code  has  one  more  check-bit  than  the  original 
Hamming  code,  but  it  can  detect  all  double-bit  errors  as  well  as 
detect  and  correct  all  single-bit  errors.  In  addition  it  can  detect  a 
substantial  number  of  multiple-bit  errors.  The  modified  Hamming 


code  has  proved  particularly  useful  for  its  relatively  low  overhead 
in  today's  wide-data-word  systems  and  its  capability  todetect  and 
correct  single-bit  errors  and  detect  all  double-bit  errors. 

A  number  of  XOR  logic  functions  are  required  for  generating 
Hamming  codes.  PALdevices  which  provide  XOR  gates  are  ideal 
candidates  for  such  designs.  Details  of  Hamming  codes  and  a 
sample  eight-bit  error  detection  and  correction  application  note 
which  uses  the  PAL16X4  is  included  on  page  2-229. 

Cache  Memory  Systems 

Fast  access  time  SRAM  caches  are  used  for  very  high-speed 
processors.  These  allow  high  system  performance  while  retain- 
ing the  low  cost  of  DRAM  bulk  memory.  Based  upon  how  the 
SRAM  cache  addresses  map  to  the  DRAM  addresses,  three 
major  cache  schemes  are  used. 

For  fully  associative  caches,  data  in  a  DRAM  location  may  be  in 
any  cache  location.  For  direct  mapped  caches,  data  may  be  in 
one  of  the  many  locations  determined  by  higher  order  address 
bits.  For  set  associative  caches,  data  may  be  in  one  particular 
location  of  every  set.  Depending  upon  the  performance  required 
and  othersuch  system  constraints  such  as  cost,  board  space  etc. 
one  of  thethree  schemes  is  used.  Details  of  cache  system  design 
are  included  in  the  application  note  on  page  2-239. 

A  major  task  in  cache  memory  systems  is  replacement,  which 
requires  transferring  data  between  the  cache  and  the  DRAMs. 
This  is  done  in  an  effort  to  keep  the  cache  filled  with  only  those 
blocks  of  memory  which  are  in  use  by  a  program.  The  replace- 
ment task  requires  a  controller  with  intelligence,  which  can  read 
cache  data  and  write  to  the  DRAMs  and  vice  versa.  This  requires 
a  state-machine-based  controller.  Since  most  cache  designs  are 
custom  designs,  due  to  various  system  constraints,  limited  VLSI 
solutions  are  available.  State-machine-based  controllers  such 
as  the  Am29PL141  and  the  PMS14R21  can  perform  these 
functions  very  effectively. 

Other  uses  of  PLDs  in  cache  memory  systems  include  general 
glue  logic  which  is  required  for  controlling  SRAM  caches  or  tag 
buffers.  Such  PLDs  are  usually  very  high-speed  combinatorial 
PLDs.  Monolithic  Memories  also  provides  a  number  of  ECL 
PLDs  which  can  be  used  in  ECL  cache  memory  systems.  These 
ECL  PAL  devices  allow  6  ns  access  times. 
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A  typical  control  logic  problem  is  the  memory-to-processor 
handshake  on  memory  transfer  used  in  many  computer  ar- 
chitectures. The  processor  makes  a  transfer  request  by  activat- 
ing a  request  line  (REQ)  and  specifies  a  read  or  write  operation 
on  a  Read/Write  line  (R/W). 

During  a  read  operation,  the  processor  waits  for  a  Data  Avail- 
able signal  at  which  time  the  data  bus  is  sampled  and  the  re- 
quest line  lowered,  thus  completing  the  cycle.  During  a  write 
operation,  the  processor  places  data  on  the  bus  and  waits  for  a 
Write  Complete  signal  after  the  write  cycle  is  finished.  Upon 


write  complete,  the  request  line  is  lowered,  hence  completing 
the  cycle.  Figure  2  shows  the  state  assignments  and  the  appro- 
priate outputs.  The  state  diagram  is  shown  in  Figure  1 .  Also  the 
handshaking  operation  is  illustrated  in  the  timing  diagram  of 
Figure  3. 

The  memory-board  logic  to  implement  this  function  may  be  de- 
signed with  gates  and  edge-triggered  flip-flops  as  shown  in  Fig- 
ure 4.  This  particular  design  would  require  about  five  SSI/MSI 
packages,  but  the  same  design  can  be  implemented  by  a  single 
PAL16RP6.  The  PAL  device  design  specification  using  state 
equations  is  shown  on  page  2-186. 
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Figure  1.  State  Diagram— Memory  Handshake  Logic 
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PAL  Device  Design  Specification 
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Logic  Symbol 


->  WRITE1  +  C_l  ->  READ1  +->  WAIT 

->  WAIT  +->  READ 2 

->  WAIT  +->  READ 3 

->  WAIT  +  C_5  ->  WAIT  +->  READ3 
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Customize  a  DRAM  Controller 
Using  Advanced  PAL  Devices 


Ever  increasingly  dense  DRAMs  and  their  interface  to  the  emerg- 
ing sophisticated  32-bit  microprocessors  provide  an  increasing 
number  of  system  options  to  the  designer.  In  order  to  utilize  these 
advanced  DRAM  features  and  to  satisfy  the  individual  system  de- 
sign requirements,  custom  DRAM  controllers  are  required.  PAL 
device  sequencers,  such  as  the  PAL23S8,  provide  effective  con- 
trol sequencing  power,  flexibility,  customizability  and  ease  of  use. 
They  offer  a  very  desirable  alternative  to  the  dedicated  function- 
ality of  VLSI  DRAM  controllers. 

The  following  megabit  DRAM  controller  design  based  on  the 
PAL23S8  sequencer  is  illustrative  of  a  design  which  is  easily 
customized  to  a  system  requirement.  It  supports  features  such  as 
data  bus  sizing  and  data  alignment  which  are  required  for  the 
emerging  32-bit  microprocessors,  but  which  are  notyet  easily  and 
adequately  supported  by  the  VLSI  controllers. 

A  DRAM  controller  needs  to  support  at  least  the  following  func- 
tions: 

•  Flexible  Refresh  Generation 

•  Flexible  Arbitration 

•  Processor  Handshake 

•  Processor  Cycle  Timing  and  Control 

•  Refresh  Cycle  Timing  and  Control 

Flexible  Refresh  Generation 
Mechanisms 

For  refresh  generation,  different  system  designs  may  require 
either  burst  mode,  intermittent  mode  or  both  kinds  of  refresh.  In 
burst  mode,  refresh  requests  occur  once  very  4  milliseconds  to 
meet  the  dynamic  RAM's  speed.  For  a  megabit  RAM,  which 
needs  512  cycles  for  a  complete  refresh,  all  512  cycles  will  be 
performed  consecutively.  The  major  disadvantage  of  the  Burst 
refresh  approach  is  that  memory  is  "out  of  service"for  a  long  time. 
The  intermittent  refresh  scheme  minimizes  the  out  of  service  time 
by  stealing  single  cycles  between  processor  cycles.  This  cycle 
stealing  is  performed  by  a  pre-defined  arbitration  mechanism  for 
prioritization  of  different  cycle  requests.  This  design  incorporates 
the  arbitration  in  a  manner  which  allows  both  intermittent  and 
burst  mode  refreshes. 

Flexible  Arbitration  Scheme 

The  arbitration  in  most  designs  involves  prioritizing  memory 
accesses  for:  read  cycles,  write  cycles,  read-modify-write  cycles, 
and  refresh  cycles. 

The  DRAM  controller  must  allow  both  asynchronous  memory  and 
refresh  cycle  requests.  It  should  decide  whether  a  memory  cycle 
is  an  accesscycle  or  a  refresh  cycle.  When  refresh  and  processor 
accesses  are  requested  simultaneously,  the  refresh  is  given 
priority.  If  a  refresh  cycle  is  in  progress  and  a  memory  cycle 


request  is  made,  then  the  memory  access  is  inhibited  until  the 
refresh  is  complete.  The  memory  request  in  this  case  is  kept 
pending  by  asserting  a  processor  wait  signal. 

For  a  typical  intermittent  refresh  scheme,  the  cycle  requests  are 
derived  from  the  oscillator,  which  operates  asynchronously  with 
respect  to  the  system  clock.  The  arbiter  grants  a  refresh  request 
when  no  other  memory  cycle  is  either  occurring  or  pending.  If  a 
memory  cycle  is  in  progress,  the  arbiter  inhibits  the  refresh  cycle 
until  the  current  cycle  is  completed.  However,  it  needs  to 
remember  that  a  refresh  request  was  made  and  grant  that  request 
on  completion  of  the  processor  cycle.  The  PAL23S8's  buried 
registers  are  very  useful  in  performing  such  arbitration  functions, 
even  between  asynchronous  events.  These  registers  can  also  be 
used  for  remembering  refresh  cycle  requests  and  initiating  ac- 
tions later  for  execution  of  the  refresh  cycles. 

However,  some  designs  may  require  additional  functions,  such 
as  prioritizing  different  processor  requests.  Some  may  require  a 
specific  kind  of  handshake  mechanism  between  the  processor 
and  the  DRAM  controller.  All  such  functions  can  be  performed  in 
the  PAL  sequencer  by  using  more  of  chip  resources  such  as  I/O 
pins,  buried  registers  and  product  terms.  On  the  other  hand  some 
systems  may  not  require  functions  such  as  read-modify-write 
cycles,  in  which  case  the  PAL  sequencer  resources  can  be  saved 
and  used  for  other  system  functions. 

Flexible  Handshake  Protocol 

The  PAL23S8  sequencer  can  provide  a  customizable  interface  to 
the  system  processorwithout  requiring  additional  parts.  The  logic 
required  for  interface  protocols  for  the  APX  family  or  the  68000 
family  microprocessors  can  be  very  easily  built  using  the  internal 
resources  of  the  PAL23S8.  The  design  example  below  is  for 
68020  processor  interface  signals. 

Processor  Cycle  Execution 

Different  DRAMs  can  have  different  processor  cycles.  Some  just 
support  the  basic  read/write  cycles,  whereas  others  support 
extended  functions  such  as  page  mode,  ripple  mode  or  static 
column  mode  cycles.  Some  DRAMs  also  support  additional 
functions  for  extended  applications  such  as  the  transfer  cycle  in 
VRAMs  (DRAMs  optimized  for  video  applications).  PAL  devices 
allow  user-definable  functionality  to  be  implemented  in  the  de- 
sign, which  optimizes  resource  utilization  and  improves  system 
performance.  Another  advantage  offered  by  such  designs  is  the 
customizability  of  the  design  to  the  DRAM  timing  requirements, 
which  can  vary  between  -12,  -15,  and  -20  parts  significantly.  All 
these  timing  functions  can  be  easily  implemented  and  modified 
using  buried  state  registers. 

Our  design  implements  the  basic  read,  write,  and  read-modify- 
write  cycles  required  in  most  designs.  The  timing  required  has 
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been  calculated  for  a  -12  memory  part.  The  processor  cycle 
execution  involves  the  following  functions: 


Address  multiplexing 
Timing  and  control  signal  generation 

Am29841  ten-bit  buffers  are  used  for  address  multiplexing.  The 
multiplexing  of  the  row  or  column  address  is  under  the  control  of 
the  DRAM  controller. 

Generation  of  the  timing  and  control  signals  RAS,  CAS,  WE  and 
DACK  is  also  the  core  function  of  the  controller.  The  controller 
multiplexes  the  address  and  produces  RAS,  CAS,  WE  in  a 
sequence  that  is  understandable  by  the  DRAMs.  It  also  gener- 
ates a  handshake  signal  "DACK'forthe  CPU,  to  indicate  whether 
or  not  the  memory  is  ready  to  be  accessed. 

Refresh  Cycle  Execution 


DRAMs  support  different  types  of  refresh  cycles.  Some  DRAMs 
support  CAS  before  RAS  refresh.  These  DRAMs  have  the 
refresh  address  counter  embedded  along  with  the  memory,  thus 
eliminating  the  need  for  an  external  counter.  Some  designs  may 
require  "hidden  refresh",  or  the  standard  RAS  only  refresh. 
Depending  upon  the  system  requirements,  the  designer  can 
optimize  the  design  and  resource  utilization.  For  this  example, 
the  PALdevice  has  been  designed  to  execute  a  RAS  only  refresh. 
However,  it  can  be  easily  modified  to  support  CAS-before-RAS 
refresh,  or  hidden  refresh  also,  if  needed. 

Megabit  DRAM  Controller  with  Data 
Alignment  and  Dynamic  Bus  Sizing 

In  many  cases  the  currently  available  DRAM  controllers  do  not 
support  the  esoteric  requirements  of  different  system  designs. 
The  designer  is  left  to  make  a  custom  DRAM  controller.  The 
PAL23S8,  with  its  embedded  extra  buried  state  registers,  pro- 
vides an  excellent  tool  for  designing  such  a  flexible,  fast  DRAM 
controller.  It  also  offers  the  right  speed  for  interfacing  to  the 
emerging  8  and  16  MHz  processors,  allowing  superior  synchro- 
nous designs  with  few  or  no  wait  states. 

Some  of  the  emerging  32-bit  microprocessors,  such  as  the  68020 
and  the  80386,  support  the  concept  of  dynamic  bus  sizing  and 
mis-aligned  data  transfers.  Presently,  there  is  no  DRAM  control- 
ler which  utilizes  the  advantages  offered  by  these  two  features 
effectively. 

Dynamic  Data  Bus  Sizing 

Dynamic  bus  sizing  allows  the  processor  to  handle  variable  width 
data  buses  (8, 1 6  or  32  bits)  on  a  cycle-by-cycle  basis.  Dynamic 
bus  sizing  makes  the  size  of  the  R/W  port  or  channel  transparent 
to  the  software  designers. 

The  68020  allows  operand  transfers  to  or  from  8,  16,  and  32-bit 
ports  by  dynamically  determining  the  port  size  during  each  bus 
cycle.  Similarly,  80386  allows  operand  transfers  to  or  from  1 6  and 
32-bit  ports  by  determining  the  port  size  during  each  bus  cycle. 
During  an  operand  transfer,  the  slave  device  signals  its  port  size 
and  transfer  status  (complete  or  not)  to  the  processor  using  the 


DSACK  inputs  for  the  68020,  and  the  Ready  &  BS16  signals  for 
the  80386. 


DSACK1 

DSACK0 

H 

H 

Insert  Wait  States 

H 

L 

Complete  Cycle 

Data  bus  port  size  8  bits 

L 

H 

Complete  Cycle 

Data  bus  port  size  16  bits 

L 

L 

Complete  Cycle 

Data  bus  port  size  32  bits 

The  design  implemented  shows  a  68020  interface,  but  can  be 
easily  modified  to  accommodate  the  80386.  The  68020  always 
attempts  to  transfer  the  maximum  possible  amount  of  data  (32 
bits)  in  a  single  cycle.  If  the  port  responds  with  a  size  smallerthan 
32  bits,  the  processor  repeats  the  cycles  with  the  remaining  bytes 
of  data.  For  each  cycle,  the  68020  informs  the  slave  of  the  number 
of  bytes  it  is  attempting  to  transfer  by  two  signals  SIZ0,  SIZ1 ,  as 
follows: 


SIZ0 

SIZ1 

SIZE 

0 

1 

Byte 

1 

0 

Word 

1 

1 

3  Bytes 

0 

0 

Long  Word 

Due  to  various  considerations,  memories  with  different  data  width 
can  exist  on  the  same  system.  For  example,  it  is  often  preferable 
to  have  smaller  external  bus  sizes,  while  retaining  a  32-bit  internal 
memory  size  on  a  board.  Such  instances  are  common  when 
using  DMA  in  microprocessor  based  systems.  Varied  memory 
widths  are  also  required  in  graphics  and  image  processing 
systems.  PALdevices  allow  customizable  dynamic  memory  data 
bus  sizing  based  on  various  system  requirements — such  as 
address  range  (address  signals)  or  jumper  selectable  static 
control  for  increasing  memory  width. 

Mis-Aligned  Transfers 

Dynamic  sizing  is  used  in  conjunction  with  address  bits  AO  and  A1 
to  also  support  mis-aligned  data  transfers:  for  example,  a  word 
write  to  a  long  word  data  bus  size  with  an  address  offset  of  1  byte 
(A0/A1=01).  The  68020  and  the  80386  contain  internal  bus 
multiplexers  to  route  the  data  to  the  correct  data  lines.  Both  of 
these  also  generate  extra  cycles  necessary  to  complete  a  mis- 
aligned transfer.  However,  external  circuitry  is  still  needed  to 
generate  the  correct  write  enables  and  chip  selects  to  support 
both  these  functions.  PAL  devices,  however,  can  be  customized 
to  support  both  these  functions  easily. 
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AO 
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0 

0  Bytes 

0 

(  1 
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1 
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1 
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By  decoding  AO  and  A1 ,  the  two  transfer  data  size  signals  SIZEO, 
SIZE1 ,  and  the  size  of  the  port  being  addressed,  PALdevices  can 
easily  support  dynamic  bus  sizing  and  generate  the  appropriate 
chip  select  signals  for  the  four  banks  of  memory. 

The  memory  organization  selected  (Figure  1 )  allows  access  to 
individual  data  bytes  under  the  control  of  the  CAS  signal.  The  four 
banks  of  memory  are  addressed  by  the  four  RAS  signals. 


The  DRAM  Controller 

The  DRAM  controller  consists  of  two  PAL23S8s:  the  first  is  called 
the  Timing  and  Arbitration  PAL  device,  and  the  second  device  is 
called  the  Data  Sizing  and  Alignment  PAL  device  (Figure  2). 


rra  CAS1        CAS2         CAS3      EACH  BANK  OF 

1  MX32  BITS 


20  BIT  ROW  AND 
COL  ADDRESS  TO 
ALL  FOUR  BANKS 


D31-D24     D23-D16    D15-D8  D7-D0 


Figure  1 .  Memory  Organization 
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Figure  2.  A  Megabit  DRAM  Controller 
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The  Timing  and  Arbitration  Controller 

The  timing  and  arbitration  PAL  device  performs  the  following 

functions: 

Arbitration  between  current  cycle,  Refresh  cycle  &  CPU  cycle 
Read  and  Write  Cycle  Execution 
Read-Modify-Write  Cycle  Execution 
Refresh  Cycle  Execution 
Interface  Signaling 

The  timing  and  arbitration  PAL  device  arbitrates  between  the 
processor  request  MREQ  signal  and  the  refresh  request  RFCLK, 
giving  the  highest  priority  first  to  the  current  cycle,  next  to  the 
refresh  request  and  finally  to  the  processor  cycle.  This  allows 
support  for  both  intermittent  refresh  and  burst  mode  refresh.  It 
also  uses  a  mechanism  to  store  the  refresh  requests  when  the 
processor  cycle  is  in  progress  for  servicing  later. 

This  device  also  generates  the  appropriate  ROWADD  and 
COLADD  signals  to  multiplex  the  row  and  column  addresses  onto 
the  memory  address  bus  respectively.  During  refresh  cycles  it 
asserts  the  RFSH  signal  enabling  the  refresh  row  address  onto 
the  DRAMs. 

This  PAL  device  executes  the  read  and  write  cycles  as  well  as 
readimodify-write  cycles.  The  selection  between  these  two  is 
under  the  control  of  the  processor  signal  RMC.  The  timing  of  the 
different  cycles  is  under  the  control  of  internal  registers. 

The  timing  requirements  of  the  different  cycles  are  shown  in 
Figure  3.  The  operation  of  the  state  machine  to  perform  all  these 
funct  ons  is  also  shown  in  Figure  4.  This  state  diagram  has  been 
derived  directly  from  the  timing  diagrams  and  the  arbitration 
requirements  of  the  design.  This  state  machine  is  implemented 
by  five  of  the  six  available  buried  state  registers  on  the  device,  and 
is  used  mainly  for  timing  and  arbitration  functions.  The  sixth 
buried  state  register  implements  a  flag  function  and  is  repre- 
sented as  a  small  independent  state  machine,  as  described 
below.  The  state  description  (operation)  has  been  written  in 
CUPL  software  which  generates  the  JEDEC  file  directly  for 
programming  the  device. 

Arbitration  Between  Current  Cycles,  Refresh  Cycle 
&  CPU  Cycle 

The  request  for  a  refresh  is  received  by  signal  line  RFCLK.  The 
CPU  requests  a  cycle  by  asserting  MREQ  line.  If  a  cycle  is  in 
progress  as  indicated  by  states  other  than  state  0,  no  arbitration 
for  a  fresh  cycle  is  performed.  In  such  a  case,  the  PALdevice  just 
continues  to  execute  the  current  cycle.  This  effectively  implies 
that  the  current  cycle  is  given  the  most  priority  and  no  other  cycle 
is  initiated  until  its  completion. 

When  the  current  cycle  is  complete  (indicated  by  state  0)  the  PAL 
device  arbitrates  between  the  MREQ  signal  and  the  buried 
register  B5  flag.  This  buried  register  flag  is  used  to  latch  the 
request  for  refresh  (RFCLK)  which  is  usually  only  one  clock  cycle 
duration.  The  operation  of  this  flag  is  controlled  by  a  second  small 
state  machine  (two  states  only)  in  the  same  device.  When  RFCLK 
is  high  this  state  machine  toggles  to  state  1  (B5=1 )  and  stays  in 
that  state  till  the  refresh  cycle  is  executed  (indicated  by  RFSH 
signal  low),  and  then  it  toggles  back  to  state  0  (B5=0)  preparing 
itself  for  the  next  refresh  request  RFCLK. 


For  the  first  state  machine,  the  arbitration  begins  on  state  0.  If  B5 
is  high,  a  refresh  cycle  is  given  priority  and  the  state  machine 
jumps  to  state  1 6  to  execute  a  refresh  cycle.  If  on  the  other  hand 
there  is  no  refresh  request  pending  (indicated  by  the  B5  flag  being 
equal  to  0)  and  MREQ  is  asserted,  a  processor  cycle  is  started  by 
jumping  to  state  1.  If  none  of  these  conditions  exists  the  state 
machine  stays  in  state  0,  polling  for  any  of  these  events  to  occur. 
Such  a  scheme  can  support  both  burst  mode  and  the  intermittent 
refresh. 

Processor  Read  Write  Cycle  Execution 

When  the  processor  cycle  wins  the  arbitration,  indicated  by  state 
1 ,  the  timing  and  control  signals  required  for  the  processor  cycle 
are  initiated. 

There  are  four  memory  banks  in  the  design,  each  with  a  capacity 
of  4  Mbytes  of  memory  organized  as  1 M  X  32  bits  (Figure  1 ).  The 
Timing  &  Arbitration  PAL  device  generates  the  4  RAS0-RAS3 
signals,  one  for  each  memory  bank.  The  selection  of  one  of  these 
four  RAS  signals  depends  upon  the  two  input  signals  ADSEL0 
and  ADSEL1 .  In  most  systems  these  would  constitute  the  high 
bits  of  the  system  processor  address  bus. 

After  assertion  of  the  appropriate  RAS  signal  the  state  machine 
jumps  to  state  2  where  it  manipulates  its  address  handling 
signals.  In  state  2  it  removes  the  ROWADD  signal  and  asserts  the 
COLADD  signal  to  allow  assertion  of  the  column  address  to  the 
memory.  At  the  same  time  it  asserts  another  signal,  SIG,  which 
is  used  to  inform  the  Data  Sizing  and  Alignment  PAL  device  of  a 
processor  cycle  request,  and  allows  the  Data  Sizing  and 
Alignment  PALdevice  to  generate  the  appropriate  CAS  signal  to 
the  memory. 

The  state  machine  then  sequences  to  state  4,  allowing  for  the 
data  access  time.  In  state  4  it  decides  between  a  read  write  cycle 
or  a  read-modify-write  cycle,  based  on  the  processor  signal  RMC; 
it  jumps  either  to  state  5  for  a  read  or  write  cycle  or  to  state  8  (for 
read-modify-write  cycle.  For  read  or  write  cycle  it  then  removes 
the  RAS  signals  and  waits  for  another  clock  period  for  the  RAS 
precharge  time  before  completing  the  cycle.  Simultaneously  it 
also  removes  COLADD  signal  and  asserts  ROWADD  signal 
preparing  the  memory  for  the  next  processor  cycle.  The  SIG 
signal  to  the  data  sizing  and  alignment  PAL  device  is  also 
removed.  The  DSACK  signal  (acknowledge  to  the  processor) 
and  the  WE  signals  (for  a  write  cycle)  are  generated  if  and  when 
required  by  the  Data  Sizing  and  Alignment  PALdevice. 

Processor  Read-Modify-Write  Cycle  Execution 

When  the  processor  requests  a  read-modify-write  cycle  by  as- 
serting the  RMC  signal,  the  Timing  and  Arbitration  PAL  device 
automatically  increases  the  length  of  the  cycle.  It  keeps  the  RAS 
asserted  for  the  appropriate  duration  of  a  read-modify-write 
cycle.  At  the  same  time  the  Data  Sizing  and  Alignment  PALdevice 
generates  the  appropriate  WE  and  DSACK  signals.  After  this 
extended  RAS  assertion  the  Timing  and  Arbitration  PAL  device 
completes  the  cycle,  similar  to  a  read  or  write  cycle. 

Refresh  Cycle  Execution 

When  the  arbitration  is  won  by  the  refresh  cycle,  the  timing  and 
arbitration  control  PALdevice  executes  a  RAS-only  refresh  cycle. 


Customize  a  DRAM  Controller  Using  Advanced  PAL  Devices 


1 


1 


WRITE  CYL 


I 


1 


1 


™  ru^n^u\JUTJUirLJUinjui_r^n_m^j^ru 


MREQ   ~~\  f 

ROWADD   | 

RAS  | 
COLADD  | 
CAS 
DSACK 
WE 
RFSH 
SIG 


a  r 


a  r 


l  r 


i  r 


i  r 


Figure  3.  The  Timing  Diagram 
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On  state  16  it  removes  the  ROWADD  signal  and  asserts  the 
RFSH  signal  which  applies  the  refresh  row  address  to  the 
memory  address  bus.  In  state  17  it  generates  all  four 
RAS0-RAS3  signals,  refreshing  all  the  banks  of  the  memory.  It 
then  counts  for  a  few  states  (depending  upon  the  refresh  timing 
requirements  of  the  memory)  before  removing  the  RAS  and 
RFSH  signals.  On  removal  of  the  RFSH  signal  the  external  10- 
bit  refresh  row  address  counter  (PAL22V1 0)  is  also  incremented 
for  the  next  row. 

Interface  Signaling 

One  additional  function  performed  by  this  PAL  device  is  to  inform 
the  Data  Sizing  and  Alignment  PALdevice  of  the  execution  timing 
reference  of  a  processor  cycle.  This  it  does  by  asserting  /SIG 
signal  low  for  all  processor  cycles.  This  signal  is  basically  used 
as  a  synchronizing  signal  between  these  two  PAL  devices  doing 
two  independent  functions. 

Data  Sizing  and  Alignment  PAL  Device 

The  Data  Sizing  and  Alignment  PALdevice  performs  the  following 
functions: 

1 .  CAS  generation  for  dynamic  sizing  &  alignment. 

2.  DSACK  generation  for  dynamic  sizing  &  alignment. 

3.  WE  generation  for  write  and  read-modify-write  cycles. 


Its  state  diagram  is  shown  in  Figure  5.  This  state  machine  is 
synchronized  to  the  operation  of  the  state  machine  of  the  Timing 
and  Arbitration  PALdevice  by  using  signal  SIG.  This  PALdevice 
has  nofunction  for  a  refresh  cycle.  For  a  processor  cycle  it  asserts 
the  appropriate  CAS0-CAS3  and  WE  signals  and  generates  the 
cycle  complete  signals  DSACK0-DSACK1.  The  state  machine 
functions  are  implemented  in  the  four  buried  state  registers  on 
this  device. 

CAS  Generation 

The  state  machine  starts  at  the  initial  polling  state  0,  and  remains 
there  until  it  receivesthe  SIG  signal  from  the  timing  and  arbitration 
PAL  device  indicating  the  start  of  a  processor  cycle.  The  maxi- 
mum allowable  data  width  forthe  68020  processor  is  32  bits.  The 
data  bus  is  split  into  four  byte  wide  segments.  The  CAS  signals 
for  these  four  byte  segments  are  independently  controlled,  allow- 
ing independent  byte-wide  read  and  write  capability.  These  four 
CAS  signals  allow  read  and  write  for  low-low,  low-middle,  high- 
middle  and  high-high  bytes  of  a  32-bit  data  port.  For  a  1 6-bit  data 
port  size,  CAS  2-3  provide  the  low  and  high  byte  strobes, 
respectively.  For  an  8-bit  port,  only  CAS3  is  used. 

In  state  1  the  CAS  signals  are  asserted.  The  CAS  signals 
asserted  depend  upon  the  PORT  size  available,  the  size  of  data 
being  transferred,  andthealignmentofthetransfer.  Asexplained 
before,  the  attempted  transfer  data  size  is  encoded  on  two  SIZO, 


Figure  5.  State  Diagram  of  Data  Sizing  and  Alignment  PAL  Device 


Customize  a  DRAM  Controller  Using  Advanced  PAL  Devices 


The  data  from  this  table  is  directly  used  in  the  state  description 
syntax  to  generate  the  CAS  signals  for  state  1 .  The  software 
compiles  the  Boolean  equations  for  this  logic  directly. 

Once  asserted,  the  CAS  signals  are  held  by  the  state  machine 
until  the  appropriate  read,  write,  or  read-modify-write  cycles  are 
complete. 

DSACK  Acknowledge  to  CPU 

The  amount  of  data  receivable  by  any  device  is  dependent  upon 
its  port  size.  Based  on  the  port  size  (PRTO-1 )  the  encodings  for 
DSACKO-1  are  generated  to  acknowledge  the  CPU  and  inform  it 
of  the  port  size.  The  timing  of  acknowledge  is  synchronized  by  the 
state  machine.  For  read  and  write  cycles,  the  DSACK  signals  are 
generated  in  state  1  and  removed  on  cycle  completion  in  state  4. 

For  a  read-modify-write  cycle,  a  complication  exists.  The  read- 
modify-write  cycle  requires  two  DSACK  signals;  one  for  read  and 
one  for  a  write.  The  first  DSACK  is  generated  in  state  1  and 
removed  in  either  state  4  or  state  5.  The  second  is  generated  in 
state  8  and  removed  in  state  11. 


WE  Generation 

In  state  1  the  WE  is  generated  for  normal  write  cycles.  For  read- 
modify-write  cycles,  a  delayed  WE  signal  is  generated  based 
upon  the  internal  four-bit  state  counter  in  state  8  (Figure  3). 


DATA  BUS  ACTIVE  SECTIONS 

BYTE  (B)-WORD  (W)-LONG  WORD  (L)  PORTS 

TRANSFER 

SIZE 

SIZ1 

SIZO 

A1 

AO 

D31-D24 

D23-D16 

D15-D8 

D7-D0 

0 

1 

0 

0 

BWL 

0 

1 

0 

1 

B 

WL 

BYTE 

0 

1 

1 

0 

BW 

L 

0 

1 

1 

1 

B 

W 

L 

0 

0 

0 

BWL 

WL 

0 

0 

1 

B 

WL 

L 

WORD 

0 

1 

0  . 

BW 

W 

L 

L 

0 

1 

1 

B 

W 

L 

1 

0 

0 

BWL 

WL 

L 

THREE- 

1 

0 

1 

B 

WL 

L 

L 

BYTE 

1 

1 

0 

BW 

W 

L 

L 

1 

1 

1 

B 

W 

L 

0 

0 

0 

0 

BWL 

WL 

L 

L 

LONG 

0 

0 

0 

1 

B 

WL 

L 

L 

WORD 

0 

0 

1 

0 

BW 

W 

L 

L 

0 

0 

1 

1 

B 

W 

L 

Figure  6.  Data  Bus  Activity  for  Byte,  Word  and  Long  Word  Ports 


SIZ1  bits  by  the  processor,  and  the  alignment  is  indicated  by  the 
least  significant  address  bits  AO,  A1  of  the  processor. 

The  port  size  is  very  application  dependent.  Different  systems 
might  require  either  static  or  dynamic  selection  of  this  port  size. 
For  example,  the  designer  can  define  different  address  ranges 
where  the  port  size  may  be  different.  Alternatively,  jumpers  can 
also  define  the  port  size.  A  general-purpose  means  for  this  has 
been  selected,  in  the  form  of  two  input  pins  PRTO  &  PRT1 ,  which 
define  the  size  of  the  port. 

Port  Size 

The  size  of  the  data  bus  is  decided  based  on  the  input  pins  PRT1 , 
PRTO. 


PRTO 

PRT1 

SIZE 

L 

L 

8  bits 

L 

H 

1 6  bits 

H 

L 

1 6  bits 

H 

H 

32  bits 

For  32-bit  ports,  all  four  CAS  signals  need  to  be  asserted.  For  a 
16-bit  port  only  two  CAS  signals  need  to  be  asserted.  For  an 
eight-bit  port  only  CAS3  is  asserted.  As  mentioned  before,  the 
processor  always  attempts  to  transfer  the  maximum  possible 
amount  of  data.  Its  SIZO-1  signals  encode  the  amount  of  data 
being  transferred.  The  table  in  Figure  6  illustrates  the  relationship 
of  the  data  size,  port  size  and  alignment  which  is  decided  by 
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NAME  MEGABIT  DRAM  TIMING  &  ARBITRATION  CONTROLLER; 

PARTNO  AmPAL  23S8  ; 

DATE  03/24/86  ; 

REV  01  ; 

DESIGNER  KAPIL  SHANKAR  ; 

COMPANY  ADVANCED  MICRO  DEVICES  ; 

ASSEMBLY  NONE  ; 

LOCATION  SUNNYVALE,  CA  ; 

DEVICE  P23S8  ; 


NODE    r.B0,B1,B2,83,B4,B5]  ; 

/**  Declarations  and  Intermediate  Variable  Definitions  **/ 


FIELD  STMA  =  CB0.B1 ,B2,B3,B4]  ; 

FIELD  STM8  =  [B5]  J 

FIELD  BANK  =  [ADSEL1,  ADSELO]  ; 


/**  STATE  MACHINE  A  «*/ 
/**  STATE  MACHINE  B  **/ 
/**  MEMORY  BANK  ADDRESSED  "/ 


BANKO  ■ 

BANK  : 

0  ; 

BANK1  = 

BANK  : 

1  ; 

/* 

V 

BANK2  ■ 

BANK  : 

2  ; 

/* 

*/ 

BANK3  = 

BANK  : 

3  ; 

/* 

*/ 

/***********< 

******************************************* 

.„««««./ 

IDE  FINE 

SAO 

'b'00000 

/** 

STATE  0  **/ 

/*   Allowable  Target  Device  Types:    THE  ONLY  ONE 

*/ 

SOEFINE 

SA1 

'b'00001 

/** 

STATE  1  **/ 

/************ 

SDEFINE 

SA2 

■b'00010 

/** 

STATE  2  **/ 

SOEFINE 

SA3 

■b1 00011 

/** 

STATE  3  **/ 

/**  Inputs 

**/ 

SDEFINE 

SA4 

•b'00100 

/** 

STATE  4  **/ 

SDEFINE 

SA5 

'b'00101 

STATE  5  **/ 

PIN  2 

=   MREO  ; 

SDEFINE 

SA6 

'b'00110 

JM 

STATE  6  **/ 

PIN  3 

=    ADSELO  ; 

SDEFINE 

SA7 

•b'00111 

/** 

STATE  7  **/ 

PIN  4 

=   A0SEL1  ; 

SDEFINE 

SA8 

•b'01000 

/**  STATE  8  **/ 

PIN  5 

=    RHC  J 

SDEFINE 

SA9 

•b'01001 

/** 

STATE  9  **/ 

PIN  6 

=    RFCLK  i 

SDEFINE 

SA10 

'b'01010 

/**  STATE  10  **/ 

PIN  7 

=    INIT  J 

SDEFINE 

SA11 

'b'01011 

/**  STATE  11  **/ 

SDEFINE 

SA12 

'b'01100 

/*' 

>  STATE  12  **/ 

/**  Outputs 

**/ 

SDEFINE 

SA13 

•b'01101 

r 

•  STATE  13  **/ 

SOEFINE 

SA14 

■b'01110 

/**  STATE  14  **/ 

PIN  19 

■  ROWADD  ; 

SDEFINE 

SA15 

'b'01111 

/**  STATE  15  **/ 

PIN  18 

=  IRASO  ; 

SOEFINE 

SA16 

'b'10000 

/**  STATE  16  **/ 

PIN  17 

=  ICOLADD  ; 

SOEFINE 

SA17 

•b' 10001 

/* 

'  STATE  17  **/ 

PIN  16 

=  IRAS1  ; 

SOEFINE 

SA18 

"b'10010 

/* 

'  STATE  18  **/ 

PIN  15 

■  !RAS2  J 

SDEFINE 

SA19 

'b'10011 

/**  STATE  19  **/ 

PIN  14 

=  IRFSH  ; 

SDEFINE 

SA20 

"b'10100 

/**  STATE  20  **/ 

PIN  13 

=  IRAS3  ; 

SDEFINE 

SA21 

•b'10101 

/**  STATE  21  **/ 

PIN  12 

=  !SIG  ; 

SDEFINE 

SA22 

•b'10110 

/* 

•  STATE  22  **/ 

Figure  7.  Source  Code  for  Timing  and  Controller  PAL  Device 


SDEFINE  SBO  'b'O 
SOEFINE    SB1  'b'1 


/**  STATE  0  **/ 
/**  STATE  1  **/ 


/**    Logic  Equations  **/ 

/**  THE  FIRST  STATE  MACHINE  **/ 

SEQUENCE  STMA  < 

/**  POLLING  FOR  PROCESSOR  OR  REFRESH  CYCLES  **/ 

PRESENT  SAO    IF  !B5  &  IMREQ  &  BANKO  NEXT  SA1  OUT  RASO 

IF  IBS  &  IMREQ  &  BANK1  NEXT  SA1  OUT  RAS1 

IF  IB5  &  IMREQ  &  BANK2  NEXT  SA1  OUT  RAS2 

IF  IB5  &  IMREQ  &  BANK3  NEXT  SA1  OUT  RAS3 

/**  PROCESSOR  CYCLE  **/ 

/**  JUMP  TO  STATE  SA1  *»/ 


JUMP  TO  STATE  SA16  •*/ 

REMOVE  ROW  ADDRESS  AND  SET  UP  REFRESH  ADDRESS  **/ 


DEFAULT 
/**  ELSE  POLL  AGAIN  **/ 
/**  STAY  IN  STATE  SAO  **/ 


NEXT  SAO 


PRESENT  SA2  IF 

IF  BANK1 
IF  BANK2 
IF  BANK3 


/**  MEMORY  ACCESS  TIME  DELAY  **/ 

PRESENT  SA3  IF  BANKO 
IF  BANK1 
IF  BANK2 
IF  BANK3 


/**  ASSERT  THE  RAS  FOR  THE  MEMORY  BANK  ADDRESSED 

**/ 

PRESENT  SA4  IF 

BANKO 

1 

RMC 

IF 

BANK1 

& 

RMC 

IF    B5  NEXT 

SA16  OUT  ROWADD 

IF 

BANK2 

I 

RMC 

OUT  RFSH  ; 

IF 

BANKS 

S 

RMC 

/**  REFRESH  CYCLE  "/ 

IF 

RMC 

NEXT 

SA3 

OUT 

RASO 

NEXT 

SA3 

OUT 

RAS1 

NEXT 

SA3 

OUT 

RAS2 

NEXT 

SA3 

OUT 

RAS3 

NEXT 

SA3 

OUT 

ROWADD 

OUT 

COLADD 

OUT 

SIG  J 

NEXT 

SA4 

OUT 

RASO 

NEXT 

SA4 

OUT 

RAS1 

NEXT 

SA4 

OUT 

RAS2 

NEXT 

SA4 

OUT 

RAS3 

NEXT 

SA4 

OUT 

ROWADD 

OUT 

COLAD 

> 

OUT 

SIG  ; 

NEXT 

SA5 

OUT 

IRASO 

NEXT 

SA5 

OUT 

IRAS1 

NEXT 

SA5 

OUT 

IRAS2 

NEXT 

SA5 

OUT 

IRAS3 

NEXT 

SA5 

OUT 

! ROWADD 

OUT 

I  COLADD 

OUT 

SIG  ; 

/"  DISTINGUISH  BETWEEN  READ  WRITE  OR  READ  MODIFY  WRITE  CYCLES 
/**  IF  RMC  HIGH  EXECUTE  AS  READ  WRITE  CYCLE  **/ 
/**  JUMP  TO  STATE  SA5  **/ 

/**  ALSO  REMOVE  COLUMN  ADDRESS  AND  APPLY  ROW  ADDRESS  **/ 
/**  ALSO  REMOVE  THE  RAS  SIGNAL  **/ 


/**  THIS  IS  THE  PROCESSOR  CYCLE  **/ 


PRESENT  SA1 


IF  BANKO 

IF  BANK1 

IF  6ANK2 

IF  BANK3 


NEXT  SA2 
NEXT  SA2 
NEXT  SA2 
NEXT  SA2 
NEXT  SA2 


OUT  RASO 
OUT  RAS1 
OUT  RAS2 
OUT  RAS3 
OUT  ROWADD 
OUT  COLADD 
OUT  SIG  ; 


/**  REMOVE  ROW  ADDRESS  AND  SET  UP  COLUMN  ADDRESS  **/ 

/**  ASSERT  SIG  TO  INFORM  DATA  SIZING  &  ALIGNMENT  PAL  OF  PROCESSOR  CYCLE  **/ 


IF  BANKO  &  I RMC 

IF  BANK1  I  I  RMC 

IF  BANK2  &  I RMC 

IF  BANK3  &  IRMC 

IF  I RMC 


NEXT  SA8 
NEXT  SA8 
NEXT  SA8 
NEXT  SA8 
NEXT  SA8 


OUT  RASO  ; 
OUT  RAS1  ; 
OUT  RAS2  ; 
OUT  RAS3  ; 
OUT  ! ROWADD 
OUT  I COLADD 
OUT  SIG  ! 


/**  IF  RMC  LOW  EXECUTE  READ  MODIFY  WRITE  CYCLE  **/ 
/«•  JUMP  TO  STATE  SA8  **/ 

/**  ALSO  REMOVE  COLUMN  ADDRESS  AND  ASSERT  ROW  ADDRESS  **/ 


Figure  7.  Source  Code  for  Timing  and  Controller  PAL  Device  (Cont'd.) 


PRESENT  SA5 

/"  RAS  PRECHARGE  TIME  DELAY  **/ 


NEXT  SA6    OUT  1  S I G  ; 


/**  THIS  IS  THE  READ  WRITE  CYCLE  **/ 


PRESENT  SA13  IF  BANKO 

NEXT  SAH  OUT  IRASO  ; 

PRESENT  SA6 

NEXT  SA7  ; 

IF  BANK1 

NEXT  SAH  OUT  IRAS1  ; 

IF  BANK2 

NEXT  SAH  OUT  !RAS2  ; 

PRESENT  SA7 

NEXT  SAO  ; 

IF  BANK3 

NEXT  SAH  OUT  IRAS3  ; 

/**  PROCESSOR  READ  WRITE  CYCLE  COMPLETE  **/ 

NEXT  SAH  OUT  SIG  ; 

/**  REMOVE  THE  RAS  SIGNAL  *•/ 

/"THIS  IS  THE  READ  MODIFY  WRITE  CYCLE  **/ 

PRESENT  SAH 

NEXT  SA15  OUT  IStG  ; 

/**  RAS  PRECHARGE  TIME  DELAY  **/ 

PRESENT  SA8    IF  BANKO 

NEXT  SA9    OUT  RASO  j 

IF  BANK1 

NEXT  SA9    OUT  RAS1  J 

PRESENT  SA15 

NEXT  SAO  ; 

IF  BANK2 

NEXT  SA9    OUT  RAS2  ; 

IF  BANK3 

NEXT  SA9    OUT  RAS3  ; 

/**  PROCESSOR  READ  MODIFY  WRITE  CYCLE  COMPLETE  **/ 

NEXT  SA9    OUT  SIG  j 

/**  ACCESS  TIME  DELAY  FOR  BOTH  READ  AND  WRITE  **/ 

/**  THIS  IS  THE  MEMORY  REFRESH  CYCLE  **/ 

PRESENT  SA9    IF  BANKO 

NEXT  SA10  OUT  RASO  ; 

IF  BANK1 

NEXT  SA10  OUT  RAS1  j 

PRESENT  SA16 

NEXT  SA17  OUT  RASO 

IF  BANK2 

NEXT  SA10  OUT  RAS2  J 

OUT  RAS1 

IF  BANK3 

NEXT  SA10  OUT  RAS3  ; 

OUT  RAS2 

NEXT  SA10  OUT  SIG  ; 

OUT  RAS3 

OUT  ROWADD 

PRESENT  SA10  IF  BANKO 

NEXT  SA11  OUT  RASO  ; 

OUT  RFSH  J 

IF  BANK1 

NEXT  SA11  OUT  RAS1  J 

/**  ASSERT  ALL  FOUR  RAS  SIGNALS  **/ 

IF  BANK2 

NEXT  SA11  OUT  RAS2  ; 

IF  BANK3 

NEXT  SA11  OUT  RAS3  ; 

PRESENT  SA17 

NEXT  SA18  OUT  RASO 

NEXT  SA11  OUT  SIG  ; 

OUT  RAS1 

OUT  RAS2 

PRESENT  SA11  IF  BANKO 

NEXT  SA12  OUT  RASO  ; 

OUT  RAS3 

IF  BANK1 

NEXT  SA12  OUT  RAS1  J 

OUT  ROWADD 

IF  BANK2 

NEXT  SA12  OUT  RAS2  ; 

OUT  RFSH  ; 

IF  BANK3 

NEXT  SA12  OUT  RAS3  ; 

/**  REFRESH  TIME  DELAY  **/ 

NEXT  SA12  OUT  SIG  ; 

PRESENT  SA18 

NEXT  SA19  OUT  RASO 

PRESENT  SA12  IF  BANKO 

NEXT  SA13  OUT  RASO  ; 

OUT  RAS1 

IF  BANK1 

NEXT  SA13  OUT  RAS1  ; 

OUT  RAS2 

IF  BANK2 

NEXT  SA13  OUT  RAS2  ; 

OUT  RAS3 

IF  BANK3 

NEXT  SA13  OUT  RAS3  ; 

OUT  ROWADD 

NEXT  SA13  OUT  SIG  j 

OUT  RFSH  ; 

Figure  7.  Source  Code  for  Timing  and  Controller  PAL  Device  (Cont'd.) 

PRESENT  SA19 


NEXT  SA20  OUT  IRASO 
OUT  IRAS1 
OUT  !RAS2 
OUT  ! RAS3 


PRESENT  SA21 
PRESENT  SA22 

/**  TERMINATE  MEMORY  REFRESH  CYCLE  **/ 

/**  GO  BACK  TO  POLLING  STATE  **l 

> 

/**  THE  FIRST  STATE  MACHINE  COMPLETE  **/ 

/**  THE  SECOND  STATE  MACHINE  **/ 

SEQUENCE  STMB  ( 

PRESENT  SBO    IF  RFCLK 

DEFAULT 
/**  JUMP  TO  STATE  SB1  **/ 
/**  REMEMBER  REFRESH  REQUEST  **/ 

PRESENT  SB1     IF  RFSH 
DEFAULT 
/**  JUMP  TO  STATE  SBO  **/ 
/**  REFRESH  CYCLE  EXECUTED  THUS  RETURN  ** 


NEXT  SA22 
NEXT  SAO 


NEXT  SB1  ; 
NEXT  SBO  jj 


NEXT  SBO 
NEXT  SB1 


/**  EQUATIONS  FOR  SYSTEM  INITIALIZATION  AFTER  RESET  **/ 
[B0..B5J.ar  =  UNIT  ; 


OUT  ! ROUADD 

RASO.AR 

UNIT  ; 

OUT  ! RFSH  ; 

RAS1 .AR 

UNIT  ; 

ROUADD.  AR 

UNIT  ; 

/**  REMOVE  ALL  RAS  SIGNALS  **/ 

COL ADD. AR 

UNIT  ; 

/**  ALSO  REMOVE  REFRESH  ADDRESS  AND  ASSERT  ROW  ADDRESS  **/ 

RAS2.AR 

UNIT  j 

RAS3.AR 

UNIT  ; 

PRESENT  SA20                                                  NEXT  SA21  ; 

RFSH.AR 

s 

UNIT  ; 

/**  RAS  PRECHARGE  TIME  DELAY  **/ 

SIG.AR 

UNIT  j 

/*  THESE  ARE  THE  OUTPUT  ENABLES  */ 


RASO.OE 

RASI.OE 

ROUADD.  OE 

COLADD.OE 

RAS2.0E 

RAS3.0E 

RFSH.OE 

SIG.OE 


■  B'1 
■B'1 
■B'1 
■B'1 
'B'1 
•B'1 
■B'1 
IBM 


/**  THE  SECOND  STATE  MACHINE  COMPLETE  **/ 

Figure  7.  Source  Code  for  Timing  and  Controller  PAL  Device  (Cont'd.) 


NAME  MEGABYT  DRAM  DATA  SIZING  &  ALIGNMENT  CONTROLLER; 

PARTNO  AmPAL  23S8  ; 

DATE  03/24/86  ; 

REV  01  ; 

DESIGNER  KAPIL  SHANKAR  ; 

COMPANY  ADVANCED  MICRO  DEVICES  ; 

ASSEMBLY  NONE  ; 

LOCATION  SUNNYVALE,  CA  ; 

DEVICE  P23S8  ; 

/** 
/* 
/* 
/* 

y********  ************************************  ******************* 

/*    Allowable  Target  Device  Types:    THE  ONLY  ONE 


NODE  [B0,B1,B2,B3] 


/** 

Inputs 

"/ 

PIN 

2 

=    SIG  ; 

PIN 

3 

=    AO  ; 

PIN 

4 

=    A1  ; 

PIN 

5 

=    RU  ; 

PIN 

6 

=    RMC  i 

PIN 

7 

=    SIZO  J 

PIN 

8 

■    SIZ1  ; 

PIN 

9 

=    PRTO  ; 

PIN 

11 

=    PRT1  ; 

PIN 

12 

=    EMIT  ; 

/** 

Outputs 

**/ 

PIN 

19 

=  IUE  ; 

PIN 

18 

=  ICAS2  j 

PIN 

17 

=  ICASO  ; 

PIN 

16 

=  !CAS3  J 

PIN 

15 

=  >CAS1  ; 

PIN 

14 

=  ! 0  SACK  1  J 

PIN 

13 

=  IDSACKO  ; 

/ 
/ 
/ 
/ 

***/ 

*/ 

/ 


/**  Declarations  and  Intermediate  Variable  Definitions  **/ 


FIELD  STM  =  [B0,B1,B2,B3]  ; 


FIELD  PORTSIZE  =  [PRT1,  PRTO]  ; 


/•*  THE  STATE  MACHINE  **/ 


/»*  SIZE  OF  THE  PORT  **/ 
/**  INTERFACED  TO  PROCESSOR  **/ 
**/ 
*V 


PORT8  = 

PORTSIZE  :  0  ; 

/** 

8  BIT  WIDE  PORT 

PORT16  = 

PORTSIZE  :  2  ; 

16  BIT  WIDE  PORT 

PORT32  = 

PORTSIZE  :  3  ; 

/** 

32  BIT  WIDE  PORT 

/**  DEFINE  THE  STATES  **/ 

JOEFINE 

SAO 

'b'0000 

/** 

STATE  0  **/ 

tDEFINE 

SA1 

•b'0001 

/** 

STATE  1  **/ 

SOEFINE 

SA2 

•b'0010 

/** 

STATE  2  **/ 

SOEFINE 

SA3 

'b'0011 

/" 

STATE  3  **/ 

SOEFINE 

SA4 

'b'0100 

/** 

STATE  4  **/ 

SOEFINE 

SA5 

■b'0101 

/** 

STATE  5  **/ 

SOEFINE 

SA6 

'b'0110 

/** 

STATE  6  **/ 

SOEFINE 

SA7 

■b'0111 

/** 

STATE  7  **/ 

SOEFINE 

SA8 

'b'1000 

/** 

STATE  8  **/ 

SOEFINE 

SA9 

'b'1001 

/** 

STATE  9  **/ 

SOEFINE 

SA10 

•b'1010 

/** 

STATE  10  **/ 

SOEFINE 

SA11 

'b'1011 

/" 

STATE  11  **/ 

/**  Logic  Equations  **/ 
/**  THE  STATE  MACHINE  **/ 
SEQUENCE  STM  { 

/**  POLLING  FOR  SIG  INDICATING  A  PROCESSOR  CYCLE  **/ 


Figure  8.  Source  Code  for  Data  Sizing  and  Alignment  PAL  Device 


/**  PORT       DATA         OFFSET    STATE        OUTPUT  «*/ 

PRESENT  SAO 

/**  FOR  PORTS  I ZE  8  **/ 

IF  !SIG  &  PORT8  NEXT  SA1    OUT  CAS3  ; 

/**  GENERATE  ACKNOWLEDGE  **/ 

IF  ■ SIG  S  PORT8  NEXT  SA1    OUT  DSACKO  ; 


/**  FOR  PORTSIZE  16  **/ 

IF  IMG  &  PORT16  &  (AO  NEXT  SA1    OUT  CAS3  ; 

IF  !SIG  &  PORT16  &  ISIZO  &  SIZ1  &  AO  NEXT  SA1    OUT  CAS2  ; 

/**  GENERATE  ACKNOWLEDGE  **/ 

IF  ISIG  t  PORT16  NEXT  SA1    OUT  DSACK1  ; 


/**  FOR  PORTSIZE  32  **/ 
/**  BYTE  TRANSFER  SIZE  **/ 


IF 

■SIG 

i 

PORT32  t  IA1  &  IA0 

NEXT 

SA1 

OUT 

CAS3 

IF 

ISIG 

t 

PORT32  I  IA1  &  AO 

NEXT 

SA1 

OUT 

CAS2 

IF 

■SIG 

& 

PORT32  t  IA1  &  ISIZO 

NEXT 

SA1 

OUT 

CAS2 

IF 

ISIG 

& 

PORT32  &  IA1  t  SIZ1 

NEXT 

SA1 

OUT 

CAS2 

IF 

ISIG 

& 

PORT32  &  A1    &  I  AO 

NEXT 

SA1 

OUT 

CAS1 

IF 

ISIG 

& 

PORT32  &  IA1  t  ISIZ1 

&  ISIZO 

NEXT 

SA1 

OUT 

CAS1 

IF 

■SIG 

S 

PORT32  &  IA1  &  SIZ1  &  SIZO 

NEXT 

SA1 

OUT 

CAS1 

IF 

■SIG 

& 

PORT32  &  IA1  &  AO  & 

SIZO 

NEXT 

SA1 

OUT 

CAS1 

IF 

ISIG 

s. 

PORT32  &  A1  &  AO 

NEXT 

SA1 

OUT 

CAS3 

If 

ISIG 

8. 

PORT32  &  ISIZ1  &  ISIZO 

NEXT 

m 

OUT 

CAS2 

IF 

ISIG 

8, 

PORT32  &  AO  &  SIZ1  &  SIZO 

NEXT 

SA1 

OUT 

CASL 

/**  GENERATE  ACKNOWLEDGE  **/ 

IF  ISIG  &  PORT32  NEXT  SA1    OUT  DSACKO 

OUT  DSACK1  J 

/**  JUMP  TO  STATE  SA1  **/ 

/**  ASSERT  THE  CAS  FOR  THE  RIGHT  PORTSIZE  DATASIZE  AND  ADDRESS  OFFSET  **/ 

IF  ISIG  &  IRW  &  RHC  NEXT  SA1    OUT  WE  J 

/**  GENERATE  WE  FOR  READ  WRITE  CYCLE  ONLY  **/ 


IF  SIG 


NEXT  SAO  ; 

Figure  8.  Source  Code  for  Data  Sizing  and 


/**  ELSE  WAIT  FOR  THE  PROCESSOR  CYCLE  **/ 

/"  POLL  AGAIN  **/ 

/**  STAY  IN  STATE  SAO  **/ 


/**  THIS  IS  THE  PROCESSOR  CYCLE  START  «*/ 


PRESENT  SA1 

IF  CASO 

NEXT  SA2 

OUT  CASO  ; 

IF  CAS1 

NEXT  SA2 

OUT  CAS1  ! 

IF  CAS2 

NEXT  SA2 

OUT  CAS2  ; 

IF  CAS3 

NEXT  SA2 

OUT  CAS3  ; 

IF  IRW  t  RMC 

NEXT  SA2 

OUT  WE  ; 

IF  PORT 8 

NEXT  SA2 

OUT  DSACKO  J 

IF  PORT16 

NEXT  SA2 

OUT  DSACK1  j 

IF  PORT32 

NEXT  SA2 

OUT  DSACKO 

OUT  DSACK1  J 

NEXT  SA2  J 

/**  HOLD  CAS  GENERATED  IN  THE  LAST  STATE  **/ 

/**  IF  WRITE  CYCLE  AND  NOT  READ  MODIFY  WRITE  CYCLE  ASSERT  WE  "/ 
/**  JUMP  TO  STATE  SA2  **/ 


PRESENT  SA2 


IF  CASO 

NEXT  SA3 

OUT  CASO  ; 

IF  CAS1 

NEXT  SA3 

OUT  CAS1  ; 

IF  CAS2 

NEXT  SA3 

OUT  CAS2  ; 

IF  CAS3 

NEXT  SA3 

OUT  CAS3  ; 

IF  IRW  t  RMC 

NEXT  SA3 

OUT  WE  j 

IF  PORT8 

NEXT  SA3 

OUT  DSACKO  J 

IF  PORT16 

NEXT  SA3 

OUT  DSACK1  ; 

IF  PORT32 

NEXT  SA3 

OUT  DSACKO 

OUT  DSACK1  ; 

NEXT  SA3 

i 

/•*  HOLD  CAS  GENERATED  IN  THE  LAST  STATE  **/ 
/**  HOLD  WE  FOR  ACCESS  DURATION  **/ 
/**  JUMP  TO  STATE  SA3  **/ 

PRESENT  SA3 

IF  RHC  NEXT  SA4    OUT  I  CASO 

OUT  ICAS1 
OUT  ICAS2 

Alignment  PAL  Device  (Cont'd.) 
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OUT  ICAS3 

/"  CONTINUE  THE  READ  MODIFY  WRITE 

CYCLE  **/ 

OUT  IWE 

OUT  IDSACKO 

PRESENT  SA6 

OUT  IDSACKl  ; 

IF  CASO 

NEXT  SA7 

OUT  CASO  ; 

IF  CAS1 

NEXT  SA7 

OUT  CAS1  ; 

/**  FOR  READ  WRITE  CYCLE  REMOVE  CAS  AND  WE  FOR 

CYCLE  COMPLETION  *»/ 

IF  CAS2 

NEXT  SA7 

OUT  CAS2  ; 

/**  ACKNOWLEDGE  CYCLE  COMPLETION  BY  ACKNOT  **/ 

IF  CAS3 

NEXT  SA7 

OUT  CAS3  ; 

/**  JUMP  TO  STATE  SA4  **/ 

NEXT  SA7 

IF  CASO  &  IRMC                                    NEXT  SA5 

OUT  CASO  ; 

/**  CONTINUE  THE  READ  MODIFY  WRITE 

CYCLE  **/ 

IF  CAST  &  IRMC                                    NEXT  SA5 

OUT  CAST  ; 

IF  CAS2  &  IRMC                                    NEXT  SA5 

OUT  CAS2  ; 

PRESENT  SA7 

IF  CASJ  8,  IRMC                                    NEXT  SA5 

OUT  CAS3  ; 

IF  CASO 

NEXT  SA8 

OUT  CASO  ; 

IF  IRMC                                               NEXT  SA5 

OUT  IDSACKO 

IF  CAS1 

NEXT  SA8 

OUT  CAS1  ; 

OUT  IDSACKl  ; 

IF  CAS2 

NEXT  SA8 

OUT  CAS2  ; 

IF  CAS3 

NEXT  SA8 

OUT  CAS3  ; 

/**  FOR  READ  MODIFY  WRITE  CYCLE  CONTINUE  TO  ASSERT  CAS  SIGNALS  **/ 

IF  PORT8 

NEXT  SA8 

OUT  DSACKO 

/**  ACKNOWLEDGE  COMPLETION  OF  READ  PORTION  OF  THE  CYCLE  BY  ACKNOT  **/ 

IF  PORT16 

NEXT  SA8 

OUT  DSACK1 

/**  JUMP  TO  STATE  SA5  **/ 

IF  PORT32 

NEXT  SA8 

OUT  DSACKO 

OUT  DSACK1 

NEXT  SA8 

OUT  WE  5 

/**  REMAINING  PORTION  OF  THE  READ  WRITE  CYCLE  "/ 

/**  THIS  IS  THE  WRITE  PORTION  OF  THE  READ  MODIFY  WRITE  CYCLE  **/ 
PRESENT  SA4  NEXT  SAO  ;  /**  GENERATE  THE  WE  SIGNAL  FOR  ENABLING  WRITE  **/ 

/"  GO  BACK  TO  POLLING  STATE  SAO  **/  /"  GENERATE  THE  CORRECT  ACKNOWLEDGE  FOR  THE  WRITE  CYCLE  PORTION  •*/ 

/**  CONTINUE  TO  ASSERT  CAS  **/ 

/**  COMPLETION  OF  READ  WRITE  CYCLE  **/ 

PRESENT  SA8 


IF  CASO 

NEXT  SA9    OUT  CASO  ; 

IF  CAS1 

NEXT  SA9    OUT  CAS1  ; 

/**  REMAINING  PORTION  OF  THE  READ  MODIFY  WRITE  CYCLE  **/ 

IF  CAS2 

NEXT  SA9    OUT  CAS2  ; 

IF  CAS3 

NEXT  SA9   OUT  CAS3  ; 

PRESENT  SA5 

IF  PORT8 

NEXT  SA9    OUT  DSACKO  ; 

IF  CASO 

NEXT  SA6    OUT  CASO  ; 

IF  PORT16 

NEXT  SA9    OUT  DSACK1  ; 

IF  CAST 

NEXT  SA6    OUT  CAS1  ; 

IF  PORT32 

NEXT  SA9    OUT  DSACKO 

IF  CAS2 

NEXT  SA6    OUT  CAS2  ; 

OUT  DSACK1  ; 

IF  CAS3 

NEXT  SA6    OUT  CAS3  ; 

NEXT  SA9    OUT  WE  ; 

NEXT  SA6  } 

/**  THIS  IS  THE  WRITE  PORTION  OF  THE  READ  MODIFY  WRITE  CYCLE  **/ 

Figure  8.  Source  Code  for  Data  Sizing  and  Alignment  PAL  Device  (Cont'd.) 


PRESENT  SA9 


IF  CASO 

NEXT  SA10  OUT  CASO  ; 

IF  CAS1 

NEXT  SAIO  OUT  CAS1  ; 

IF  CAS2 

NEXT  SAIO  OUT  CAS2  ; 

IF  CAS3 

NEXT  SAIO  OUT  CAS3  ; 

IF  PORT8 

NEXT  SA10  OUT  DSACKO 

IF  PORT  16 

NEXT  SAIO  OUT  OSACK1 

IF  PORT32 

NEXT  SA10  OUT  DSACKO 

OUT  DSACK1  ; 
NEXT  SAIO  OUT  WE  ; 


/"  THIS  IS  THE  WRITE  PORTION  OF  THE  READ  MODIFY  WRITE  CYCLE  «*/ 


/**  EQUATIONS  FOR  SYSTEM  INITIALIZATION  AFTER  RESET 


[B0..B3].ar  ■  UNIT  ; 


CASO.AR 

UNIT  j 

CAS1 .AR 

UNIT  , 

DSACKO. AR 

UNIT  j 

DSACK1  .AR 

UNIT 

CAS2.AR 

UNIT  , 

CAS3.AR 

s 

UNIT 

WE.AR 

s 

UNIT 

PRESENT  SA10 

NEXT  SA11  OUT  ICASO 
OUT  !CAS1 
OUT  ICAS2 
OUT  ICAS3 
OUT  ! DSACKO 
OUT  IDSACK1 
OUT  >WE  ; 

/**  COMPLETION  OF  THE  WRITE  PORTION  OF  THE  READ  MODIFY  WRITE  CYCLE  **/ 
/**  REMOVE  ALL  CAS    WE    AND  ACKNOWLEDGE  COMPLETION  **/ 


/*  THESE  ARE  THE  OUTPUT  ENABLES  V 


CASO.OE 

CASI.OE 

DSACKO. OE 

DSACK1 .OE 

CAS2.0E 

CAS3.0E 

WE.OE 


'B'1 
'B'1 
'B'1 
■B'1 
'B'1 
'B'1 
•B'1 


/«  REMAINING  PORTION  OF  THE  READ  MODIFY  WRITE  CYCLE  **/ 

PRESENT  SA11  NEXT  SAO  ; 

/**  GO  BACK  TO  POLLING  STATE  SAO  **/ 

/**  COMPLETION  OF  THE  READ  MODIFY  WRITE  CYCLE  **/ 

> 

/**  THE  STATE  MACHINE  COMPLETE  **/ 


Figure  8.  Source  Code  for  Data  Sizing  and  Alignment  PAL  Device  (Cont'd.) 
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Introduction 

This  application  note  describes  the  implementation  of  a  timing 
generator  which  interfaces  between  the  8088  and  the  Am2968 
Dynamic  Memory  Controller  (DMC)  using  programmable  logic 
and  a  delay  line.  The  implementation  does  not  account  for  any 
error  detection  and  correction  (EDC)  circuitry. 

The  timing  generator,  like  the  Am2970,  is  needed  in  memory 
systems  utilizing  a  dynamic  memory  controller.  It  serves  as  an 
interface  between  the  processor  and  the  controller  and  gener- 
ates the  necessary  control  signals  for  the  controller. 

A  dynamic  memory  controller,  in  brief,  interfaces  between  a  pro- 
cessor and  the  dynamic  memory  array.  It  steers  the  appropriate 
address  inputs  and  Row  and  Column  Address  Strobes  (RAS 
and  CAS)  required  in  the  selected  memory  operations  (i.e. 
refreshing,  read/write). 

The  following  sections  will  introduce  and  illustrate  the  implemen- 
tation of  the  timing  generator  using  20-pin  PAL  devices,  and  also 
show  its  interface  to  AMD's  Am2968  (DMC)  and  8088  proces- 
sor. Figure  1  shows  the  block  diagram  of  the  interface.  Program- 
mable logic  was  chosen  because  it  is  readily  available,  simple  to 
use  and  reduces  the  number  of  devices  required  to  implement 
specific  functions. 

Interface  Overview 

The  implementation  of  this  timing  generator  is  not  a  general 
purpose  application.  It  is  dedicated  specifically  to  a  particular 
processor,  the  Intel  8088,  and  is  limited  to  accessing  only  four 
banks  of  memory.  The  range  of  the  four  banks  of  memory  is 


therefore  1  Mbyte— with  256K  in  each  bank.  This  limitation  is  set 
by  the  fact  that  no  additional  decoding  circuitry  has  been  added. 
Decoding  is  done  with  the  two  address  lines  (A18-A19)  of  the 
8088,  thus  allowing  only  four  banks  to  be  selected. 

The  clock  to  the  8088  and  the  two  PAL  devices  is  provided  by 
the  8284A.  The  CLK  signal  from  the  8284A  is  connected  directly 
to  the  CLK  input  of  the  8088  and  also  to  CLK  of  RASER 
(PAL16R4).  The  PCLK  of  the  8284A  connects  to  the  PCLK  input 
of  HLDR  (PAL16R6)  to  operate  the  6-bit  counter.  Note  that  the 
clock  operates  on  a  1/3  and  2/3  duty  cycle. 

Description  of  PAL  Device  Function 

The  function  of  RASER  (PAL16R4)  is  to  create  RASI  (Row  Ad- 
dress Strobe)  and  CASI  (Column  Address  Strobe)  at  the  appro- 
priate time.  RASI  is  generated  directly  from  the  device,  whereas 
CASI  is  generated  from  MRASI  via  a  delay  line.  Figure  2  shows 
the  timing  delays  between  the  controller  and  processor  interface 
signals.  The  two  modes  of  operation  which  the  timing  is  focused 
on  is  Refresh  w/o  Scrubbing  and  Read/Write.  By  toggling  the 
state  of  the  mode  control  pin  MC1  to  either  a  LOW  or  HIGH 
respectively,  and  with  MCO  tied  LOW  (at  the  Am2968),  the 
desired  mode  is  generated.  The  mode  control  pin  MC1  is 
generated  based  on  the  state  of  the  processor's  interface  pins: 
IO/M,  DT/R  and  SSO.  The  combination  of  these  signals  decodes 
the  processor's  current  bus  cycle  and  indicates  the  ongoing 
memory  activity.  The  MUX  Select  pin  (MSEL)  is  also  generated 
from  the  delay  line.  It  determines  whether  a  Row  or  Column 
address  is  sent  to  the  memory  address  input  based  on  the  mode 
control  inputs,  MCO  and  MC1 . 
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Figure  1.  Am2968  to  8088  Interface  Block  Diagram 
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Figure  2.  Interface  Timing 


A  two-bit  counter  implemented  internal  to  the  (PAL16R4) 
RASER  is  used  to  keep  track  of  the  internal  state  of  RASI  during 
the  Read/Write  or  Refresh  operations.  In  a  normal  memory 
Read/Write  operation,  the  appropriate  RASn  outputs  from  the 
DMC  will  be  activated  in  response  to  RASI  going  HIGH,  as 
shown  in  the  memory  timing  in  Figure  3,  during  this  time  ALE 
initiates  the  cycle.  In  Refresh  mode,  receiving  RASI  will  force  all 
the  RASn  outputs  of  the  DMC  to  go  Low.  FALE  initiates  the 
refresh  cycle,  since  no  ALE  occurs  during  this  time,  as  shown  in 
refresh  timing  of  Figure  3.  The  counter  is  also  configured  such 
that,  during  memory  operations,  sufficient  time  has  been  allotted 
within  the  cycle  for  RAS  precharge  (required  in  DRAMs)  to 
occur. 

b)  HLDR  (PAL16R6) 

The  function  of  the  (PAL16R6)  HLDR  is  to  generate  the  two 
signals— FALE  (False  Address  Latch  Enable)  and  HLD  (Hold). 
FALE  controls  the  internal  latches  (ALS  and  ALR)  of  the 
(PAL16R4)  RASER  and  initiates  the  refresh  cycle  independent 
of  the  processor's  ALE  (Address  Latch  Enable)  signal. 


The  (PAL16R6)  HLDR  is  essentially  a  6-bit  counter  that  gener- 
ates 64  T-Clock  cycles  for  the  8088  processor.  When  a  hold 
request  (HLD)  is  made  and  a  hold  acknowledge  (HLD A)  is 
received  from  the  processor,  FALE  will  be  generated  for  the 
(PAL16R4)  RASER  to  initiate  the  refresh  cycle.  Refresh  is  per- 
formed every  8  ^s  which  is  double  the  required  frequency.  This 
time  is  derived  in  the  Refresh  Calculations  section  of  this  appli- 
cation note.  The  8088  processor  allows  request  only  at  the  end 
of  the  bus  cycle.  A  bus  cycle  duration  is  4  T-Clocks,  thus  refresh 
is  performed  at  cycles  "60-63"  as  noted  in  the  function  table  in 
the  HLDR  design  specification.  Figure  2  refresh  timing  shows 
this  critical  portion  of  the  timing  activity  when  RASI  is  generated 
for  refresh.  During  CLK  (0  -  0*),  (this  corresponds  to  PCLK  count 
value  62),  the  processor  acknowledge  is  received  and  RASI 
becomes  active.  While  RASI  is  HIGH,  the  internal  2-bit  counter 
will  cycle  through  its  count  sequence  (0*  -  3*);  at  the  end  of  the 
sequence  as  indicated  by  PCLK  count  value  63  (i.e.,  CLK  3*), 
RASI  will  become  inactive. 
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Figure  3.  Refresh  Timing 


Refresh  Timing  Calculations 

The  timing  calculations  are  based  on  8088  processor  being  in 
minimum  mode  (8  MHz)  and  a  cell  array  of  128  rows. 

The  8088  clock  period  is  8  MHz,  thus 

t  =  1'f  =  1/8  M  =  125  ns. 

Refreshing  for  DRAMs  are  performed  on  128  rows  every  2  ms. 
The  calculations  for  the  above  memory  array  is  determined 
to  be: 

Required  refresh  time  =  2  ms/128  =  15.6  (is. 
Thus  the  required  time  for  256  row  cell  array  will  be: 
Required  refresh  time  =  4  ms/256  =  15.6  (is. 


From  the  above  calculations,  refreshing  needs  to  be  performed 
at  least  every  15  ixs.  But  for  this  application,  memory  is  being 
refreshed  every  8  ^s  as  shown  below: 

Refresh  time  =  64  T-ciock  cycles  x  125  ns  =  8  its. 

Delay  Line  Taps  Computation 

The  calculations  for  the  assignment  of  the  delay  line  taps  is 
based  on  the  parameters  CASI  and  MSEL  relative  to  RASI 
making  a  transition  to  the  active  state.  See  the  timing  in  Fig- 
ure 4. 
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Figure  4.  Delay  Line  Timing  Diagram 
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MSEL  may  switch  some  time  after  T2.  T2  is  calculated  to  be  as 
follows: 

T2  =  tRAH  +  t24  =  20  +  1 1  =  31  ns. 

where  T2     =  delay  time  from  RASI  to  MSEL 

(rah  =  Row  Address  Hold  Time  of  the  DRAM 
t24     =  skew  time  for  Qn  to  RASn  of  the  DMC 

Thus,  after  31  ns  MSEL  should  initiate  its  active  transition. 

CASI  is  calculated  to  make  its  change;  T3  ns  relative  to  RASI. 
The  calculated  value  is  as  follows: 

T3  =  T2  +  t25  +  tASc  =  31  +  33  +  0  =  64. 

where  T3     =  delay  from  RASI  to  CASI 

t25     =  skew  time  for  Qn  to  CASn  of  the  DMC 
*asc  =  column  setup  time  of  the  DRAM 


Thus,  64  ns  after  RASI  initiates  its  transition,  CASI  may  begin  to 
go  active. 

For  simplicity,  delay  line  taps  may  be  assigned  as  follows: 

RASI   MSEL  ~  30  ns 

RASI   CASI  ~  60  ns 

Counter  and  Mode  Function  Table 

The  following  function  tables  describe  the  activity  of  the 
RASI/MRASI  during  the  specific  modes  of  the  processor's  inter- 
face signals  (IO/M,  DT/R  and  SSO)  and  the  state  of  the  internal 
counter  which  controls  RASI/MRASI.  Note:  RASI  will  be  active 
during  the  counter  sequences  between  1  to  3  as  shown  below  in 
Figure  5. 


Two  Bit  Counter 


A 

B 

RASI 

0 

0 

0 

• 

0 

0 

;0 

0 

1 

0 

;1 

1 

0 

1 

;2 

1 

1 

1 

;3 

1 

Two  Bit  Counter 


Sequence 

IO/M 

DT/R 

SSO 

MRASI 

COMMENTS 

0 

1 

0 

0  ----- 

0  ------ 

IACK 

0 

1 

0 

0  

IOR 

0 

1 

1 

0  

0  

IOW 

0 

1 

1 

1  — 

0  

HALT 

0 

0 

1 

1  

0  

PASSIVE 

1 

3  

0 

0 

0  

1  — 

CODE  ACCESS 

1 

3  

0 

0 

1  — 

1  — 

RD 

1 

3  

0 

1 

0  

WR 

Note:  This  design  does  not  allow  the  user  to  insert  Wait  States 
during  memory  operations!  Wait  States  may  be  inserted  in 
interrupt  or  I/O  cycles. 


Figure  5. 
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TITLE         Am2968  TO  8088  INTERFACE 

PATTERN  RASER 

REVISION  01 

AUTHOR       G.  SPEARS 

COMPANY     ADVANCED  MICRO  DEVICES 

DATE  11/06/87 


CHIP  RASER  PAL16R4 


CLK     RES     ALE     FALE     IOM     DTR     SSO     CLK2     NU2  GND 
TRI     MRASI     RASI     /MCI     NC     /B     /A     /ALR     /ALS  VCC 

EQUATIONS 

ALS  =     FALE* /ALE* /ALR 
+  RES 

ALR  =  /A*B*/ALS 
+  A*/ALS 

/RASI   =  /A*/B*/RES 
+  RES*CLK2 

/MRASI   =  IOM*/DTR 
+   IOM* DTR 
+  /IOM*DTR*SS0 
+  /A*/B 

:=  /A*/RES*/ALS 
+  /A*/RES*ALE 

;=  A*/B*/RES*/ALS 
+  /A*B*/RES*/ALS 

MCI    :=   IOM* /DTR* ALE* /RES 
+  DTR*SS0*ALE*/RES 
+  IOM*DTR*/SS0*ALE*/RES 
+  MC1*/ALE*/RES 

SIMULATION  NOT  INCLUDED 


Figure  6.  Source  Listing  for  the  8088  to  Am2968  Interlace  (AmPAL16R4) 
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TITLE         Am2968  TO  8088  INTERFACE 

PATTERN  HLDR 

REVISION  01 

AUTHOR        G.  SPEARS 

COMPANY     ADVANCED  MICRO  DEVICES 

DATE  11/06/87 

.  CHIP  HLDR  PAL16R6 

PCLK  RASI  MRASI  HLDA  RES  CLK  MCI  NU2  NU3  GND 
TRI    /FALE  /F  /E  /D  /C  /B  /A  HLD  VCC 

EQUATIONS 

A   :=  /A*/RES*MC1 

B    :=  A*/B*/RES*MC1 
+  /A*B*/RES*MC1 

C    :=  A*B*/C*/RES*MC1 
+  /B*C*/RES*MC1 
+  /A*C*/RES*MC1 

D   :=  A*B*C*/D*/RES*MC1 
+  /C*D*/RES*MC1 
+  /B*D*/RES*MC1 
+  /A*D*/RES*MC1 

E    :=  A*B*C*D*/E*/RES*MC1 
+  /D*E*/RES*MC1 
+  /A*E*/RES*MC1 
+  /B*E*/RES*MC1 
+  /C*E*/RES*MC1 

F    :=  /A*F*/RES*MC1 

+  /C*F*/RES*MC1 

+  /B*F*/RES*MC1 

+  /D*F*/RES*MC1 

+  A*B*C*D*E*/F*/RES*MC1 

+  /E*F*/RES*MC1 

/HLD  =  /C  +  /D  +  /E  +  /F 

FALE  =  HLDA* CLK* RASI* /MRASI 

;    SIMULATION  NOT  INCLUDED 


Figure  7.  Source  Listing  for  the  8088  to  Am2968  Interface  (AmPAL16R6) 


MC68000  to  Am2968  Interface 


Introduction 

This  application  note  shows  how  the  general-purpose  timing 
generator  can  be  configured  to  interface  between  MC68000  and 
the  Am2968.  Figure  1  shows  the  block  diagram  of  the  interface. 
The  correct  interface  signals  must  be  adapted  from  the  proces- 
sor to  the  TimGen  (PAL16L8)  and  the  Am2968.  Note,  from  Fig- 
ure  2,  that  AS  (Address  Strobe)  replaces  CYCREQ,  and  UDS 
(UpperJData  Strobe)  and  LDS  (Lower  Data  Strobe)  replace  DS 
and  B/W. 

Asserting  the  Address  Strobe  signal  (AS),  which  is  connected  to 
the  LE  input,  latches  the  valid  address  into  the  Am2968.  UDS 
and  LDS  are  controls  indicating  the  flow  of  data  on  the  data  bus 
D00-D15)  during  memory  read/write  operations.  By  asserting 
he  correct  polarity  on  these  two  signals  (UDS  &  LDS),  either  a 
byte  or  a  word  may  be  accessed. 

The  following  sections  will  show  the  timing  requirements  which 
are  being  considered — that  is,  the  desired  processor  operating 
frequency  and  the  appropriate  DRAM.  Timing  diagrams  are  also 
included  to  show  the  status  of  the  interface  signals  during 
read/write  operations  for  various  processor  frequencies. 

MC68000  Timing  Requirements 

In  generating  the  hardware  for  the  timing  interface  for  the 
MC68000  to  the  Am2968,  the  overall  system  timing  require- 
ments must  be  considered  to  guarantee  that  the  desired 
memory  access  time  can  be  met. 

The  following  general  equation  formulates  the  parameters  which 
must  be  considered  in  generating  the  read  cycle  time  for  the 
68000  processor.  The  write  cycle  time  may  be  similarly 
generated. 

tREAD  =  2tCYC  +  'CH  ~  tcHSLx  ~  toiCL  -  2tPAL  -  t2  -  t|_ATCH 
+  NtcYC 


•read  =  read  cycle  time 

'cyc  =  clock  period  of  the  processor 

tcH  =  clock  width  High  of  the  processor 

'chslx  =  clock  High  to  AS,  DS  Low  of  the  processor 

*dicl  =  data  in  to  clock  Low  (setup  time)  of  the  processor 

Iral  =  programmable  logic  access  time 

t2  =  delay  from  RASI  to  RASn  of  the  DMC 

f latch  =  delay  through  latch 

N  =  number  of  inserted  Wait  States. 

For  simplicity,  zero  Wait  States  have  been  assumed  in  selecting 
the  appropriate  DRAMs.  If  Wait  States  had  been  asserted,  the 
appropriate  DRAM  must  be  selected  to  meet  the  overall  tREAD 
cycle  time.  Note  that  the  access  time  of  the  DRAM  (tACC)  must 
be  less  than  or  equal  to  tp^AD  cycle  time  or  else  access  to 
memory  will  be  missed.  The  following  evaluations  show  the  cal- 
culated tAcc  based  on  the  various  processor  operating  frequen- 
cies: 8  MHz,  10  MHz  and  12.5  MHz. 

MC68000  W/WAIT  STATES 

8  MHz 

tREAD  =  (250  +  55  -  60  -  15  -  30  -  23  -  13  +  N  •  125)  ns 
=  (164  +  125N)  ns 

DRAM  tACc  =  150  ns 
10  MHz 

tREAD  =  (200  +  45  -  55  -  10  -  30  -  23  -  13  +  N  •  100)  ns 

=  (114  +  100N)  ns 
DRAM  tACC  =  100  ns 
12.5  MHz 

Iread  =  (160  +  35  -  55  -  10  -  30  -  23  -  13  +  N  •  80)  ns 
=  (64  +  80N)  ns 
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Figure  1.  68000  System  Diagram 
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Figure  2.  Interface  Diagram 
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Figure  3.  Read  and  Write  Cycle  Timing  Diagram 


For  12.5  MHz  frequency,  asserting  at  least  one  Wait  State  will 
allow  tREAD  =  144  ns,  thus  DRAM  tACC  =  120  ns. 

Figure  3  shows  the  read  and  write  cycle  timing  diagrams  relative 


to  the  processor's  cycle  time.  The  8  MHz  and  1 0  MHz  are  shown 
on  the  same  diagram  because  of  their  similarity.  Figure  4  shows 
the  read  and  write  timing  for  the  processor  operating  at  12.5 
MHz  with  Wait  States  asserted  during  these  operations. 
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TITLE         GEN  INTERFACE  FOR  68000 
PATTERN  TIMGEN 
REVISION  01 
AUTHOR  LEE/VEE 

COMPANY     ADVANCED  MICRO  DEVICES 
DATE  11/06/87 

CHIP  TIMGEN  PAL16L8 

/CS     /AS     /FR     /SUP     RW     T80       /UDS     /LDS     T100  GND 
NC     /DTACK     TO     FRH     /RFRQ     /RWRQ     /WEL     /WEH     /OE  VCC 


/FRH  =  FR*/FRH  +  T0*RFRQ 

RWRQ  =  CS*AS*/RWRQ*/T100  +  RWRQ*/T100 

RFRQ  =  FR*FRH*/RWRQ*/T100  +  RFRQ*/T100 

/TO  =  /RWRQ* /RFRQ  +  T80 

DTACK  =  CS*RWRQ*/RFRQ*T80 

DTACK. TRST  =  CS*RWRQ*/RFRQ*T8 0 

WEH  =  /SUP*RWRQ*CS*/UDS*/RW 

WEL  =  /SUP*RWRQ*CS*/LDS*/RW 

OE  =  CS*RW*UDS  +  RW*CS*LDS 

;    SIMULATION  NOT  INCLUDED 


Figure  5.  Source  Listing  for  the  68000  Interface 
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Introduction 

This  application  note  shows  the  implementation  of  a  general- 
purpose  dual-port  arbiter  interfacing  three  processors,  a 
Dynamic  Memory  Controller  (Am2968A  DMC),  and  dynamic 
memory.  The  arbiter,  implemented  with  two  programmable  array 
logic  PAL  devices,  and  a  delay  line,  can  provide  a  simple  and 
complete  arbitration  solution  operating  in  asynchronous  mode. 

The  interface  consists  of  two  PAL  devices  (TIMGEN2A  and 
TIMGEN2B),  a  timer,  a  delay  line,  and  two  multiplexers.  Figure  1 
shows  the  block  diagram  of  the  system  which  includes  the  ar- 
biter logic  implemented  with  the  PAL  devices.  These  two 
devices,  TIMGEN2A  and  TIMGEN2B,  provide  the  control  sig- 
nals to  the  DMC  as  well  as  to  the  dynamic  memory.  The  purpose 
of  the  arbiter  is  to  mediate  when  two  processors  request  for 


memory  cycles,  or  when  refresh  cycles  need  to  be  performed. 
The  first  PAL  device,  TIMGEN2A,  generates  the  signals  for  the 
DMC  and  the  Address  MUX;  it  also  contains  the  arbitration  logic. 
The  second  device,  TIMGEN2B,  generates  the  control  signals 
for  the  memory  and  the  data  bus  latches. 

The  Am2968A  Dynamic  Memory  Controller  (DMC)  interfaces 
between  the  processors  and  the  dynamic  memory  array.  The 
DMC  provides  the  required  addresses  either  for  the  memory  or 
the  refresh  cycles.  For  memory  cycles,  the  addresses  are 
generated  by  the  processor  and  latched  into  the  DMC.  For 
refresh  cycles,  the  addresses  are  generated  by  a  20-bit  counter 
internal  to  the  DMC.  When  the  DMC  receives  the  correct  control 
signals,  it  generates  the  appropriate  address  (Qn)  and  the  Row 
and  Column  Address  Strobes  (RASn  and  CASn)  that  are  re- 
quired to  perform  memory  read/write  and  refresh. 
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Interface  Overview 

The  dual-port  arbiter  can  be  interfaced  to  various  processors, 
using  the  proper  control  signals  from  the  particular  processor. 
Typically,  the  processor  interface  signals  are  CS  (Chip  Select), 
CYCREQ  (Cycle  Request),  RD  (Read),  and  WR  (Write),  see 
Figure  1 . 

The  two  PAL  devices,  TIMGEN2A  and  TIMGEN2B  are  designed 
to  operate  in  the  following  modes: 

(1 )  generate  proper  control  signals  when  cycle  request  is  made 

(2)  arbitrate  between  two  processor  requests 

(3)  arbitrate  between  a  refresh  request  and  processor  request 

(4)  arbitrate  between  a  refresh  request  and  two  cycle  requests 


Requests  by  the  processor  are  made  when  CS  and  CYCREQ 
are  valid  and  the  memory  address  is  present.  During  dual-port 
arbitration,  the  occurrence  of  simultaneous  requests,  for  in- 
stance, CSAJChip  Select  Port  A),  CYCREQA  (Cycle  Request 
Port  A)  and  FR  (Forced  Refresh),  will  result  in  the  granting  of  a 
refresh  cycle  and  the  appropriate  address  being  generated 
accordingly  by  the  DMC. 

TIMGEN2A  generates  the  appropriate  control  signals  to  the 
data  and  address  multiplexers,  the  delay  line,  and  the  memory 
controller.  These  control  signals  are:  RASI  (Row  Address  Strobe 
Input),  SEL  (Processor  Address  MUX  Select),  and  MC1  (Mode 
Control  Input).  RASI  is  generated  for  all  read/write  and  refresh 
cycles.  It  is  also  used  as  the  delay  line  input  from  which  MSEL 
(MUX  Select  for  the  DMC)  and  CASI  (Column  Address  Strobe 
Input)  are  generated.  SEL  allows  the  correct  port  address,  A  or 
B,  to  be  the  input  to  the  DMC  depending  upon  which  port  is 
acknowledged.  MC1  specifies  the  operating  mode  of  the  DMC, 
read/write  or  refresh  (see  Am2968A  data  sheet). 

The  arbitration  logic  prioritizes  and  grants  the  requests  for 
read/write  cycles  from  either  port,  A  or  B,  and  refresh  cycles. 
The  Mode  Table  (Figure  2)  shows  the  order  of  precedence. 
When  simultaneous  request  for  a  read/write  or  refresh  cycle 
occurs,  the  refresh  cycle  will  be  given  priority.  The  connection  of 
RFRQ  (Refresh  Request)  from  TIMGEN2A  to  MC1  of  the  DMC, 
as  shown  in  the  block  diagram,  implicitly  gives  refresh  the 
precedence.  When  simultaneous  read/write  requests  are  made 
by  processor  A  and  processor  B,  processor  A  will  be  given 
priority  over  B.  Again,  the  order  of  precedence  is  set  implicitly  by 
the  connection  of  RWRQA  (Read/Write  Request  Port  A)  to  the 
select  input  signal  of  the  processor  address  MUX. 
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The  second  programmable  logic  device,  TIMGEN2B  generates 
the  OE  (Output  Enable)  signal  for  the  data  bus  latches  and  the 
WE  (Write  Enable)  signal  for  memory.  The  inputs  to  TIMGEN2B 
are  the  control  signals— CSA,  CSB,  RDA  (Read  A),  RDB  (Read 
B),  WRA  (Write  A),  and  WRB  (Write  B).  Signals  such  as  BHEN 
(Byte  High  Enable)  and  A00,  the  least  significant  bit  of  the  ad- 
dress, have  been  included  to  show  their  function  (if  available). 
These  two  signals  are  provided  by  the  processor  and  together 
they  determine  whether  a  word  or  byte  transfer  is  to  be  per- 
formed during  memory  operations. 


0.01  (iF 


Figure  3.  Refresh  Timer  Diagram 


Refresh  Timer  and  Calculations 

The  refresh  timer  below  shows  one  method  of  implementing  an 
external  refresh  clock.  This  section  is  optional  if  an  alternate 
source  is  used.  The  refresh  timer,  implemented  with  a  555  timer, 
needs  to  generate  an  active-LOW  edge-triggered  signal  at  least 
once  every  15  (is.  This_means  that  a  refresh  occurs  when  a 
LOWgoing  edge  on  the  FR  signal  is  detected.  The  15  (is  value 
is  derived  from  the  dynamic  memory  refresh  timing  require- 
ments; refresh  is  normally  required,  over  256  rows,  every  4  ms. 
This  equates  to  a  required  refresh  cycle  every  15.6  (is.  In  this 
application,  the  refresh  timer  generates  FR  approximately  once 
every  9.8  (j.s.  Figure  3  shows  the  circuit  for  the  forced  refresh 
(FR)  timer.  The  timer  is  a  commercially-available  555  timer.  The 
delay  equation  shown  below  must  be  solved  for  R1,  R2,  and  C 
values.  This  satisfies  the  refresh  requirement  of  less  than,  or 
equal  to,  15.6  (is 

T  =  0.693  (R1  +  2R2)  C  <  4  ms/256  =  15.6  (is 
For  the  following  values  of  R1 ,  R2  and  C 
R1  =  R2  =  10K  and  C  »  470  pF 

the  timer  generates  a  refresh  every  9.8  (is  which  allows  some 
margin  in  the  refresh  requirement.  The  specified  values  for  R1, 
R2,  and  C  allow  for  30%  total  discrete  component  tolerance. 


Figure  2.  Mode  Select  Table 
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TIMGEN2A 

The  function  of  TIMGEN2A  is  to  generate  RASI  (Row  Address 
Strobe  Input)  and  arbitrate  between  processor  and  refresh 
cycles.  Based  on  the  arbitration,  TIMGEN2A  also  generates 
RWRQ  (Read/Write  Request),  or  RFRQ  (Refresh  Request),  that 
determines  the  type  of  cycle  to  be  performed.  TIMGEN2A  can 
arbitrate  between  refresh  and  processor  requests  and  also  arbi- 
trate between  Port  A  and  Port  B  processor  requests. 

When  simultaneous  refresh  and  processor  requests  are 
generated,  refresh  (RFRQ)  will  be  given  priority.  The  priority  is 
implicit  by  design  because  the  RFRQ  output  of  TIMGEN2A  is 
connected  to  MC1  of  the  Am2968A  DMC.  When  RFRQ  be- 
comes active  (LOW),  MC1  forces  the  DMC  into  refresh  mode. 

When  simultaneous  processor  requests  are  made,  RWRQA 
(Read/Write  Request  of  Port  A)  will  be  granted  before  RWRQB 
(Read/Write  Request  of  Port  B).  This  priority  is  also  implicit  by 
design  because  the  RWRQA  output  is  connected  to  SEL  of  the 
processor's  address  multiplexer.  When  RWRQA  becomes  ac- 
tive, address  from  Port  A  will  be  selected  for  input  to  the  Row 
and  Column  Address  latches  of  the  Am2968A,  also,  the  data 
from  Data  Bus  A  will  be  allowed  to  flow  to  the  inputs  of  the 
DRAMs.  In  either  case,  RASI  must  be  generated  to  initiate  any 
of  the  cycles.  Once  RASI  is  generated,  the  control  signals, 
MSEL  and  CASI,  will  be  generated  from  the  delay  line  relative 
to  RASI. 

TIMGEN2B 

The  function  of  TIMGEN2B  is  to  generate  WE  (Write  Enable)  for 
memory  and  also  OE  (Output  Enable)  to  control  the  flow  of  data 
through  the  data  bus  latches.  The  inputs  to  TIMGEN2B  are, 
CSA,  RDA  (Read  Strobe  A),  WRA  (Write  Strobe  A),  CSB,  RDB 
(Read  Strobe  B),  and  WRB  (Write  Strobe  B).  In  addition,  there 
are  BHENA  (Byte  High  Enable  A)  and  BHENB  (Byte  High 
Enable  B),  AOOA  (least  significant  address  bit  for  Port  A)  and 
AOOB  (least  significant  address  bit  for  Port  B).  These  signals  are 
generated  by  the  processor  and  are  used  to  specify  either  byte 
or  word  transfers. 

The  following  description  outlines  the  operation  of  TIMGEN2B.  If 
CSA  and  RDA  become  active  (LOW),  TIMGEN2B  will  cause 
OEA  to  become  active  (LOW),  enabling  the  data  bus  latches 
and  allowing  data  corresponding  to  address  A  to  flow  onto  the 
system  data  bus  (read  operation).  The  identical  procedure  oc- 
curs for  Port  B  when  the  control  inputs  for  Port  B  is  active.  The 
block  diagram  shown  in  Figure  1  shows  a  multiplexer  for  the 
system  data  bus.  This  multiplexer  controls  the  data  flow  into 
memory  from  either  processor  A  or  processor  B,  during  write 
operations.  The  select  to  the  MUX  is  controlled  by  RWRQA,  the 
same  signal  that  controls  the  select  to  the  processor  address 
MUX,  allowing  address  Port  A  to  implicitly  have  higher  priority 
during  process  memory  access. 

The  following  sections  provide  general  information  about 
interfacing  the  arbiter  to  various  major  types  of  processors;  they 
are:  the  iAPX-type,  the  AmZ8000,  and  the  MC68000-type 
processors.  The  PAL  devices  for  the  three  types  of  processors 
will  be  identified  as  follows:  TIMGEN2A  and  TIMGEN2B  are  for 
the  iAPX-type;  TIMGEN3A,  TIMGEN3B  for  the  AmZ8000;  and 
TIMGEN4A  and  TIMGEN4B  are  for  the  MC68000-type.  When 
mixing  different  processor  in  an  interface,  some  of  the  control 
signals  mentioned  for  the  particular  groups  may  need  to  be 


modified  to  be  applicable  to  the  processor  under  consideration. 
The  designer  can  tailor  the  design  to  meet  specific  needs  with 
simple  modification  to  the  PAL  device  logic  equations  provided. 

Interfacing  the  iAPX-Type  Processors 

The  control  signals  required  for  TIMGEN2A  are  shown  in  Fig- 
ure 4.  These  signals  are:  CS,  CYCREQ,  FR  and  the  delay  line 
outputs  (TAP1_-TAP5).  CS  and  CYCREQ  are  provided  by  the 
processor.  FR  is  the  refresh  request  from  the  refresh  timer. 
Sources  to  this  input  can  be  provided  either  by  the  timer 
described  in  this  application  note  or  other  appropriate  sources. 
TAP1 -TAP5  are  the  timing  delay  outputs  used  to  regulate  many 
of  the  signal  generation.  The  outputs  generated  from  this  PAL 
device  are  TAPO  (RASI),  XACKA,  XACKB,  and  the  grants  for 
processor  and  refresh  requests. 

The  signals  for  TIMGEN2B  are  RD,  WR,  BHE,  and  A00.  For 
byte/word  transfers,  the  designer  should  consult  the  appropriate 
processor  data  sheet.  In  general,  this  processor  group  uses 
BHE  in  conjunction  with  A00  to  define  the  transfer  function  of  the 
data  bus.  The  outputs  from  this  PAL  device  are:  Write  Enables, 
Latch  Enables,  and  Output  Enables. 

Interfacing  the  AmZ8000 

The  control  signals  required  for  this  interface  are  shown  in  Fig- 
ure  5.  For  TIMGEN3A,  the  general  controls  are  CS  and 
CYCREQ,  FR  and  the  delay  line  outputs  (TAP_1^TAP5).  CS  and 
CYCREQ  are  generated  by  the  processor.  FR  is  the  refresh 
request  from  the  refresh  timer.  TAP1 -TAP5  are  the  delay  line 
outputs  used  to  regulate  the  signal  generation.  The  outputs  from 
this  PAL  device  are  WAITA,  WAITB,  TAPO  (RASI),  and  the 
grants  for  processor  and  refresh  requests. 

The  control  signals  for  TIMGEN3B  are  R/W,  DS,  A00  and  B/W. 
For  byte/word  transfers,  the  designer  should  consult  the  proces- 
sor data  sheet  for  correct  signal  generation.  The  outputs  from 
this  PAL  device  are:  Write  Enables,  Latch  Enables  and  Output 
Enables. 

Interfacing  the  MC68000-Type 
Processors 

The  control  signals  required  for  this  interface  are  shown  in  Fig- 
ure  6.  For  TIMGEN4A,  the  general  controls  are  CS  and 
CYCREQ,  FR  and  the  delay  line  outputs  (TAP1-TAPSr).  CS  and 
CYCREQ  are  generated  by  the  processor.  FR  is  the  refresh 
request  from  the  refresh  timer.  TAP1-TAP5  are  the  delay  line 
outputs  used  to  regulate  signal  generation.  The  outputs 
generated  from  this  PAL  device  are:  DTACKA,  DTACKB,  TAPO 
(RASI)  and  the  grants  for  processor  and  refresh  request. 

The  control  signals  for  TIMGEN4B  are  R/W,  UDS,  and  LDS.  For 
byte/word  transfers,  the  designer  should  consult  the  respective 
processor  data  sheet  for  correct  signal  generation.  For  example, 
to  obtain  either  byte  or  worrj  transfers  with  the  MC68000,  three 
signals,  UDS,  LDS  and  R/W,  must  provide  the  correct  levels  to 
guarantee  the  correct  data  transfers,  see  MC68000  Data  Bus 
Control  Table  in  the  User's  Manual.  The  outputs  generated  from 
this  PAL  device  are:  Write  Enables,  Latch  Enables,  and  Output 
Enables. 

The  programmable  logic  device  equations  for  all  three  interfaces 
are  shown  in  Figures  7,  8,  and  9. 
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Figure  4.  Interface  for  IAPX-Type  Processors 
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General-Purpose  Dual-Port  Arbiter 


TITLE  INTERFACE  TIMING  PAL  DEVICE  #1 

PATTERN  A3-249A 

REVISION  01 

AUTHOR  J .  ENGINEER 

COMPANY  ADVANCED  MICRO  DEVICES 

DATE  12/15/87 

CHIP  TIMGEN2A  PAL16L8 

/CSA  /CYREQA  /CSB  /CYREQB  TAP1  TAP 2  TAP 3  TAP 4  TAP 5  GND 
/FR      /XACKA      /XACKB  TAPO        FRH   /RFRQ   /RWRQA   / RWRQB  NC  VCC 

EQUATIONS 

/FRH  =  FR*/FRH  +  TAP0*RFRQ 

RFRQ  =  FR*FRH  * / RWRQA* / RWRQB  * / TAP  5  +  RFRQ*/TAP5 
RWRQA  =  CSA*CYREQA* /RFRQ* /RWRQB* /TAP 5  +  RWRQA* / TAP  5 
RWRQB  =  CSB *CYREQB* /RFRQ* / RWRQA* / TAP  5  +  RWRQB* /TAP5 


/TAPO  =  /RFRQ* /RWRQA* /RWRQB  +  TAP 4   ;PAL  device  for  8086,80186 

;and  80286  processors 

XACKA  =  CSA* RWRQA* /RFRQ* TAP 4 
XACKA. TRST  =  CSA*RWRQA*/RFRQ*TAP4 

XACKB  =  CSB*  RWRQB  * / RF RQ  * / RWRQA *  TAP  4 
XACKB. TRST  =  CSB*RWRQB*/RFRQ*/RWRQA*TAP4 


TITLE  INTERFACE  TIMING  PAL  DEVICE  #2 

PATTERN  A3-249B 

REVISION  01 

AUTHOR       J.  ENGINEER 

COMPANY     ADVANCED  MICRO  DEVICES 

DATE  12/15/87 

CHIP   TIMGEN2B  PAL22V10 

/CSA     /RDA  /WRA  /BHENA  A00A  /BHENB  /WRB       /RDB   /CSB  NC     TAP 3  GND 
A00B  /OEA  / OEB   /SUP     /RFRQ   / RWRQA  /RWRQB  /WEL   /WEH  LEB  LEA  VCC 
GLOBAL 

EQUATIONS   ;PAL  DEVICE  FOR  808  6  PROCESSORS 

WEH  =   /  SUP  *  RWRQA*/ RFRQ  *CS  A*  WRA*  BHENA 

+   /SUP*  RWRQB  * / RFRQ  * / RWRQA *  C S  B  * WRB  * BHENB 

WEL  =  /SUP*RWRQA*/RFRQ*CSA*WRA*/A00A 

+  /SUP*RWRQB*/RFRQ*/RWRQA*CSB*WRB*/A00B 

OEA  =  CSA*RWRQA*/RFRQ*RDA 

OEB  =  CSB*RWRQB*/RFRQ*RDB 

LEA  =  CSA*RWRQA*/RFRQ*RDA*TAP3 

LEB  =  CS B * RWRQB * /RFRQ* RDB *TAP 3 


Figure  7.  PLPL  Specification  lor  the  Example  of  Figure  4 
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General-Purpose  Dual-Port  Arbiter 


TITLE  INTERFACE  TIMING  PAL  DEVICE  #1 

PATTERN  A3-250A 

REVISION  01 

AUTHOR       J.  ENGINEER 

COMPANY     ADVANCED  MICRO  DEVICES 

DATE  12/15/87 

CHIP  TIMGEN3A  PAL18P8 

/CSA  /CYREQA  /CSB  / CYREQB  TAP1  TAP 2  TAP 3  TAP 4  TAP 5  GND 
/FR     /WAITA     /WAITB  TAPO       FRH  /RFRQ  /RWRQA  /RWRQB  NC  VCC 

EQUATIONS 

/FRH  =  FR*/FRH  +  TAP0*RFRQ 

RFRQ  =  FR*FRH*/ RWRQA* /RWRQB* /TAP 5  +  RFRQ*/TAP5 

RWRQA  =  CSA*  CYREQA  * / RFRQ  * / RWRQB * / TAP  5  +  RWRQA* /TAP 5 

RWRQB  =  CSB*CYREQB* /RFRQ* /RWRQA* /TAP 5   +  RWRQB* /TAP 5 

/TAPO  =  /RFRQ * / RWRQA* / RWRQB  +  TAP 4       ; PAL  device  for  Z8000 

; processor 

/WAITA  =  CSA*RWRQA*/RFRQ*TAP4 
WAITA. TRST  =  CSA*RWRQA*/RFRQ*TAP4 

/WAITB  =  CSB*RWRQB*/ RFRQ* /RWRQA* TAP 4 
WAITB. TRST  =  CSB* RWRQB  * / RF RQ  * / RWRQA * TAP  4 

TITLE  INTERFACE  TIMING  PAL  DEVICE  #2 

PATTERN  A3-250B 

REVISION  01 

AUTHOR  J.  ENGINEER 

COMPANY  ADVANCED  MICRO  DEVICES 

DATE  12/15/87 

CHIP  TIMGEN3B  PAL22V10 

/CSA       RWA  /DSA     BWA     A00A     BWB       /DSB         RWB  /CSB  NC     TAP 3  GND 
A00B   / OEA  / OEB  /SUP   /RFRQ  /RWRQA  / RWRQB  /WEL   /WEH  LEB  LEA  VCC 
GLOBAL 

EQUATIONS   ;PAL  DEVICE  FOR  Z8000  PROCESSOR 

WEH  =  /SUP*RWRQA*/RFRQ*CSA*/RWA*/BWA 

+  /SUP*RWRQA*/RFRQ*CSA*/RWA*BWA*AOOA 

+  /SUP  *  RWRQB  *  /  I^F  RQ  *  /  RWRQ  A*  C  S  B  *  /  RWB  *  /  BWB 

+  /SUP*RWRQB*/R:VQ*/RWRQA*CSB*/RWB*BWB*AOOB 

WEL  =  /SUP*RWRQA*/RFRQ*CSA*/RWA*/BWA 

+  /SUP*RWRQA*/RFRQ*CSA*/RWA*BWA*/AOOA 

+  /SUP*RWRQB*/RFRQ*/RWRQA*CSB*/RWB*/BWB 

+  /SUP*RWRQB*/RFRQ*/RWRQA*CSB*/RWB*BWB*/AOOB 

OEA  =  CSA*RWRQA*/RFRQ*RWA*DSA 

OEB  =  CSB  *  RWRQB*/ RFRQ  *RWB*DSB 

LEA  =  CSA* RWRQA* /RFRQ* RWA* TAP 3 

LEB  =  CSB*  RWRQB  * / RFRQ  *RWB  *  TAP  3 

Figure  8.  PLPL  Specification  for  the  Example  of  Figure  5 
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General-Purpose  Dual-Port  Arbiter 


TITLE  INTERFACE  TIMING  PAL  DEVICE  #1 

PATTERN  A3-251A 

REVISION  01 

AUTHOR  J.  ENGINEER 

COMPANY  ADVANCED  MICRO  DEVICES 

DATE  12/15/87 

CHIP  TIMGEN4A  PAL16L8 

/CSA  /CYREQA  /CSB  /CYREQB  TAP1  TAP 2  TAP 3  TAP 4  TAP 5  GND 
/FR     /DTACKA  /DTACKB  TAPO     FRH   /RFRQ   / RWRQ A  / RWRQB  NC  VCC 

EQUATIONS 

/FRH  =  FR*/FRH  +  TAPO* RFRQ 

RFRQ  ■  FR*FRH * / RWRQA* / RWRQB* / TAP 5  +  RFRQ* /TAP 5 

RWRQA  =  CSA*CYREQA* /RFRQ* /RWRQB* / TAP  5   +  RWRQA* / TAP  5 

RWRQB  =  CSB*CYREQB*/ RFRQ* /RWRQA* /TAP 5   +  RWRQB* /TAP5 

/TAPO  =  / RF RQ * / RWRQA * / RWRQB  +  TAP 4       ; PAL  device  for  68000 

/processor 

DTACKA  =  CSA*RWRQA*/RFRQ*TAP4 
DTACKA. TRST  =  CSA*RWRQA*/RFRQ*TAP4 

DTACKB  =  CSB*RWRQB*/ RFRQ* /RWRQA* TAP 4 
DTACKB. TRST  =  CSB*RWRQB*/ RFRQ* /RWRQA* TAP 4 


TITLE  INTERFACE  TIMING  PAL  DEVICE  #2 

PATTERN  A3-251B 

REVISION  01 

AUTHOR  J.  ENGINEER 

COMPANY  ADVANCED  MICRO  DEVICES 

DATE  12/15/87 

CHIP   TIMGEN4B  PAL22V10 

/CSA       RWA  /UDSA  /LDSA  NC5     /LDSB     /UDSB       RWB   /CSB  NC  TAP 3  GND 
NC10   /OEA  /OEB     /SUP   /RFRQ   /RWRQA  /RWRQB  /WEL  /WEH  LEB  LEA  VCC 
GLOBAL 

EQUATIONS   ;PAL  DEVICE  FOR  68000  PROCESSOR 

WEH  =  /SUP*RWRQA*/RFRQ*CSA*/RWA*UDSA 

+  /SUP*RWRQB*/RFRQ*/RWRQA*CSB*/RWB*UDSB 

WEL  =  /SUP*RWRQA*/RFRQ*CSA*/RWA*LDSA 

+  /SUP*RWRQB*/RFRQ*/RWRQA*CSB*/RWB*LDSB 

OEA  =  CSA*RWRQA*/RFRQ*RWA* (UDSA  +  LDSA) 

OEB  =  CSB*RWRQB*/RFRQ*RWB* (UDSB  +  LDSB) 

LEA  =  CSA*RWRQA*/RFRQ*RWA*TAP3 

LEB  -  CSB*RWRQB*/RFRQ*RWB*TAP3 


Figure  9.  PLPL  Specification  for  the  Example  of  Figure  6 
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Dynamic  Memory  Control 
State  Sequencer 


An  example  of  a  control  path  application  for  a  PAL  device  is  in  a 
memory  system.  Most  large  memory  systems  use  MOS 
dynamic  RAMs.  Their  high  density  allows  packing  a  large 
memory  size  into  a  small  board  area.  Dynamic  RAM  prices  also 
make  them  very  cost  effective. 

Dynamic  RAMs  require  external  logic  for  address  multiplexing, 
timing  generation  and  refresh  control.  This  application  note 
shows  the  use  of  a  PAL16R8A  and  an  Am2964B  to  provide  the 
necessary  external  logic  for  a  typical  dynamic  memory  system. 
The  PAL  device  is  used  as  a  state  sequencer  for  timing  genera- 
tion and  the  Am2964B  provides  specialized  control  circuitry  and 
reduces  timing  skew  between  control  signals.  This  implementa- 
tion replaces  about  20  SSI/ MS  I  packages. 


Design  Requirements 

A  system  block  diagram  is  shown  in  Figure  1 .  The  control  bus 
provides  most  of  the  inputs  to  the  PAL  state  sequencer.  These 
include:  Memory  Request  (MREQ),  READ/WRITE  (RW), 
RESET  (RST),  Refresh  Clock  (RFCK),  and  Read-Modify-Write 
(RMW).  Two  upper  address  lines  of  the  address  bus  serve  as 
board  selects  (BS-,,  BS0),  and  one  local  signal,  SLOW/FAST 
Memory  (FAST),  allows  use  of  either  slow  or  fast  memory.  A 
READ/WRITE  sequence  is  initialized  by  MREQ  ANDed  with  the 
proper  board  select  conditions  and  a  refresh  sequence  is  in- 
itialized by  RFCK.  If  both  sequences  are  requested  at  the  same 
time,  a  refresh  sequence  is  performed.  RW  when  HIGH  selects 
a  READ  operation  and  when  LOW  selects  a  WRITE  operation. 
RMW  when  HIGH  selects  a  Read-Modify-Write  cycle. 


Figure  1.  Dynamic  Memory  Controller 
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Dynamic  Memory  Control  State  Sequencer 


The  outputs  of  the  PAL  device  provide  the  timing  and  control 
inputs  to  the  Am2964B.  These  are:  Row  Address  Strobe  (RAS), 
Address  Multiplexer  Select  (MS),  Column  Address  Strobe 
(CAS),  and  Refresh  (RFSJj).  In  addition,  the  PAL  device  pro- 
vides the  Write  Enable  (WE)  to  the  Memory  Array.  Figure  1 
shows  the  timing  for  fast  READ/WRITE  cycles.  The  memory 
cycle  is  initiated  by  MREQ  going  LOW.  The  PAL  device  re- 
sponds by  bringing  RAS  LOW  at  t0,  followed  by  MS  going  LOW 
at  t, ,  and  finally  bringing  CAS  LOW  at  t2.  If  RW  is  LOW,  WE  is 


also  brought  LOW  at  t2.  WE  is  held  LOW  until  t4.  RAS,  MS  and 
CAS  are  brought  HIGH  at  t5.  The  rising  edge  of  any  of  these 
three  signals  may  be  used  to  latch  output  data  during  a  Read 
operation.  The  state  sequencer  then  disabled  for  three  states  to 
allow  for  memory  precharge. 

By  holding  the  FAST  input  LOW,  an  extended  memory  cycle  is 
available  to  accommodate  slower  RAMs.  The  timing  appears  in 
Figure  3. 
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Figure  2.  Fast  READ/WRITE  Cycle 
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Figure  3.  Extended  Memory  Cycle 
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Dynamic  Memory  Control  State  Sequencer 


RAS-Only  refresh  cycle  timing  is  shown  in  Figure  4.  The  refresh 
cycle  is  initiated  when  RFCK  goes  HIGH.  The  RFSH  output 
goes  LOW  at  to,  followed  by  RAS  at  t1 .  The  Am2964B  supplied 
the  necessary  refresh  address.  RAS  is  brought  back  HIGH  at  t5 
and  precharge  is  then  timed  out.  An  extended  refresh  cycle  for 
slower  memory  is  available  also.  Burst  refresh  can  be  accom- 
plished by  leaving  RFCK  HIGH  for  as  many  refresh  cycles  as 
desired. 


Read-Modify-Write  cycle  timing  is  activated  by  setting  RMW 
HIGH.  This  is  especially  valuable  in  systems  with  Error  Detec- 
tion/Correction (EDC)  capability.  Data  can  be  read,  modified  by 
the  EDC  circuitry  (Am2960),  and  if  necessary,  written  back  to 
memory  in  a  single  memory  cycle.  Read-Modify-Write  cycle 
timing  is  shown  in  Figure  5.  Note  that  WE  goes  LOW  at  the  end 
of  the  cycle. 
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Figure  4.  RAS-Oniy  Refresh  Cycle 
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Figure  5.  Read-Modify-Write  Cycle 
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Dynamic  Memory  Control  State  Sequencer 


Design  Approach 

The  first  step  in  the  state  sequencer  design  process  is  to  define 
the  timing  waveforms  for  all  of  the  functions  desired.  Figures  2, 
3,  4,  and  5  are  the  result.  Next,  characteristics  of  the  resulting 
waveforms  are  examined.  Initially,  the  sequencer  is  waiting  on 
the  MREQ  or  RFCK  input.  If  MREQ  goes  LOW,  the  RAS  to  MS 
to  CAS  sequence  is  initiated.  If  RFCK  goes  HIGH,  the  RFSH  to 
RAS  sequence  is  initiated.  Both  sequences  are  equivalent  to  a 
simple  "shift"  function.  Once  the  shift  sequence  is  completed 
and  the  signals  are  asserted,  they  must  stay  asserted  for  a 
specific  time  depending  on  the  selected  function.  To  time  the 


length  that  signals  must  stay  asserted  requires  a  "counting" 
function.  The  precharge  sequence  at  the  end  of  all  cycles  also 
requires  "counting".  This  partitions  most  of  the  design  into  two 
smaller  functional  blocks;  a  shifter  and  a  counter.  The  remaining 
function  select  and  control  logic  is  partitioned  into  a  "multiplexer- 
like" functional  block.  Figure  6  shows  the  PAL  device  partitioned 
into  functional  blocks.  By  dividing  the  design  into  blocks,  its 
implementation  becomes  simple. 

Figure  7  shows  PALdevice  equations  for  the  PAL1 6R8A  dynamic 
memory  state  sequencer. 
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Figure  6.  Partitioned  Design/PAL  Device  Equivalent 
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Dynamic  Memory  Control  State  Sequencer 


TITLE         DYNAMIC  MEMORY  STATE  SEQUENCER 

PATTERN  AMP3-285 

REVISION  01 

NAME  JOE  ENGINEER 

COMPANY     ADVANCED  MICRO  DEVICES 

DATE  11/25/87 

CHIP  MEMORY 1  PAL16R8 

CK  RFCK  /RST  RW  /MREQ  RMW  FAST  BS1  BSO  GND 
/E  /Q0       /Ql   /Q2  /RFSH  /WE  /CAS     MS  /RAS  VCC 

EQUATIONS 

Q0    :=  /RST*/MS*/Q0 

+  /RST*RFSH*RAS*/Q0 

+  /RST*/FAST*/Q0*Q2 

+  /RST*/FAST*/Q0*Q1 

+  /RST*/FAST*  Q1*Q2 

+  /RST*  FAST*/RMW*Q0*/Q1 

+  /RST*  FAST*/RMW*Q0*/Q2 

Ql    :=  /RST*RAS*/Q0*  Ql 

+  /RST*RAS*  Q0*/Q1 

+  /RST*RAS*   Q0*  Ql 

+  /RST*RAS*/Q0*  Q2 

Q2    :=  /RST*RAS*  Q2 
+   /RST*  Q0*Q2 
+   /RST*RAS*Q0*  Ql 

RFSH    :=  /RST*RFCK*/Q2*/Q1*/Q0*/RAS 

+  /RST*RFSH*RAS 

+  /RST* RFSH* /FAST*  Ql 

+  /RST* RFSH*  Q2 

WE   :=  /RST*/RW*/MS*/RFSH*/RMW*/Q0*/Q2 

+  /RST*/RW*/MS*/RFSH*/RMW*/Q1*/Q2 

+  /RST*/RW*/MS*/RFSH*   RMW*/Q0*  Ql  *Q2 

+  /RST*/RW*/MS*/RFSH*  RMW*  Q0*/Q1  *Q2 

CAS  :=  /RST*/RFSH*/MS*/Q0 
+  /RST*/RFSH*/MS*/Q1 
+  /RST*/RFSH*/MS*/Q2 

/MS  :=  /RST*/RFSH*RAS*/Q0 
+  /RST*/RFSH*RAS*/Q1 
+  /RST*/RFSH*RAS*/Q2 


RAS    :=  /RST*/RFCK*/Q0*/Q1*/Q2*MREQ*/BS1*/BS0 
+  /RST*/RFSH*/Q0*/Q1*/Q2*MREQ*/BS1*/BS0 
+  /RST*  RFSH*/Q0*/Q1*/Q2 
+  /RST*RAS*/Q0 
+  /RST*RAS*/Q1 
+  /RST*RAS*/Q2 


Figure  7.  Source  Listing  for  Dynamic  Memory  Control  State  Sequencer 
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8-Bit  Error  Detection 
and  Correction 


Single  bit  error  detection  and  correction  tor  an  8-bit  data  word 
requires  4  check  bits,  making  a  12-bit  code  word.  The  simplest 
code  to  design  is  a  12-bit  Hamming  code.  To  arrive  at  the  code, 
we  set  up  the  following  matrix: 

B7  B6   B5  B4   B3  B2   B1    BO     |     C3  C2  C1  CO 

S3    X  X     X  X  X 
S2    X  XXX  X 

S1  XX  XX  X  X 

SO  X  X     X  X     X  X 

The  vertical  columns  are  in  a  counting  pattern,  excluding 
the  single  bit  values  of  8,  4,  2,  1.  The  single  bit  values  are 
assigned  to  the  check  bits  C3-C0.  By  reading  horizontally 
across  the  rows  of  the  matrix,  we  get  the  check  equations 
by  exclusive  OR'ing  the  data  bits  with  X's  In  that  row  and 
equating  that  to  the  check  bit  with  an  X  in  that  row: 

C3  =  B7  ®   B6  ®  B5  ®  B4 

C2  =  B7  ®   B3  ®   B2  9  B1 

C1  =  B6®B5®B3«B2$B0 

CO  =  B6  9   B4  ®   B3  9   B1  B  BO 


The  Hamming  matrix  and  resultant  sets  of  check  bit  and 
syndrome  equations  are  selected  so  that  when  a  single  bit  error 
occurs,  the  syndrome  gives  the  position  of  that  bit  (either  data 
or  check)  in  the  12-bit  message  format. 


Example 


B7 


BO 


random  data  word: 

1 

1 

0  1 

1 

1 

0 

0 

check  bits: 

C3  =   B7  ® 

B6 

9 

B5 

$  B4 

=    1  & 

1 

0 

$  1 

1 

C2   =    B7  ® 

B3 

§ 

B2 

H  B1 

=     1  9 

1 

e 

1 

$  0 

1 

C1    =    B6  ® 

B5 

9 

B3 

$  B2 

9 

BO 

=     1  ® 

0 

9 

1 

9  1 

9 

0 

1 

CO   =    B6  ® 

B4 

9 

B3 

$  B1 

9 

BO 

■     1  • 

1 

9 

1 

$  0 

9 

0 

1 

message: 

M12  M11  M10 

M9 

M8 

M7 

M6 

M5 

M4 

M3 

M2 

Ml 

1      1  0 

1 

1 

1 

1 

0 

1 

0 

1 

1 

B7    B6  B5 

B4 

C3 

B3 

B2 

B1 

C2 

BO 

C1 

CO 

The  check  bits  are  stored  along  with  the  data  bits  in  the 
following  message  format: 

MJ2Mn^M9M8M7^M5M4f^^MJ 
B7    B6    B5    B4    C3    B3    B2    B1    C2    BO    CI  CO 


When  a  read  occurs,  the  check  bits  are  recalculated  and 
compared  with  the  stored  bits  to  generate  the  4  bit  syndrome: 

S3  =  B7®B6®B5S>B4®C3 

S2  =    B7  ®   B3  ®  B2  9   B1   ®  C2 

S1  -B6<BB5®B3®B2®B0®C1 

S0  =  B6®B4®B3<SB13SB0®C0 


The  4  syedrome  bits  indicate  the  location  of  any  single  bit  errors 
in  the  12-bit  message  format  which  may  than  be  corrected  by 
inversion. 


The  Hamming  code  works  by  introducing  enough  other  code 
words  to  create  a  difference  of  exactly  3  bits  between  legal  code 
words.  All  other  code  words  are  illegal.  If,  in  storage,  one  bit 
flips,  the  result  is  an  illegal  word.  In  addition,  there  is  only  one 
word  in  the  set  of  legal  code  words  from  which  it  could  have 
come,  hence  the  correction. 


EDAC  System  Block  Diagram 


4 -BIT 
4a_ SYNDROME 
7  FOR  ERROR 

LOGGING 


8-Bit  Error  Detection  and  Correction 


C3 

1  =  0 
C2 

1  =  0 

BO  ®  C1 

0  ®  1 

bo  e  co 

0  ®  1 


=  1 
=  1 

s  1  =  o 
©1    =  0 


assume  no  error: 

S3   =    B7  ®   B6  ®  B5  ®  B4 

=    1    9    1  9  0  $  1 

12   =   B7  ®  B3  S  B2  ®  B1 

=    1    8)    1  85  1    85  0 

51  =    B6  9  B5  85  B3  95  B2 

B      1      $      0      85     1      $  1 

50  =   B6  85  B4  85  B3  8>  B1 
=    1*1     «    1    m  0 

syndrome  0000  indicates  no  error 
assume  data  bit  B7  flips  (1  -  0): 

S3    =     0    85    1    45    0    85  1 

52  =    0   95    1    45    1    85  0 

51  =     1    ®    0    85    1    85  1 

50  =     1    95    1    45    1    95  0 
syndrome  1100  indicates  M12  or  B7  is  in  error 
assume  check  bit  C3  flips  (1  -  0): 

S3=191»0«18J=1 

52  =1951851950®1=0 

51  =1850851851950951  =0 
SO  =  1851*1  8508)0*1  =  0 
syndrome  1000  indicates  M8  or  C3  is  in  error 

While  the  check  bits  can  be  generated  with  a  256x4  PROM  if 
the  tolerances  are  loose,  high  performance  systems  will  need 
to  latch  or  register  the  check  bits  to  meet  cycle  time  require- 
ments. Similarly,  the  syndrome  bits  can  be  generated  with  a 
4096x4  PROM  but  in  both  cases  the  PAL16X4  is  the  high 
performance  choice. 

The  worst  case  equations  are  S1  and  SO  with  a  6  term  exclusive 
OR.  We  use  2  properties  of  the  exclusive  OR  to  fit  the  equations 
into  the  16X4: 

1.  Associativity 

A95B9)C  =  (A95B)95C 

2.  A  85  B  45  C  =  ABC  +  ABC  +  ABC  +  ABC 

Since  a  3  term  exclusive  OR  can  be  realized  with  a  4  product 
sum  in  sum  of  products  form,  a  6  term  exclusive  OR  can  be 
realized  by  exclusive  OR  of  2  4  product  sums.  This  is  exactly 
the  16X4  configuration. 

Check  Bit  Equations 

C3  =  B7  958695  B5  45B4 

=  (B7  •  B6  +  B7  •  B6)  85  (B5  ■  B4  +  B5  •  B4) 

C2  =  B7  85  B3  85  B2  95  B1 

=  (B7  •  B3  +  §7  •  B3)  85  (B2  ■  B1  +  B2  •  B1) 

C1  =  B6  85  B5  ®  B3  85  B2  85  B0 

=  (B6  •  B5  •  B3  +  B6  •  §5  •  B3  +  B6  •  B5  •  §3  +  B6  •  B5  •  B3) 
85  (B2  •  B0  +  §2  •  B0  ) 

C0=B6  45B4  85  B3  85  B1  85  B0 

=  (B6  •  B4  •  B3  +  B6  •  B4  •  B3  +  86  •  B4  •  B3  +  B6  •  B4  •  B3) 
85(B1  •  B0  +  B1  •  B0) 


Syndrome  Bit  Equations 

S3=  B7  85  B6  85  B5  85  B4  85  C3 

-  (B7  •  B6  •  B5  +  §7  •  B6  •  B5  +  §7  •  B6  •  B5  +  B7  ■  B6-  B5) 
95  (B4  •  C3  +  §4  ■  C3) 

S2  =  B7  85  B3  85  B2  85  B1  85  C2 

=  (B7  •  B3  •  B2  +  B7  •  B3  •  B2  +  §7  •  B3  ■  B2  +  B7  •  B3  •  B2) 
85  (B1  •  C2  +  §T  ■  C2) 

S1  =  B6  85  B5  85  B3  85  B2  85  B0  85  C1 

=  (B6  •  B5  •  B3  +  §6  •  B5  •  B3  +  B6  •  B5  •  B3  +  B6  ■  B5  •  §3) 
85  (§2  •  B0  •  C1  +  B2  •  BO  •  C1  +  B2  ■  B0  •  CI  +  B2  ■  B0) 

SO  =  B6  ®  B4  83  B3  85  B1  ®  B0  85  CO 

=  (B6  •  B4  •  B3  +  B6  •  B4  ■  B3  +  §6  •  B4  •  §3  +  B6  •  B4  •  B3) 
85  (B1  •  B0  ■  CO  +  B1  •  B0  •  CO  +  B1  •  B0  •  CO  +  B1  ■  §0  •  CO) 


The  error  correction  block  decodes  the  4  syndrome  bits,  and,  if 
they  are  not  0000,  inverts  the  indicated  bit  in  the  message 
format.  The  equations: 


M12(=  B7C)  =  S3-S2 
=  S3-S2 
-S3-S2 

•  B7  +  S0 
M11  (=  B6C)  =  S3-S2 

•  B6-SO 
M10(=  B5C)  =  S3-S2 

•  B5+S0 
M9  (=B4C)  =  S3-S2 

•  B4+S0 
M8  (=C3C)  =  S3-S2 

•C3+S0 
M7  (=B3C)  =  S3-S2 

•  B3+S0 
M6  (=B2C)  =  S3-S2 

■  B2+S0- 
M5  (=B1C)  =  S3-S2 

•  B1  +  S5 
M4  (=C2C)  =  S3-S2 

•  C2+S0 
M3  (=B0C)  =  S3-S2 

•  BO+SO 
M2  (=C1C)  =S3-S2 

•  C1  +  S0 
M1  (=  COC)  =  S3  ■  S2 

•CO-SO 
ERROR       =S3  -S2 


S1 


S5®B7   

SO -B7+(S3-S2  •  SI -§5)  -B7 
SO  •  B7+  S3-B7+S2-B7  +  S1 

SO  •  B6  •  S3  •  B6  •  S2  •  B6  +  SI 

SO  •  B5+  S3-B5  +  S2-B5  +S1 

SO  •  B4+  S3-B4+S2-B4  +S1 

SO  •  C3+  S3-C3+S2-C3  +  S1 

SO  ■  B3+  S3-B3+S2-B3  +S1 

SO  •  B2+  S3-B2  +  S2-B2  +ST 

SO  •  B1  +  S3  •  B1  +  S2  •  B1  +  S1 

SO  -C2+  S3-C2  +  S2-C2  +  S1 

SO  -B0+  S3-B0  +  S2-B0  +S1 

S0-C1+  S3-C1  +  S2.-C1  +Si 

S0-C0+  S3-C0+S2-C0 +S1 


SO 


ERROR  is  an  active  high  error  indicator  available  for  error 
logging  along  with  the  4  syndrome  bits. 

To  use  2  PAL16L8's  for  the  error  correction  block,  we  need 
only  invert  the  message  bits  to  get  active  true  outputs. 
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8-Bit  Error  Detection  and  Correction 


Description 

This  PAL  device  generates  4  check  bits  in  a  12  bit  hamming  code  word  to  provide  error  detection 
and  correction  on  an  8  bit  data  word. 


PAL16X4 


8-Bit  Error  Detection  and  Correction 


Description 


This  PAL  device  generates  the  syndrome  bits  for  a  12  bit  hamming  code  word  as  a  function  of  the 
8  data  bits  and  the  4  check  bits  to  point  to  any  single  bit  in  error. 
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8-Bit  Error  Detection  and  Correction 

Description 

This  PAL  device  performs  error  correction  of  bits  B2-B7  based  on  the  4  bit  error  syndrome  S0-S3. 


PAL16L8 


WlMornaHesa 
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8-Bit  Error  Detection  and  Correction 


Description 

This  PAL  device  performs  error  correction  of  bits  B0-B1  and  checks  bits  C0-C3  based  on  the  4  bit 
error  syndrome  S0-S3. 


PAL16L8 


CORRECTED  BITS 


CORRECTED 


8-Bit  Error  Detection  and  Correction 


PAL16X4 
CBG 

CHECK  BIT  GENERATOR 

mi  FIELD  APPLICATIONS  ENGINEER  YORBA  LINDA,  CALIFORNIA 

CHKCLK  B7  B6  NC  NC  NC  NC  B5  B4  GND 

/OC  B3  B2  /CO  /CI  /C2  /C3  Bl  BO  VCC 


PAL  DESIGN  SPECIFICATION 
B.  BRAFMAN  02/16/81 


CJ 

Bl  w/BO 

■  t**T    •  x  •  n<; 
;  o  1   ;  + 1  dd 

+ 

/B7*  B6 

;      :  +  : 

+ 

GND 

;DO  NOT  BLOW  THIS  PRODUCT  LINE 

+ 

GND 

zDO  NOT  BLOW  THIS  PRODUCT  LINE 

:  +  : 

B5*/B4 

;B5  :+:  B4 

+ 

/B5*  B4 

C2 

B7*/B3 

;B7  :+:  B3 

+ 

/B7*  B3 

»      :  +  : 

+ 

GND 

;DO  NOT  BLOW  THIS  PRODUCT  LINE 

+ 

GND 

j DO  NOT  BLOW  THIS  PRODUCT  LINE 

:  +  : 

B2*/B1 

;B2  :  +  :  Bl 

+ 

/B2*  Bl 

CI 

B6*  B5*  B3 

;B6  :+:  B5  :+:  B3 

+ 

/B6*/B5*  B3 

+ 

/Bo"  B5"/B3 

+ 

B6*/B5*/B3 

B2*/B0 

;B2   :+;  BO 

+ 

/B2*  BO 

CO 

B6*  B4*  B3 

;B6  :+:  B4  :+:  B3 

+ 

/B6*/B4*  B3 

+ 

/B6*  B4*/B3 

;       :  +  : 

+ 

B6*/B4*/B3 

:  +  s 

B1*/B0 

sBl  :+:  BO 

+ 

/Bl*  BO 

;FJ?J  Memories  £1 


8-Bit  Error  Detection  and  Correction 


PAL16X4 
SBG 

SYNDROME  BIT  GENERATOR 

MMI  FIELD  APPLICATIONS  ENGINEER  YORBA  LINDA,  CALIFORNIA 
SYNCLK  D7  D6  BO  Bl  B2  B3  D5  D4  GND 
/OC  D3  D2  A3  A2  Al  AO  Dl  DO  VCC 


PAL  DESIGN  SPECIFICATION 
B.  BRAFMAN  03/13/81 


IN  THE  ABOVE  PIN  LIST,    THE  FOLLOWING  SUBSTITUTIONS  HAVE  BEEN 
MADE  TO  ACCOMODATE  THE  SPECIFIC  FORMAT    (FIXED  SYMBOLS)    FOR  THE 
ARITHEMETIC  PAL  DEVICES  IN  PALASM  SOFTWARE  : 


D7  MEANS  B7 
D6  MEANS  B6 
D5  MEANS  B5 
D4  MEANS  B4 
D3  MEANS  B3 
D2  MEANS  B2 
Dl  MEANS  Bl 
DO  MEANS  BO 


BO  MEANS  /C3 
Bl  MEANS  /C2 
B2  MEANS  /CI 
B3  MEANS  /CO 
AO  MEANS  /S3 
Al  MEANS  /S2 
A2  MEANS  /SI 
A3  MEANS  /SO 


(CHECK  BIT  3) 
(CHECK  BIT  2) 
(CHECK  BIT  1) 
(CHECK  BIT  0) 
(SYNDROME  BIT  3) 
(SYNDROME  BIT  2) 
(SYNDROME  BIT  1) 
(SYNDROME  BIT  0) 


B0-B7  ARE  THE  BITS  OF  THE  DATA  WORD. 

THE  SUBSTITUTIONS  APPLY  BELOW  WITH  THE  EXCEPTION  OF  COMMENTS. 


T 


o  t- 
+ 
+ 
+ 

:  +  : 
+ 


/Al 


/A2 


/A3 


+ 
+ 
+ 

:  +  : 
+ 


+ 
+ 
+ 
:  +  : 
+ 
+ 
+ 


D7*  D6*  D5 
/D7*/D6*  D5 
/D7*  D6*/D5 

D7*/D6*/D5 

D4* (/BO) 
/D4*(  BO) 

D7*  D3*  D2 
/D7*/D3*  D2 
/D7*  D3*/D2 

D7*/D3*/D2 

Dl*  (/Bl) 
/Dl*(  Bl) 

D6*  D5*  D3 
/D6*/D5*  D3 
/D6*  D5*/D3 

D6*/D5*/D3 

D2*  D0*(  B2) 
/D2*/D0*(  B2) 
/D2*  D0*(/B2) 

D2*/D0* (/B2) 

D6*  D4*  D3 
/D6*/D4*  D3 
/D6*  D4*/D3 

D6*/D4*/D3 

Dl*  D0*(  B3) 
/D1*/D0* (  B3) 
/Dl*  D0*(/B3) 

D1*/D0* (/B3) 


;B7  :+:  B6  :+:  B5 

;B4  :+:  C3 

»B7  :+:  B3  :+:  B2 

;  :  +  : 

fBl  :+:  C2 

;B6  :+:  B5  :+:  B3 
:  +  : 

?B2  :+:  BO  :+:  CI 

;B6  :+:  B4  :+:  B3 

;  :  +  : 

;B1  :+:  BO  :+:  CO 
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8-Bit  Error  Detection  and  Correction 


TITLE  ERROR  CORRECTION  UNIT  NO.  1 

PATTERN  ECU1 

REVISION  01 

AUTHOR       B.  BRAFMAN 

COMPANY     MONOLITHIC  MEMORIES,  INC. 
DATE  11/06/87 

CHIP  ECU1  PAL16L8 

/S3  /S2  /SI  /SO  B7     B6     B5     B4     B3  GND 
B2     NC     NC  B2C  B3C  B4C  B5C  B6C  B7C  VCC 


EQUATIONS 


/B7C  =     S3*  S2*/S1*/S0*  B7 
+  /S3*/B7 


/S2*/B7 
Sl*/B7 
S0*/B7 


/B6C  =     S3*/S2*   SI*   SO*  B6 
+  /S3*/B6 
+  S2*/B6 
+  /Sl*/B6 
+  /S0*/B6 


CORRECTION  OF  B7 
NO  CORRECTION 
NO  CORRECTION 
NO  CORRECTION 
NO  CORRECTION 


; CORRECTION  OF 
;NO  CORRECTION 
;NO  CORRECTION 
;NO  CORRECTION 
;NO  CORRECTION 


B6 


/B5C  =     S3*/S2*   S1*/S0*  B5 
+  /S3*/B5 
+  S2*/B5 
+  /Sl*/B5 
+  S0*/B5 


CORRECTION  OF 
NO  CORRECTION 
NO  CORRECTION 
NO  CORRECTION 
NO  CORRECTION 


B5 


/B4C  =     S3*/S2*/S1*  SO*  B4 
+  /S3*/B4 
+  S2*/B4 
+  Sl*/B4 
+  /S0*/B4 

/B3C  =  /S3*  S2*  SI*  SO*  B3 
+  S3*/B3 
+  /S2*/B3 
+  /Sl*/B3 
+  /S0*/B3 


CORRECTION  OF  B4 
NO  CORRECTION 
NO  CORRECTION 
NO  CORRECTION 
NO  CORRECTION 

CORRECTION  OF  B3 
NO  CORRECTION 
NO  CORRECTION 
NO  CORRECTION 
NO  CORRECTION 


/B2C  =  /S3*   S2*   S1*/S0*  B2 
+  S3*/B2 
+  /S2*/B2 
+  /Sl*/B2 
+  S0*/B2 


CORRECTION  OF 
NO  CORRECTION 
NO  CORRECTION 
NO  CORRECTION 
NO  CORRECTION 


B2 


SIMULATION  NOT  INCLUDED 


&MonomMcM  Memories  & 
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TITLE 
PATTERN 


8-Bit  Error  Detection  and  Correction 


ERROR  CORRECTION  UNIT  NO. 
ECU2 


INC. 


REVISION  01 

AUTHOR        B.  BRAFMAN 

COMPANY     MONOLITHIC  MEMORIES, 

DATE  11/06/87 

CHIP  ECU2  PAL16L8 


/S3  /S2  /SI  /SO  Bl  BO  /C3  /C2  /CI  GND 
/CO  NC     ERROR  COC  C1C  C2C  C3C  BOC  B1C  VCC 


EQUATIONS 

/B1C  =  /S3*  S2*/S1*  SO*  Bl 
+  S3*/B1 
+  /S'2*/B1 
+  S1*/B1 

/BOC  =  /S3*/S2*  SI*  SO*  BO 
+  S3*/B0 
+  S2*/B0 
+  /S1*/B0 
+  /S0*/B0 


/C3C  =     S3*/S2*/S1*/S0*  C3 
+  /S3*/C3 
+  S2*/C3 
+  Sl*/C3 
+  S0*/C3 

/C2C  =  /S3*  S2*/S1*/S0*  C2 
+  S3*/C2 
+  /S2*/C2 
+  Sl*/C2 
+  S0*/C2 

/C1C  =  /S3*/S2*  S1*/S0*  CI 
+  S3*/C1 
+  S2*/C1 
+  /S1*/C1 
+  S0*/C1 

/COC  =  /S3*/S2*/S1*  SO*  CO 
+  S3*/C0 
+  S2*/C0 
+  S1*/C0 
+  /S0*/C0 

/ERROR  =  /S3*/S2*/S1*/S0 

SIMULATION  NOT  INCLUDED 


; CORRECTION  OF  Bl 
;NO  CORRECTION 
;NO  CORRECTION 
;NO  CORRECTION 

; CORRECTION  OF  BO 
;NO  CORRECTION 
;NO  CORRECTION 
;NO  CORRECTION 
;NO  CORRECTION 

CORRECTION  OF  C3 
NO  CORRECTION 
NO  CORRECTION 
NO  CORRECTION 
NO  CORRECTION 

CORRECTION  OF  C2 
NO  CORRECTION 
NO  CORRECTION 
NO  CORRECTION 
NO  CORRECTION 

CORRECTION  OF  CI 
NO  CORRECTION 
NO  CORRECTION 
NO  CORRECTION 
NO  CORRECTION 

CORRECTION  OF  CO 
NO  CORRECTION 
NO  CORRECTION 
NO  CORRECTION 
NO  CORRECTION 

;NO  ERROR! 
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Fuse  Programmable  Controller 
Simplifies  Cache  Design 


Introduction 

The  growing  sophistication  of  computer  systems  has  dictated  a 
corresponding  increase  in  memory  storage  capacity.  Unfortu- 
nately, the  speed  of  these  large  memories  has  not  kept  pace  with 
that  of  their  associated  processors.  Fast  memories  are  still 
expensive;  in  order  to  reduce  this  cost,  a  cache  memory  system 
can  be  used  to  provide  information  to  the  processor  without  sac- 
rificing speed.  Located  between  the  central  processing  unit  and 
the  main  memory  a  cache  memory  stores  only  the  blocks  of 
memory  currently  in  use  by  a  program.  This  provides  fast  access 
to  program  and  data  resulting  in  higher  system  performance. 
Once  a  new  memory  location  is  desired,  blocks  of  cache  and  main 
memory  are  swapped  to  update  the  cache  with  the  memory 
blocks  currently  in  use.  Thus  a  cache  memory  can  dramatically 
increase  the  performance  of  the  entire  computer  system  while 
maintaining  minimal  costs.  A  cache  system  with  a  35  ns  access 
time  and  a  50  ns  cycle  time  is  described  below. 

The  use  of  a  fuse-programmable  controller  (FPC),  Am29PL141, 
simplifies  the  design  of  such  a  cache  memory  system.  The  simple 
internal  architecture  and  instruction  set  of  the  FPC  make  this 
controller  easy  to  implement  in  systems,  and  an  on-chip  micro- 
programmablef  use  array  provides  a  high  level  of  integration.  The 
cache  controller  algorithm  is  prog  ram  med  and  stored  in  this  array. 
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Because  the  FPC  operates  at  20  MHz,  as  opposed  to  the  1 0  MHz 
clock  frequency  of  other  currently  available  controllers,  it  is 
especially  well  suited  for  high-performance  applications. 

Cache  Memory  Systems 

Almost  any  system  that  requires  a  high-speed  memory  interface 
can  benefit  from  the  use  of  a  cache-memory  system.  The  system 
bottleneck  is  typically  the  memory  access  time.  Large  dynamic 
RAMs  that  are  currently  available  have  access  times  of  1 00-1 80 
ns  with  cycle  times  of  about  180-400  ns.  The  smaller,  faster 
memory  devices  that  can  be  used  for  a  cache  have  access  times 
of  25-30  ns  and  cycle  times  of  around  40-50  ns.  In  order  to 
achieve  these  lower  access  times  and  still  have  the  memory 
capacity  of  the  larger  DRAM  memories,  a  cache  memory  hierar- 
chy is  used. 

Any  application  can  make  effective  use  of  caches  provided  that 
the  concept  of  "locality  of  reference"  is  operative.  This  concept 
states  that,  at  any  particular  time,  the  addressing  pattern  of  a 
program  tends  to  be  localized  within  a  block  or  area  of  the  total 
available  address  space.  In  other  words,  if  data  at  a  given 
address  is  accessed,  it  is  likely  that  the  next  memory  access  will 
also  be  in  that  same  block  of  addresses.  In  general,  for  systems 
where  this  concept  does  not  apply,  the  use  of  a  cache  system 
would  be  difficult. 
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Figure  1.  Block  Diagram  of  Cache  Memory  System 
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Sectionsof  main  memory  datathat can  be  identified  asfrequently 
used  can  be  stored  in  cache  memories  to  allow  decreased  access 
times,  thereby  increasing  the  overall  performance  of  the  system. 
As  different  sections  of  a  program  become  frequently  used,  they 
can  be  transferred  to  the  cache  memory,  displacing  other  sec- 
tions that  are  no  longer  needed.  Some  of  the  application  areas 
where  caches  are  used  to  increase  the  system  performance  are 
for  mini/micro-computers,  signal  processing  and  image  process- 
ing systems. 

The  main  components  of  the  cache  memory  system  are  the 
cache  data  memory,  the  cache  tag  buffer,  the  replacement  logic, 
and  the  cache  controller  (see  Figure  1 ). 

Cache  Data  Memory 


When  a  tag  search  is  performed  in  the  tag  buffer,  either  a  hit  or 
miss  condition  results.  A  hit  indicates  that  the  desired  word  is  in 
the  cache;  a  miss  indicates  that  the  desired  word  must  be  ac- 
cessed from  the  main  memory  or  atransferfrom  main  memory  to 
cache  is  needed.  The  performance  of  the  cache  system  is 
dependent  on  the  ratio  of  the  number  of  hits  to  the  total  number 
of  memory  references  made,  which  is  called  the  hit  ratio.  The  hit 
ratio  depends  on  the  size  of  the  cache  memory  selected  and  the 
application  of  the  system  using  the  cache  memory.  Ahigh  hit  ratio 
implies  that  most  of  the  data  accessed  by  the  CPU  was  in  the  fast 
data-cache  memory  when  requested.  The  performance  of  the 
entire  system  gets  degraded  if  a  poor  hit  ratio  occurs.  This  is  why 
"locality  of  reference"  is  important  for  performance. 

Replacement  Algorithms 


The  cache  data  memory  is  the  small,  fast  memory  that  contains 
the  most  often  used  data,  copied  from  the  slower  main  memory. 
The  cache  data  memory  size  directly  affects  the  performance  of 
the  entire  system.  The  larger  the  cache  data  memory,  the  more 
data  that  can  be  accessed  at  the  faster  speeds.  However,  if  the 
size  of  the  memory  device  gets  very  large,  the  performance  of  the 
memory  decreases.  The  larger  the  memory  devices  are,  the 
longer  are  their  access  times,  and,  consequently,  their  cycle 
times.  The  optimum  data  cache  size  will  vary  depending  on  the 
application/algorithm  being  executed. 

Tag  Buffer 

An  identifier  (or  tag)  accompanies  every  block  (a  contiguous  set 
of  data  words)  that  can  be  transferred  to  or  from  main  memory. 
This  tag  is  stored  in  the  tag  buffer.  This  tag  identifies  where  the 
data  is  located  in  main  memory.  To  determine  whether  a  block  of 
data  is  already  in  the  cache  memory,  the  tags  stored  in  the  tag 
buffer  are  compared  to  the  desired  tag  (Figure  2).  Various 
methods  exist  for  searching  the  tag  buffer  for  a  block  of  data. 
Three  of  the  most  popular  approaches  are: 


1.  Fully  associative 

2.  Direct  mapped 

3.  Set-associative 

In  fully  associative  caches,  any  memory  word  can  be  found  in  any 
of  the  cache-data  blocks.  The  memory  address  is  divided  into  two 
sections:  the  TAG  and  the  BLOCK  fields.  The  TAG  is  used  to 
determine  if  the  block  of  data  is  in  the  cache  and  the  BLOCK 
indicates  a  data  word  within  that  block. 

For  a  direct-mapped  cache  the  memory  word  can  be  found  in  only 
one  of  the  cache  blocks.  The  memory  address  is  sectioned  into 
three  fields:  the  TAG,  the  INDEX,  and  the  BLOCK.  The  BLOCK 
indicates  the  data  word  in  the  block.  The  INDEX  selects  the 
program  block  that  contains  the  word,  and  the  TAG  is  used  to 
determine  if  the  block  is  in  the  cache. 

The  third  way  of  searching  the  cache  for  data  words  is  the  set- 
associative  method.  This  divides  blocks  into  sets  and  allows  the 
cache  block  to  be  located  in  any  one  of  these  sets.  Typically,  the 
number  of  sets  used  are  two,  four,  or  eight.  The  TAG,  INDEX  and 
the  BLOCK  fields  are  similar  to  those  in  the  direct  mapped 
techniques.  Figure  2  shows  the  three  different  address  tech- 
niques and  their  fields. 


The  replacement  algorithm  is  used  to  decide  the  cache  location 
to  replace  with  new  data  when  a  miss  occurs.  Three  of  the  most 
common  data  replacement  algorithms  currently  in  use  are: 

1.  First-in  First-out  (FIFO) 

2.  Least  Recently  Used  (LRU) 

3.  Random  Replacement  (RR). 

The  choice  between  these  three  techniques  is  dependent  upon 
the  nature  of  the  data  being  transferred  to  and  from  the  cache. 
The  FIFO  algorithm  simply  selects  the  oldest  block  of  data  written 
into  the  cache  to  be  replaced.  After  the  cache  buffer  is  full,  data 
blocks  are  replaced  sequentially.  In  this  manner,  even  a  data 
block  which  is  used  very  often,  will  be  replaced. 

The  LRU  technique  selects  the  data  block  to  be  replaced  by 
determining  which  block  in  the  cache  was  least  used  as  com- 
pared to  the  other  blocks  at  that  time.  With  this  strategy,  the 
chances  that  the  block  just  replaced  in  the  cache  will  be  required 
again  soon  is  assumed  to  be  relatively  small. 
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Figure  2.  Cache  Organizations 
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The  third  method,  the  random  replacement  approach,  randomly 
chooses  the  block  of  data  to  be  replaced.  The  easiest  to 
implement,  this  technique  is  optimum  for  some  applications. 

Another  important  design  consideration  for  cache  memory  sys- 
tems is  the  method  used  for  keeping  the  cache  data  and  main 
memory  updated.  When  writing  to  the  cache  memory,  the  data 
change  must  be  echoed  to  the  main  memory  before  that  data 
word  is  replaced  by  new  data  in  the  cache  block.  One  way  this  is 
handled  is  the  "store-through"  method.  On  a  write  command,  the 
data  is  written  to  the  cache  as  well  as  the  main  memory  simulta- 
neously. In  this  way,  the  main  memory  data  is  kept  current.  This 
also  saves  the  task  of  writing  back  old  cache  data  into  main 
memory  when  replacing  a  new  data  block  into  cache. 


Cache  Controller 

All  of  the  signals  that  are  required  to  keep  track  of  the  various 
functional  blocks  involved  in  the  cache  memory  system  are 
controlled  by  the  cache  controller.  This  controller  monitors  the 
state  of  the  tag  buffer  search,  the  CPU,  main  memory  and  cache 
data  memory  condition,  as  well  as  handling  all  memory  transfers 
for  hits  or  misses  and  read  and  write  operations.  The  high- 
performance  and  integrated  functions  available  on  a  single  chip 
make  the  design  of  a  cache  controller  using  the  Am29PL141 
simple.  The  FPC  provides  1 6  control  signals  for  off-chip  control 
and  seven  input  test  pins  for  monitoring  external  conditions.  The 
flexible  instruction  set  permits  easy  microprogramming  of  the 
desired  function. 
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The  Am29PL141  Fuse-Programmable  Controller 

The  Am29PL141  is  a  high-performance,  single-chip,  fuse- 
programmable  controller  (FPC).  This  chip  is  designed  to  allow 
implementation  of  complex  state  machines  and  controllers  by 
programming  the  appropriate  sequence  of  microinstructions  in 
the  on-chip  microprogram  memory.  With  its  intelligent  micropro- 
gram address  sequencer,  high-speed  64  x  32  bit  microprogram 
memory,  pipeline  register,  and  an  on-chip  diagnostics  register, 
th  s  chip  offersthe  benefits  of  low  chip  count,  fast  operation,  easy 
development,  and  testability.  This  device  is  available  with  a  20 
MHz  clock  rate  (50  ns  cycle  time)  in  a  28  pin  dual-in-line  package. 
A  microprogram  address  sequencer  is  the  heart  of  the  FPC.  The 


Am29PL141  has  29  high-level  microinstructions  which  include 
jumps,  loops,  subroutine  calls,  and  muttiway  branching.  These 
microinstructions  can  be  conditionally  executed  based  on  the  test 
inputs.  As  a  single  chip  solution,  the  FPC  is  designed  to  ease  the 
implementation  of  distributed  microprogrammed  systems.  It  can 
be  used  to  off-load  the  central  controller  by  serving  as  an 
intelligent  distributed  controller  for  various  functional  units.  Its  on- 
chip  diagnostics  register  is  used  to  control  and  observe  the 
parallel  pipeline  register  during  diagnostics  mode.  This  capability 
provides  both  controllability  and  observability  for  the  functional 
blocks  connected  to  the  FPC  control  outputs.  A  block  diagram  of 
the  FPC  is  shown  in  Figure  3  and  the  micro-instruction  format  is 
shown  in  Figure  4. 


31       30  26       25       24        22  21  16 
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OE  =  SYNCHRONOUS  OUTPUT  ENABLE  FOR  P[15:8] 
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THE  SPECIAL  TWO  DATA  FIELD  MICROINSTRUCTION  FORMAT  IS  SHOWN  BELOW: 
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Figure  4.  Microinstruction  Format 
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System  Overview 

The  cache  scheme  selected  for  implementation  is  a  set-associa- 
tive cache  with  four  sets  (see  Figure  5).  The  18-bit  physical 
memory  address  word  is  divided  into  8  bits  of  tag,  8  bits  of  index, 
and  2  bits  of  block  address.   Four  256-byte  deep,  index-ad- 
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Figure  5.  A  Set-Associative  Cache  Scheme 


dressed  tag  buffers  are  used  in  this  scheme.  These  tag  buffers 
contain  four  possible  sets  of  tags  for  the  same  index  location. 
Corresponding  to  each  tag  buffer  there  are  four  1 6-bit  wide  cache 
memories,  which  contain  four  (block)  words  for  each  index 
location.  The  physical  block  address  is  also  used  to  address 
these  cache  memories. 

Figure  6  shows  the  actual  implementation  where  the  main  system 
controller  is  the  Am29PI_1 41  FPC.  There  is  additional  glue  logic 
in  the  form  of  one  PAL1 8P8  and  one  PAL1 6R6.  The  PAL1 8P8  is 
used  in  the  cache  access  path  and  is  very  fast.  It  generates  the 
cache  output  enable,  cache  selects,  and  write  signals  for  the 
cache.  The  PAL.1 6R6  is  used  to  generate  the  Ready  signal  to  the 
system  CPU.  It  is  also  used  to  inform  the  FPC  of  the  status  of 
read/write  cycles.  All  the  PAL  devices  are  controlled  by  the  FPC 
for  generation  of  the  system  control  signals. 

System  Operation 

There  are  four  main  system  cycles  possible:  read  and  hit,  read 
and  miss,  write  and  hit  and  write  and  miss.  The  response  of  the 
controller  is  different  for  each  system  operation  cycle.  For  read 
&  hit  cycle  the  FPC  performs  no  action.  Only  on  detection  of  write 
cycle,  or  read  &  miss  cycle,  the  PAL1 6R6  asserts  the  CYCLsignal 
initiating  FPC  action. 

Read  and  Hit  Cycle 

When  the  system  CPU  requests  a  read  cycle,  it  sends  out  an  1 8- 
bit  address.  The  most  significant  eight  bits  (Figure  6) 
TAGADDO— 7  are  the  eight  bit  tag  which  is  compared  to  the  tags 
stored  in  the  four  tag  buffers  to  detect  a  hit  or  a  miss.  The  next 
eight  bits  IDXADDO-7  for  index,  address  the  correct  tag  buffer 
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Figure  6.  Control  Section  for  Cache  System  uses  one  FPC  and  two  PAL  Devices. 
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locations.  These  eight  bits  are  also  used  to  access  the  four  cache 
sets  simultaneously.  The  least  significant  BLOCKO-1  block 
address  are  directly  used  to  address  the  four  cache  sets.  A 
cache  CACS  (chip  select)  is  generated  by  the  PAL1 8P8  to  access 
the  four  cache  sets  simultaneously.  Once  the  tag  buffer  match 
occurs,  the  selected  tag  buffers  asserts  the  hit  signal.  The  hit 
signal  is  used  to  enable  the  outputs  of  the  corresponding  cache 
set,  in  order  to  provide  the  correct  cache  data  to  the  CPU.  The 
flow  chart  is  shown  in  Figure  9. 

ad  and  Miss  Cycle 


address  different  memory  words  within  a  block  itself.  It  then 
performs  read  cycles  on  the  system  memory  by  using  DREQ 
(DRAM  memory  request),  DRW  (read  or  write)  and  DRDY 
(DRAM  data  ready)  signals  and  write  cycles  on  all  the  blocks  of 
the  corresponding  cache  set  by  asserting  CACWRT  signal.  The 
replacement  set  is  determined  by  the  LRU  control  block  which 
selects  one  of  the  four  sets. 

Upon  completion  of  the  update  of  new  cache  data,  the  FPC 
removes  the  INHIBIT  signal  to  allow  further  CPU  access  to  the 
cache. 


If  the  proper  tag  does  not  appear  in  any  of  the  four  tag  buffers,  a 
niss  signal  is  generated  by  the  tag  buffers  on  H/MO-3  signal  lines 
(Figure  7  and  8).  On  detection  of  the  four  miss  signals  the 
PAL1 6R6  removes  the  "RDY"  signal  to  CPU  (for  CPU  wait  cycles) 
and  informs  the  FPC  of  the  MISS  by  asserting  the  CYCL  &  HIT/ 
MISS  signals.  When  the  FPC  detects  this  read  and  miss,  it 
latches  the  current  physical  address  by  asserting  the  ADLATCH 
signal,  and  initiates  a  DRAM  (System  Memory)  cycle  by  asserting 
the  DREQ  signal,  to  provide  the  data  to  the  CPU  from  the  system 
memory.  When  this  data  is  available  signalled  by  the  DRAM  data 
ready  signal  DRDY,  the  FPC  signals  the  assertion  of  the  RDY 
signal,  allowing  the  CPU  to  read  the  requested  data.  Simultane- 
ously the  FPC  asserts  the  INHIBIT  signal  which  disallows  further 
CPU  access  to  the  cache. 

The  FPC  then  performs  an  update  algorithm  which  brings  the 
data  from  the  system  memory  to  the  cache.  It  asserts  the  CTRST 
gnal  which  disables  the  CPU  data  lines  to  the  cache  system.  It 
)  disables  CPU  block  addresses  by  asserting  BLDBL  signal 
nd  enables  its  own  output  address  lines  BADO-1  in  order  to 


Write  and  Hit  Cycle 

The  write  cycle  operation  is  differentf  ram  the  read  cycle,  since  the 
main  memory  has  to  be  updated  along  with  the  cache.  This  is 
called  the  "write-through"  strategy.  Since  typical  programs  re- 
quire only  1 0-1 5%  of  the  processortime  for  write  cycles,  this  does 
not  impact  the  overall  performance.  For  every  write  cycle  for 
which  the  tag  buffer  signals  a  hit,  the  CPU  writes  the  data  to  the 
cache.  In  addition,  the  FPC  initiates  a  DRAM  (system  memory) 
cycle  and  updates  it  with  the  same  data  (see  Figure  10). 

Write  and  Miss  Cycle 

For  a  write  and  miss  cycle,  the  function  of  the  system  is  similar  to 
the  read  and  miss  cycle,  except  that  a  write  of  the  main  memory 
is  performed  instead  of  a  read.  After  the  write  to  the  main  memory 
is  completed,  the  FPC  executes  the  update  algorithm  to  bring  the 
contents  of  that  memory  block  into  the  cache.  The  flow  chart  for 
the  microcode  executed  by  the  FPC  for  the  write  cycle  is  shown 
in  Figure  10. 
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Figure  7.  Memory  Organization  of  Cache  System 
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FPC  Operation 

The  System  Controller  has  five  main  functions: 


Test  for  a  cycle  request  (CYCL) 

Test  for  different  cycle  types 

Execute  appropriate  action  for  each  cycle 

Execute  update  algorithm  if  required 

Generate  update  addresses  if  required. 


Test  for  Cycle  Request 

The  FPC  samples  the  CYCL  signal  continuously.  The  CYCL 
signal  is  generated  only  for  read  &  miss,  write  and  hit,  and  write 
and  miss  cycles.  The  system  requires  FPC  intervention  only  for 
these  three  cycles.  When  CYCL  is  asserted,  the  FPC  tests  for  the 
three  cycle  types. 

Test  for  Cycle  Type 


The  replacement  algorithm  then  executes  read  cycles  on  the 
memory  and  write  cycles  on  the  cache  for  all  the  four  block 
addresses.  Afterthe  replacement  is  complete,  the  control  returns 
for  sampling  of  new  CPU  cycles.  The  design  file  for  the  Cache 
controller  is  shown  in  Figure  12. 

The  replacement  set  selection  can  depend  upon  various  selec- 
tion algorithms  as  described  before.  This  set  selection  logic  has 
not  been  shown  in  this  design.  However,  the  algorithm  used  here 
is  LRU  (Least  Recently  Used)  method,  and  only  its  interface  with 
the  cache  controller  is  shown. 

Timing  and  Performance 

For  a  cache  read  cycle  when  a  hit  occurs,  the  worst  case  access 
time  for  the  cache  system  is  the  tag  buffer  delay  (20  ns)  along  with 
the  cache  output  enable  delay  (15  ns).  The  worst  case  access 
timeforthis  cache  implementation  is  35  ns.  The  cycle  time  forthe 
system  will  be  a  little  longer  than  the  access  time. 


This  involves  sampling  the  CPU,  memory  request  MREQ,  and 
read  or  write  (RHWL)  signals.  On  the  basis  of  these  signals  the 
FPC  decides  between  the  initiation  of  read  or  write  cycles.  It  then 
samples  the  HIT/MISS  signal  generated  by  the  PAL16R6  to 
decide  on  the  appropriate  action  to  be  taken. 


Action  for  Each  Cycle 

One 


nee  the  FPC  determines  the  CPU  cycle  type  and  whether  the 
location  exists  in  the  cache  (HIT  or  MISS),  it  then  performs 
appropriate  actions  such  as  dynamic  RAM  (System  Memory) 
cycle,  and  Update  algorithm.  The  functions  performed  by  the 
FPC  for  each  cycle  type  were  detailed  earlier. 

Update  Algorithm 

On  detection  of  a  CPU  cycle  and  a  MISS,  the  FPC  performs  the 
update  algorithm.  It  consists  of  reading  data  from  the  system 
memory  (DRAM)  and  writing  that  data  to  the  appropriate  cache 
set.  The  address  for  this  replacement  is  the  one  on  which  the  CPU 
cycle  MISS  occurs,  which  in  turn  is  latched  by  the  FPC  on 
detection  of  the  MISS.  The  selection  of  the  appropriate  cache  set 
is  dependent  upon  the  information  provided  by  the  replacement 
logic  block.  The  update  is  done  for  all  four  blocks  (words)  of  data. 
During  this  update,  the  FPC  isolates  the  CPU  data  bus  from  the 
cache  data  bus  by  asserting  the  CTRST  signal. 

Update  Address  Generation 

The  replacement  address  is  the  CPU  address  at  which  the  MISS 
occurs.  However,  all  the  four  blocks  for  the  same  memory 
address  should  be  replaced  in  the  cache.  To  generate  these  four 
addresses,  two  of  the  FPC's  eightthree-state  lines  are  used.  The 
two  least  significant  bits  of  the  CPU's  physical  address  (Block 
Address  bits)  are  disabled  by  the  FPC  by  asserting  the  BLDBL 
signal,  and  the  two  output  bits  of  the  FPC  (three-state  control 
lines)  are  enabled  onto  the  address  bus.  This  is  done  in  a  manner 
to  allow  the  system  CPU  to  continue  to  work  with  the  address  and 
data  bus,  isolated  from  the  cache  system,  during  the  execution  of 
the  replacement  algorithm. 


The  50  ns  cycle  time  is  a  very  high  speed  cache  performance, 
considering  that  the  usual  cycle  time  for  DRAMs  is  in  the  range  of 
280-400  ns.  Further  improvements  in  the  cycle  time  can  be  made 
by  using  PAL  devices  with  a  faster  propagation  delay  of  1 0  ns. 
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Figure  11.  Flow  Chart  for  Cache  Update  Scheme 
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device  (am29pll41) 
default  =  0  ; 

define 

mreq  =  to 

rhwl  =  tl 

hitm  =  t2 

drdy  =  t3 

nocdn  =  t4  "No  condition" 

intword  =  07d#h;  "Initialize  output  word" 

begin 

init     :  od  intword,  continue  ;  "Initialize  Outputs" 

start   :  od  intword,   if  (mreq)  then  goto  pl(start); 

"Check  for  memory  cycle" 
od  intword,   if  (not  rhwl)  then  goto  pl(write); 

"Check  for  read  or  write" 

read     :  od  intword,   if  (hitm)  then  goto  pl(start); 

"Read  &  hit  goto  start" 

latch  :  od  079#h,       if  (not  drdy)  then  goto  pl(latch); 

"Read  &  miss  latch  address" 
od  069#h,       continue;  "DRAM  req  for  CPU  data" 

od  063#h,       continue;  "DRAM  output  enb  i  CPU  cycle" 

od  063#h,       if  (nocdn)  then  call  pl(rplce); 

"Goto  replacement  subroutine" 
od  intword,  if  (nocdn)  then  goto  pl(start); 

"Read  cycle  complete" 

write  ;  od  intword,   if  (not  hitm)  then  goto  pl(wrtms); 

"Write  hit  or  miss" 

wrtht  :  od  Ofd#h,       if  (not  drdy)  then  goto  pl(wrtht); 

"Hit  write  to  cache" 
od  04d#h,       continue;  "Write  to  DRAM" 

od  04f#h,  continue; 

od  04f#h,       if  (nocdn)  then  goto  pl(start); 

"Write  hit  complete" 

wrtms  :  od  intword,   if  (not  drdy)  then  goto  pl(wrtms); 

"Miss  check  DRAM  for  write" 

od  049#h,  continue; 

od  04b#h,       continue;  "Latch  address  i  DRAM  write" 

od  04b#h,       if  (nocdn)  then  call  pl(rplce); 

"goto  replacement  algorithm" 
od  intword,   if  (nocdn)   then  goto  pl(start); 

"Write  miss  complete" 

rplce  :  od  07b#h,       if  (not  drdy)  then  goto  pl(rplce); 

"This  is  replace  routine" 
oe  022#h,       continue;  "Isolate  CPU  databus" 

oe  022#h,       continue;  "DRAM  data  enb  for  cache  write" 

oe  0a2#h,       continue;  "Write  to  cache  at  address  00" 

waitl   :   od  13a#h,       if   (not  drdy)   then  goto  pl(waitl); 

"Change  Block  address" 
oe  122#h,       continue;  "Repeat  as  before  address  01" 

oe  122#h,  continue; 

oe  la2#h,       continue;  "Write  to  cache  at  address  01" 

wait2   :   od  23a#h,       if   (not  drdy)   then  goto  pl(wait2); 

"Change  Block  address" 
oe  222#h,       continue;  "Repeat  as  before  address  02" 

oe  222#h,  continue; 

oe  2a2#h,       continue;  "Write  to  cache  at  address  02" 

wait3  :  od  33a#h,       if  (not  drdy)  then  goto  pl(wait3); 

"Change  Block  address" 
oe  322#h,       continue;  "Repeat  as  before  address  03" 

oe  322#h,  continue; 

oe  3a2»h,       continue;  "Write  to  cache  at  address  03" 

od  03a#h,       if  (nocdn)  then  ret;"Return  from  replace  routine" 


end. 


Figure  12.  Cache  Controller  Design  File 
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Two  components  used  in  the  temporary  storage  of  data  are 
LIFO  (Last  In,  First  Out),  and  FIFO  (First  In,  First  Out)  buffers. 
Usually,  a  FIFO  would  be  used  to  store  data  between  asyn- 
chronously operating  devices.  For  example,  data  sent  from 
one  microprocessor  system  to  a  second  might  reside  tempo- 
rarily in  a  FIFO  until  the  second  microprocessor  is  ready  to 
read  the  data.  Sometimes  the  FIFO  is  known  as  an  asynchron- 
ous temporary  buffer  store. 

The  LIFO  also  offers  temporary  storage  and  will  store  data  in 
the  same  way  as  a  FIFO  but  the  sequence  of  reading  the  data 
will  be  reversed.  LIFO  applications  in  digital  processing  sys- 
tems would  usually  be  to  store  return  addresses  and  essential 
registers  when  the  processor  is  executing  a  branch  to  subrou- 
tine or  interrupt  service  routine.  The  processor  will  retrieve  this 
information  when  it  executes  return  instructions.  The  LIFO  is 
an  essential  component  in  the  processor's  ability  to  perform 
context  switching. 

The  LIFO  is  sometimes  called  a  stack,  and  the  process  of  writ- 
ing to  the  stack  is  called  a  PUSH  operation  and  reading  data  is 
a  POP  operation.  In  the  central  processor  dedicated  instruc- 
tions for  PUSH  and  POP  operations  are  decoded  and  executed 
to  save  and  restore  return  addresses,  contents  of  accumula- 
tors, status  registers,  and  index  registers.  Other  LIFO  applica- 
tions could  be  found  in  data  buffering  where  the  order  of  data 
reception  should  be  reversed. 

The  concept  of  the  stack  is  used  in  performing  Reverse  Polish 
Notation  (RPN)  mathematics,  and  it  is  claimed  by  some  to  be 
superior  in  performance  to  the  conventional  algebraic  process- 
ing technique.  With  RPN,  instructions  and  operands  are 
pushed  onto  the  stack,  and  evaluated  as  they  are  popped  off; 
the  result  is  usually  left  in  the  general  purpose  register. 

When  a  memory  is  used  as  a  stack,  the  evaluation  process 
may  be  repeated  because  the  initial  evaluation  did  not  destroy 
the  RAM  contents.  This  non-destructive  reading  of  data  is  a 
feature  of  RAM-based  FIFO  and  LIFO  architectures,  and  is  not 
the  case  for  register-based  architectures.  FIFOs  and  LIFOs 
made  from  register  arrays  rely  on  data  shuffling  through  the 
array.  Data  in  the  registers  are  overwritten  during  each  cycle. 
The  RAM  control  for  a  LIFO  essentially  consists  of  an  Up/Down 
counter  to  address  locations  in  the  memory,  and  a  control 
sequencer  to  drive  the  memory's  chip  select  (CS)  and  write 
(WR)  inputs. 


The  new  PAL22RX8A  has  been  programmed  to  perform  a 
control  function  for  a  small  32-location-deep  stack.  A  two-chip 
solution  is  achieved  by  using  an  Am  9128  static  RAM  and  a 
PAL22RX8A  (see  Figure  1),  for  bytewide  applications. 
Additional  RAM  devices  may  be  added  in  parallel  for  16-  and 
32-bit  wide  applications.  A  deeper  LIFO  may  be  configured  by 
cascading  a  second  PAL22RX8A  to  create  a  higher-order  Up/ 
Down  counter. 
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Figure  1.  32-Location-Deep  LIFO  Controller  Uses  PAL22RX8A 


Figure  2  shows  the  block  diagram  of  the  PAL22RX8A  device. 
Fourteen  dedicated  inputs  drive  the  single  fuse  array  through 
true/complement  buffers.  One  of  those  inputs  includes  the 
clock,  so  logic  operations  may  be  performed  on  the  clock  or  its 
complement. 


10/CLK  |  > 
(2)'  


.  .  14 

 '  (17) 


Figure  2.  Block  Diagram 

Note:  » 

PLCC  pin  numbers  are  indicated  in  parentheses. 

PLCC  pins  1.8.  15  and  22  are  not  connected. 


There  exist  eight  macrocells  in  the  device.  The  schematic  view 
of  the  macrocell  architecture  is  shown  in  Figure  3.  The  D-type 
register  is  the  storage  element  in  the  device  and  is  clocked 
from  a  single  clock  input.  The  D  input  is  fed  from  an  exclusive- 
OR  (XOR)  gate,  the  two  inputs  of  which  are  driven  from  one 
product  term  and  a  sum  of  eight  product  terms.  This  XOR  gate 
can  be  used  to  control  the  polarity  at  the  output  pin;  the  single 
product  term  input  can  be  used  to  create  polarity  inversion. 


Figure  3.  PAL22RX8A  Macrocell 

With  regard  to  registered  output  configurations,  the  XOR  gate 
may  be  used  to  configure  alternative  registered  functions.  It 
can  be  shown  that  a  J-K  flip-flop  may  be  derived  by  using  feed- 
back, the  sum-of-product  inputs,  and  the  XOR  input,  as  shown 
in  the  equations  below.  Three  out  of  the  five  equations  use  the 
XOR  gate  to  create  the  J-K  function. 

1 .  Q  :=  Q*J  +  Q*K 


2.  Q 


Q*J  +  Q*K 


3.  Q  :=  Q  :+:  (Q*J  +  Q*K) 
4, 5       :+:  (Q*J  +  Q*K) 

5.  Q:=Q:+:(Q'J  +  a*K) 

Also,  S-R  and  other  subset  functions  may  be  configured  in  the 
macrocell. 

There  exist  two  multiplexers  in  the  macrocell  which  select 
either  a  registered  or  combinatorial  output  path.  If  the  control 
fuse  is  intact,  the  output  is  registered,  and  the  feedback  into 
the  fuse  array  is  through  a  buried  feedback  path  from  the  Q 
node.  A  combinatorial  route  may  be  selected  by  programming 
the  fuse;  the  register  is  flushed  and  feedback  to  the  array  is 
from  the  output  pin.  Programmable  I/O  may  be  achieved  in  this 
configuration  by  controlling  the  three-state  condition  of  the  in- 
verting output  buffer.  A  single  product  term  can  enable  the  out- 
put buffer  with  a  logic  High  or  disable  it  with  a  logic  Low.  When 
in  a  three-state  condition  the  output  pin  may  be  used  as  an  in- 
put. 

When  using  the  flip-flop  in  the  macrocell  it  is  possible  to  use 
the  asynchronous  Reset  or  Preset  feature.  A  single  input  pin 
may  be  dedicated  to  a  Reset  or  Preset  function,  and  driven  ac- 
tive to  initialize  the  register  in  the  device. 

Toggle  Function  as  a  J-K  Subset 

A  frequently  used  subset  of  the  J-K  function  used  in  binary 
count  applications  is  the  Toggle  (T)  function.  If  J  and  K  are  tied 
together  and  driven  with  a  logic  High  then  the  flip-flop  output 
will  Toggle  when  clocked;  if  driven  Low  the  output  will  Hold.  For 
a  given  flip-flop  (N)  in  the  count  sequence  the  toggle  equation 
will  be: 

6.  QN  :=  QN  :+:  QN-1*QN-2*...Q1*Q0. 

This  will  cause  the  flip-flop  to  Hold  until  the  product  term  of  all 
the  lesser  significant  registers  goes  to  a  logic  High;  then,  the 
flip-flop  will  toggle  polarity.  This  equation  is  suitable  for  an  Up 
count.  A  Down  count  is  given  by: 

7.  QN  :=  QN  :+:  QFTl*QTr2'..Q1*Q0. 
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Both  equations  can  be  combined  with  an  Up/Down  control  to 
effect  an  Up/Down  counter.  With  regard  to  the  LIFO  applica- 
tion, the  Up/Down  control  for  the  address  input  to  the  CMOS 
RAM  will  be  decoded  from  the  PUSH  or  POP  instruction.  The 


1 
2 

2a. 


sequence  will  be  "Increment  then  Write"  for  a  PUSH  operation 
and  "Read  then  Decrement"  for  a  POP  operation.  The  timing 
waveforms  (Figure  4)  show  the  sequence  for  PUSH  and  POP 
cycles.  Note  that  the  LIFO  starts  from  memory  address  zero. 


Active  Write  Cycle  -  PUSH 

PUSH  input  goes  active  High  at  tsu  or  greater 
before  the  clock  edge. 

First  clock  edge  samples  the  active  PUSH  input. 
WR  goes  active  Low. 
2b.  Address  increments. 
3.    Second  clock  edge;  PUSH  still  active  High. 


3a.  CS  goes  active  Low. 

4.  Third  clock  edge;  PUSH  still  active  High. 
4a.  CS  goes  inactive  High. 

5.  Fourth  clock  edge;  PUSH  still  active  High. 
5a.  WR  goes  inactive  High. 

6.  PUSH  instruction  is  removed. 


Active  Read  Cycle  -  POP 

1 .  POP  instruction  goes  active  High  at  tsu  or 
greater  before  the  clock  edge. 

2.  First  clock  edge  samples  the  active  POP  input. 
2a.  CS  goes  active  Low  to  select  the  RAM. 

3.  Second  clock  edge;  POP  still  active  High. 
3a.  CS  goes  inactive  High. 


4.  Third  clock  edge;  POP  still  active  High. 

4a.  WR  goes  active  Low;  the  memory  is  deselected 
so  no  Write  will  take  place. 

4b.  Address  lines  decrement. 

5.  POP  instruction  is  removed. 
5a.  WR  goes  inactive  High. 


Figure  4.  LIFO  Control  Timing  Diagrams 
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The  control  outputs  to  the  RAM  are  CSX  and  WR.  An  additional 
register  (XO)  is  required  to  enable  all  states  to  be  reached;  two 
registers  alone  are  not  enough.  From  the  timing  waveforms  it 
can  be  seen  that  an  active  write  cycle  is  initiated  after  an  ac- 
tive PUSH  input  has  been  sampled  by  the  system  clock.  The 
WR  signal  then  toggles,  causing  the  output  pin  to  go  active 
Low.  This  condition  is  maintained  until  the  control  register  XO 
goes  High,  at  the  end  of  the  PUSH  cycle. 

The  equations  after  the  XOR  symbol  in  the  design  specification 
(page  2-255)  cause  the  toggle  function;  when  these  product 
terms  become  active  High  a  toggle  operation  is  effected  in  the 
associated  flip-flop.  For  example,  the  CSX  flip-flop  is  in  a  Hold 
state  until  WR  goes  Low  and  while  XO  is  High  during  a  PUSH 
sequence;  during  a  POP  sequence,  CSX  holds  unless  both  WR 
and  XO  are  High.  The  equations  are  written  for  each  flip-flop  by 
using  one  product  term  after  the  XOR  label  to  effect  the 
change  of  state  or  toggle  condition  to  fit  the  requirements  of 
the  state  machine.  Without  the  XOR  gate,  an  Up/Down  counter 


would  require  more  than  the  eight  product  terms  available  in 
standard  PAL  devices. 


Instruction  Decoding 

Although  not  shown  in  the  design  specification,  it  is  possible 
for  the  PAL  device  to  decode  instructions  for  PUSH  and  POP. 
Assuming  the  unused  input  pins  are  converted  to  10-17,  which 
come  from  an  instruction  register  in  a  processor  system,  then 
decodjng  could  be  achieved  in  the  fuse  array.  If  an  additional 
input  VI  qualifies  a  valid  instruction  and  the  code  for  PUSH  is 
given  as  i7*]6*T5*T4*l3*l2*H*IO*VI,  then  this  equation  could 
be  added  to  the  string  declaration  statement  of  the  design 
specification.  The  PAL  device  could  decode  POP  operations 
similarly.  In  this  way,  external  decoding  logic  may  be  incorpo- 
rated into  the  PAL  device,  which  should  increase  the  operation 
speed. 
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CHIP  LIFOCONT  PAL22RX8 


THE  PAL22RX8  HAS  BEEN  DESIGNED  TO  CONTROL  32  RAM 
LOCATIONS  AS  A  LIFO  RAM  CONTROLLER.     THE  PAL  DEVICE 
DESIGN  SPECIFICATION  CONSISTS  OF  TWO  STATE  MACHINES. 
ONE  STATE  MACHINE  IS  AN  UP/DOWN  BINARY  COUNTER  THAT 
DRIVES  THE  ADDRESS  LINES  OF  THE  RAM,   AND  THE  SECOND 
DRIVES  THE  /WR  AND  /CS  LINES  OF  THE  RAM  TO  CONTROL 
WRITE  AND  READ  OPERATIONS.      DURING  A  PUSH  OPERATION 
DATA  IS  WRITTEN  TO  THE  LIFO;   THE  PROCEDURE  IS  INC- 
REMENT ADDRESS  COUNTER  AND  WRITE  DATA.     FOR  A  POP 
OPERATION  THE  READ  CYCLE  IS  PERFORMED  FOLLOWED  BY 
AN  ADDRESS  DECREMENT.     THE  XOR  GATE  IN  EACH  MACRO- 
CELL  ALLOWS  INCREMENT  AND  DECREMENT  OPERATIONS 
TO  BE  EFFECTED  WITHOUT  UTILIZING  A  LARGE  NUMBER  OF 
PRODUCT  TERMS. 


;PIN 

1 

2 

3 

4 

5 

6 

CLK 

PUSH 

POP 

NC 

NC 

NC 

;PIN 

7 

8 

9 

10 

11 

12 

NC 

NC 

NC 

NC 

/RST 

GND 

;PIN 

13 

14 

15 

16 

17 

18 

NC 

NC 

/CSX 

/XO 

/WR 

/Q4 

;PIN 

19 

20 

21 

22 

23 

24 

/Q3 

/Q2 

/Qi 

/Q0 

NC 

VCC 

GLOBAL 


STRING  INC  '/CSX*/WR*/X0*PUSH' 
STRING  DEC    '/CSX*/WR*  X0*POP' 


GLOBAL  TERM 
ENABLES  THE 
GLOBAL  RESET 

INCREMENT  LABEL 
DECREMENT  LABEL 


Figure  6.  LIFO  RAM  Controller  Design  File 
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PAL22RX8A 




EQUATIONS 

GLOBAL 

\J  J^J         i-J  Ll  l—l 

.  RSTF 

=  RST  ; 

RESET  AT  POWER  ON 

QO 

:  + : 
+ 

QO  '; 
INC  ; 
DEC  ; 

QO  HOLD 

INC  ADDRESS  FOR  PUSH 
DEC  ADDRESS  FOR  POP 

Ql 

:  +  : 
+ 

Ql  ! 
Q0*INC 
/Q0*DEC 

Ql  HOLD 
INC  Ql 
DEC  Ql 

Q2 

:  +  : 
+ 

Q2 

QO*  Q1*INC 
/Q0*/Q1*DEC 

Q2  HOLD 
INC  Q2 
DEC  Q2 

Q3 

•  — 

:  + : 

Q3 

QO*  Ql*  Q2*INC 
/Q0*/Q1*/Q2*DEC 

Q3  HOLD 
INC  Q3 
DEC  Q3 

Q4 

:  +  : 
+ 

Q4 

QO*  Ql*  Q2*  Q3*INC 
/Q0*/Q1*/Q2*/Q3*DEC 

Q4  HOLD 
INC  Q4 
DEC  Q4 

:  +  : 
+ 

CSX 

WR*/X0*PUSH 
/WR*/X0*POP 

•CHIP  SELECT 
■TO  RAM  DEVICE 

WR 

:  + : 
+ 
+ 

+ 

WR 

/WR*/CSX*/X0*  PUSH 
WR*/CSX*  XO*  PUSH 

/WR*/CSX*  XO*  POP 
WR*/CSX*/X0*/POP*/PUSH 

! WRITE  CONTROL 
;TO  RAM  DEVICE 

XO 

•  +  • 

•  • 

+ 
+ 
+ 

XO 

WR*  CSX*/X0*  PUSH 
/WR*/CSX*  X0*/PUSH 
/WR*  CSX*/X0*  POP 
/WR*/CSX*  XO*  POP 

•CONTROL  REGISTER 
;FOR  READ-WRITE 
; STATE  MACHINE. 

Figure  6.  LIFO  RAM  Controller  Design  File  (Continued) 
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SIMULATION 

TRACE_ON  CLK  QO  Ql  Q2  Q3  Q4 

/CSX  /WR  /XO  PUSH  POP 
RST 

SETF         /CLK  RST  /PUSH  /POP 

CLOCKF  CLK 

SETF  /RST 

SETF  PUSH 

FOR     I   :=  1  TO  4  DO 

BEGIN 

CLOCKF  CLK 
END 

SETF  /PUSH 
CLOCKF  CLK 
SETF  PUSH 

FOR  I   :=  1  TO  4  DO 
BEGIN 

CLOCKF  CLK 
END 

SETF  /PUSH 
CLOCKF  CLK 
SETF  POP 

FOR  I    :=  1  TO  4  DO 
BEGIN 

CLOCKF  CLK 
END 

SETF  /POP 

FOR     I    :=   1  TO  3  DO 
BEGIN 

CLOCKF  CLK 


TRACE  OFF 


SIMULATION 

SECTION. 

TRACE  ESSENTIAL 

SIGNALS. 

RESET  PAL  DEVICE. 

CLOCK  RESET  TO 

REGISTERS. 

ENABLE  PUSH  INPUT. 

PERFORM  ONE  PUSH 

CYCLE. 

PUSH  AND  POP  MUST 

BE  INACTIVE  BETWEEN 

CYCLES.      SET  PUSH 

INACTIVE. 

SET  PUSH  ACTIVE 

AND  PERFORM  SECOND 

PUSH. 

SET  PUSH  INACTIVE 
AND  CLOCK  DEVICE 
THEN  SET  POP 
ACTIVE.  PERFORM 
POP  CYCLE. 


SET  PUSH  AND  POP 
INACTIVE.  CLOCK 
SYSTEM  FOR  A  PASSIVE 
MODE. 


Figure  6.  LIFO  RAM  Controller  Design  File  (Continued) 
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Graphics  systems  are  commonly  used  in  most  computers  for 
displaying  pictures  and  text.  Functions  of  a  graphics  system 
include  creation,  manipulation  and  storage  of  picture  databases 
and  text.  Usually,  graphics  systems  also  allow  users  to  dynami- 
cally control  a  picture's  content,  size  or  colors. 

Graphics  systems  are  closely  related  to  Image  processing  sys- 
tems. While  graphics  systems  manipulate  images  which  already 
exist  as  a  database,  image  processing  systems  analyze  images 
taken  from  cameras  by  converting  them  to  a  graphical  database 
and  then  processing  that  database.  The  hardware  used  for  these 
systems  is  very  similar.  In  fact,  it  is  so  similar  that  many  systems 
offer  integrated  graphics  and  image  processing  functions. 

In  this  section  we  will  examine  the  possible  uses  of  PLDs  in  a 
graphics  and  image  processing  system  and  their  implementation. 
We  will  also  determine  which  PLDs  are  suited  for  different 
applications  in  graphics  and  image  processing  systems.  Laterwe 
will  also  see  some  design  examples  of  PLD  applications. 

A  simple  graphics  system  consists  of  five  components,  as  shown 
in  Figure  1. 

The  first  component  is  the  graphics  processor,  which  draws  the 
image  (referred  to  as  "image  rendering").  It  also  provides  an 
interface  to  the  main  system  processor  (also  known  as  the  host 
processor).  The  second  component  is  the  display  controller, 
which  is  an  interface  between  the  graphics  processor  and  the 
graphics  frame  buffer.  The  display  controller  is  essentially  a 
modified  DRAM  controller.  The  third  component  is  the  graphics 


frame  buffer,  which  is  typically  a  memory  that  stores  the  image 
data.  The  data  from  the  frame  buffer  is  then  serialized  by  data 
serializers  which  make  up  the  fourth  component  of  a  graphics 
system.  The  data  serializers  are  essentially  modified  shift  regis- 
ters. The  serial  data  is  then  converted  to  an  analog  signal  suitable 
for  driving  the  video  monitors  for  display.  This  function  is  per- 
formed by  the  fifth  and  the  final  component  of  the  graphics 
system,  a  digital-to-analog  converter  (DAC). 

Local  Graphics  Processor 

The  graphics  (or  image)  processor  gets  all  the  commands  from 
the  host  processor.  The  interface  between  the  graphics  proces- 
sor and  the  host  processor  is  through  a  bus  (Figure  2),  where 
commands  and  data  bytes  are  transferred.  Most  of  the  popular 
standard  bus  interfaces  are  used  for  this  function.  Some  of  the 
commonly  used  bus  interfaces  are: 

•  Multibus 

■  VME 

•  PC-bus 

•  Nu-bus 

■  MicroChannel 

The  bus  interface  performs  three  major  functions: 

•  Bus  arbitration 

•  Bus  control 

•  Address  decoding 


FRAME  BUFFER 
MEMORY 


DATA 
SERIALIZERS 


DISPLAY 
CONTROLLER 


GRAPHICS  BOARD 


LOCAL  GRAPHICS 
PROCESSOR  (OPTIONAL) 


HOST 

DISK 
APPLICATION 
SOFTWARE 

PROCESSOR 

Figure  1.  Components  of  Graphics  System  Hardware 
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i  nure  are  two  major  lypes  01  ouses,  syncnronous  ana  asynchro- 
nous. For  synchronous  buses,  the  bus  arbitration  and  control 
functions  can  be  effectively  performed  by  fast  registered  PLDs 
such  as  the  PAL16R8/6/4,  PAL20R8/6/4  and  PAL22V10.  For 
asynchronous  buses,  the  arbitration  and  control  functions  can  be 
provided  by  combinatorial  PLDs,  or  by  an  asynchronous  PLD  (the 
PAL20RA1 0  or  AmPALC29MA1 6).  The  details  for  designing  bus 
interface  functions  are  given  in  a  separate  section  on  page  2-325. 
The  address  decoding  function  is  performed  most  effectively  by 
a  combinatorial  PAL16L8,  PAL20L8  or  PAL22V10.  Address  de- 
coding is  discussed  along  with  other  combinatorial  functions  in 
the  section  on  page  2-35. 


LOCAL  GRAPHICS 
PROCESSOR 


SYSTEM  BUS 


HOST 
PROCESSOR 


Figure  2.  System  Bus  Interface 

The  ocal  graphics  processor  is  used  to  interpret  the  database  of 
the  graphical  image  ("display  list"),  down-loaded  from  the  host 
processor.  The  graphics  processor  interprets  the  display  list  into 
objects  with  geometrical  entities.  It  also  performs  coordinate 
transformation  functions  for  recalculating  object  size  as  the  object 
moves  or  viewer  perspective  changes.  The  last  function  per- 
formed by  the  graphics  processor  is  the  actual  image  rendering 
(drawing)  functions  of  drawing  arc/vectors,  moving  blocks  of 
image  data  (bit-blit),  and  character  manipulation  for  handling 
large  chunks  of  text. 

In  an  image  processing  system  the  local  processor  performs  the 
role  of  an  image  processing  engine.  The  image  processing 
software  tasks  are  very  different  from  the  graphics  tasks.  Image 
processing  tasks  include  interpreting  the  list  of  commands  sent  by 
the  host  and  executing  required  functions  such  as  image  en- 
hancement, pattern  recognition,  or  computer  vision.  The  most 
intensive  task  is  image  enhancement  which  requires  generating 
image  transforms  in  their  spatial  or  frequency  domain.  Generat- 
ing transforms  requires  a  large  number  of  multiplication  opera- 
tions. Typical  image  processing  systems  use  either  a  special 
microprocessor  or  a  floating  point  multiplier  for  this  purpose. 
PLDs  can  also  be  used  to  interface  this  multiplier  to  the  local 
microprocessor.  Some  image  processing  tasks  are  very  uniform 
and  repetitive,  and  dedicated  hardware  can  be  easily  designed  to 
speed  up  such  tasks  in  the  interest  of  overall  system  perform- 
ance. Such  dedicated  hardware  can  be  readily  made  with  PLDs. 

The  Display  Controller 

For  implementing  graphics  functions  the  local  processor  must  be 
able  to  write  to  the  frame  buffer  (draw  an  image).  Similarly  for 
implementing  image  processing  functions  the  local  processor 
must  be  able  to  read  from  and  write  to  the  frame  buffer.  The 
reading  and  writing  of  frame  buffer  data  is  possible  with  the  help 


ot  the  display  controller,  as  shown  in  Figure  3.  The  display 
controller  is  essentially  a  timing  generator  which  performs  three 


major  functions 


•  DRAM  timing 

•  Monitor  control  signal  timing 

•  Screen  refresh 

The  display  controller  generates  the  read  and  write  cycle  timings 
as  well  as  the  refresh  timing  of  the  DRAMs.  The  read  and  write 
cycles  are  initiated  by  the  local  graphics  processor.  These 
functions  are  identical  to  a  regular  DRAM  controller  discussed  in 
the  section  on  memory  control  (page  2-1 79).  The  second  function 
required  for  the  display  controller  is  the  generation  of  monitor 
control  signals.  This  function  is  usually  integrated  with  the  DRAM 
controller,  although  it  can  also  be  implemented  separately. 


FRAME  BUFFER 
MEMORY 

.  SCREEN  REFRESH 

TO  SERIAL IZER 

DISPLAY 

.  MONITOR 

CONTROLLER 

*  CONTROL 

Figure  3.  The  Display  Controller 


The  controller  needs  to  generate  the  DOTCLK  signal  which 
determines  the  rate  at  which  data  is  sent  to  the  monitor.  The  video 
monitor  synchronization  signals  HSYNC  for  every  new  line,  and 
VSYNC  for  every  new  frame  (Figure  4)  are  generated  by  dividing 
this  DOTCLK  signal.  Anothersignal,  BLANK,  asserted  only  when 
the  pixels  are  being  displayed,  is  also  generated  by  this  controller. 
The  HSYNC,  VSYNC  and  BLANK  signals  not  only  drive  the 
monitor,  but  are  also  used  as  synchronization  signals  by  the 
subsequent  stages  of  the  graphics  system. 

In  an  image  processing  system,  the  HSYNC  and  VSYNC  signals 
are  generated  by  the  camera  which  captures  the  image.  The 
internal  DOTCLK  signal  is  a  multiple  of  these  HSYNC  and 
VSYNC  signals  and  is  generated  by  using  a  phase  locked  loop 
(PLL). 

The  third  function  of  the  display  controller  is  to  refresh  the  screen. 
The  controller  automatically  reads  data  in  sequential  addresses 
from  the  frame  buffer  at  fixed  time  intervals,  and  sends  it  to  the 
monitor  at  a  rate  determined  by  the  DOTCLK  signal.  The 
sequential  addresses  are  also  generated  by  the  display  controller 
as  shown  in  the  example  on  page  2-261 .  Each  chunk  of  data  is 
called  a  pixel  and  is  displayed  on  the  screen  as  a  dot. 

Registered  PLDs  can  easily  implement  these  display  control 
functions.  As  this  would  be  a  fast  state  machine,  the  range  of 
options  includes  the  PAL32VX10/A,  PAL16/20R8/6/4, 
AmPAL23S8,  and  the  PLS105/167/168.  An  example  for  a  line 
sync  generator  (HSYNC),  and  another  for  a  frame  syncgenerator 
(VSYNC),  are  discussed  later  on  pages  2-263  and  2-265  respec- 
tively. 
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Figure  4.  A  Raster  Scan  Display  Timing  Control 


The  Frame  Buffer 

The  frame  buffer  (Figure  5)  is  traditionally  made  from  dynamic 
RAMs  (DRAMs).  The  DRAMs  provide  cost  effective  means  for 
storing  image  data.  Graphics  systems  require  two  ports  for 


PIXEL 


PLANE  4 


PLANE  3 


PLANE  2 


PLANE  1 


Figure  5.  The  Frame  Buffer 


reading  and  writing  the  frame  buffer  data.  The  first  port  is  for  the 
display  update  to  and  from  the  graphics  processor;  the  second 
port  is  for  the  screen  refresh  to  the  monitor. 

The  performance  of  a  graphics  system  is  measured  by  the  screen 
update  bandwidth.  This  is  the  rate  at  which  the  image  data  can 
be  written  to  the  frame  buffer.  The  DRAMs  can  be  organized  in 
various  ways  to  provide  access  to  multiple  bits  of  image  data 
simultaneously,  which  can  increase  update  bandwidth.  Another 
related  factor  is  the  screen  resolution,  which  determines  the  rate 
at  which  pixels  have  to  be  sent  to  the  monitor  in  order  to  provide 
a  flicker  free  display.  The  table  in  Figure  6  shows  this  pixel  clock 
(DOTCLK)  rate  required  for  various  screen  resolutions.  Varying 
DRAM  organizations  can  also  be  used  to  meet  this  screen 
resolution  bandwidth. 


RESOLUTION 

FREQUENCY 

PIXEL  CLOCK  (DOTCLK) 

512X512 

25  MHz 

40  ns 

640  X  480 

25  MHz 

40  ns 

720  X  350 

50  MHz 

20  ns 

1 000 X 1000 

100  MHz 

10  ns 

1 280 X 1000 

125  MHz 

8  ns 

Figure  6.  Various  Screen  Resolutions  and 
Bandwidth  Required 
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Graphics  and  Image  Processing  Systems 
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Figure  7.  A  Video  RAM 

Video  DRAMs  (VRAMs)  are  specialized  memories  which  provide 
two  ports.  The  VRAMs  (Figure  7)  include  a  built-in  serializer 
capable  of  supporting  high  data  bandwidths.  These  serializers 
are  used  to  support  high  bandwidth  requirements  of  the  monitors. 
The  VRAMs  require  extra  control  signals  which  are  typically 
generated  by  the  display  controller. 

The  performance  of  a  graphics  system  is  also  determined  in  part 
by  the  frame  buffer  organization.  There  are  two  main  types  of 
organizations  used  in  the  industry. 

•  Bit-plane  mapped 

•  Pixel-packed  mapped 

Bit-plane  mapping  allows  the  graphics  processor  to  access  single 
bits  (also  called  planes)  from  a  number  of  adjacent  pixels  simul- 
taneously. The  pixel-packed  organization  allows  the  graphics 
processor  to  access  all  the  bits  of  one  pixel  simultaneously.  The 
former  was  used  in  the  monochrometype  of  displays  whereas  the 
latter  is  increasingly  being  used  in  the  modern  color  displays. 

For  high  performance  and  large  screen  resolution  systems, 
multiple  pixel  access  is  possible  only  with  custom  DRAM  organi- 
zations, which  require  a  custom  display  controller  design.  Also, 
different  frame  buffer  organizations  (bit-plane  or  pixel-packed), 
and  the  special  timing  requirements  of  VRAMs,  further  reduce  the 
application  of  dedicated  VLSI  display  controllers.  PLDs  provide 
an  easy  means  of  designing  custom  display  controllers  and  are 
often  used  in  such  systems. 

Serializers 

The  function  of  the  serializer  is  to  accept  multiple  pixels  of  parallel 
data,  as  sent  by  the  display  controller  at  slow  speeds,  and 
transform  them  into  a  serial  pixel  data  stream.  The  serializer  is 
required  even  for  VRAMs,  which  can  only  support  25  MHz 
bandwidths.  The  serializer  speed  requirements  are  based  on 
monitor  resolution  and  are  shown  in  Figure  5. 

The  serializer  is  synchronized  by  the  monitor  control  signals 
HSYNC,  VSYNC  and  BLANK,  which  are  generated  by  the  display 
controller.  A  serializer  is  essentially  a  shift  register,  similar  to  the 
one  discussed  earlier  on  page  2-90.  Certain  modifications  are 
added  to  accommodate  the  specific  design  requirements  of  a 
graphics  system.  These  include  synchronization  with  the  display 
control  signals. 

Many  standard  video  serializers  (679501 ,  679502,  and  Am81 77) 
are  available,  which  can  be  used  for  standard  DRAM  organiza- 
tions. For  non-standard  organizations,  custom  serializers  can  be 
made  with  PLDs.  Sometimes  graphics-related  functions,  which 
are  not  available  in  standard  serializers,  can  also  be  added  to 
such  custom  serializers.  For  example,  a  ZOOM  function  can  be 


easily  implemented  in  a  custom  serializer  by  having  it  repeat  the 
same  pixel  datatwoor  more  times.  An  example  of  a  seven  bit  shift 
register  used  as  a  serializer  is  shown  on  page  2-271.  This 
serializer  has  the  capability  of  providing  reverse  video,  where  it 
inverts  the  outgoing  pixel  data  stream. 

In  certain  character-based  terminals,  a  dedicated  DRAM  or 
VRAM  based  frame  buffer  is  not  used.  The  screen  characters  are 
stored  in  a  static  RAM  and  are  sent  to  the  monitor  through  a  font 
RAM  or  ROM  which  storesthe  displayed  shape  of  each  character. 
This  font  RAM  or  ROM  is  also  known  as  the  character  generator. 
The  data  from  the  font  RAM  constitutes  the  pixels  which  are  sent 
to  the  monitor  via  serializers.  One  such  application  of  a  dual  port 
serializer  with  changeable  fonts  is  illustrated  on  page  2-275. 

PLDs  which  allow  fast  state  machines  are  ideal  candidates  for 
such  serializers.  Due  to  the  very  high  speed  requirements,  the 
most  suitable  PLDs  are  the  registered  "D"  PALdevices  at  55  MHz 
or  the  registered  ECL  PAL  devices  running  at  125  MHz  speed. 
Sometimes  buried  registers  are  also  very  useful  for  state  machine 
designs.  The  serializer  application  on  page  2-275  uses  a 
PAL32VX10A  as  it  provides  buried  registers  for  state  machine 
functions. 

Look  Up  Table  and  Digital  to  Analog 
Converter 

The  last  stage  of  a  graphics  system  is  the  look-up  table  (LUT)  and 
digital-to-analog  converter  (DAC).  The  numbers  of  simultane- 
ously displayable  colors  or  the  gray  scales  is  determined  by  the 
number  of  bits  in  a  pixel.  The  LUT  is  used  to  increase  the  number 
of  possible  colors  or  gray  scales  that  can  be  displayed  on  the 
monitor. 

The  LUT  (Figure  8)  is  a  fast  RAM  addressed  by  the  pixels  from  the 
serializers.  The  data  stored  in  the  RAMs  is  used  as  the  color  or 
gray  scale  value.  Since  the  data  speeds  can  be  very  fast  (as  little 
as  8  ns  for  a  125  MHz  display)  and  the  static  RAM  access  times 
are  relatively  slow,  multiple  static  RAMs  are  used  to  store  identical 
copies  of  look  up  tables.  Different  static  RAMs  are  addressed  for 
sequential  pixels  in  order  to  maintain  a  fast  overall  data  rate. 
Controllers  made  with  such  fast  PAL  devices  as  the 
PAL1 0H20EV8  or  PAL1 6R8/6/4D  can  be  used  to  spread  sequen- 
tial pixels  between  various  static  RAMs  for  this  function.  The  LUT 
output  is  then  routed  to  the  DAC  stage  of  the  circuit. 

The  DAC  is  the  only  analog  portion  of  the  circuit,  and  is  used  to 
convert  the  color  or  gray  scale  data  into  an  analog  format  usable 
by  the  video  monitors.  There  are  many  standard  monolithic  and 
hybrid  solutions  available  for  such  functions. 
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Figure  8.  The  Lookup  Table 
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This  small  system  video  controller  is  designed  for  an  80  character 
wide  terminal  display  of  24  rows  (Figure  1 ).  The  display  resolution 
is  640  X  400  pixels,  with  a  character  box  of  8  X  16  pixels.  The 
character  font  used  is  a  dot  matrix  of  7  X  9  pixels  which  is  stored 
in  a  character  PROM.  The  first  seven  horizontal  pixels  display  the 
character  while  the  eighth  pixel  provides  the  horizontal  character 
separation.  Similarly  thefirst  nine  lines  display  a  visible  character 
while  the  remaining  seven  are  blank  to  provide  line  separation. 
The  frame  buffer  is  an  SRAM  which  stores  character  ASCII  codes 
of  80  X  24  characters  and  can  be  addressed  by  the  system 
processor  for  update  or  by  the  display  controller. 

The  display  controller  provides  all  the  necessary  timing  and 
addressing  for  supporting  this  display.  Synchronized  to  the 
monitor  timing  signals,  it  generates  the  row  and  column  ad- 
dresses (Figure  2),  of  the  character  being  scanned.  The  row  and 
column  addresses  access  the  character  ASCII  code  from  the 
frame  buffer  at  the  instant  the  character  is  displayed.  This  ASCII 
code  addresses  the  character  PROM  which  stores  the  7  X  9  dot 
matrix  font.  Suppose  the  letter  "A"  is  to  be  displayed  (Figure  3). 
The  contents  of  the  character  PROM  pertaining  to  the  current 
character  row  and  column  are  read  and  loaded  into  the  video  shift 
register  to  be  clocked  out  as  a  video  signal.  The  video  shift 
register  must  be  seven-bits  long,  and  the  entire  character  is 
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Figure  1.  A  Character  Display 
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Figure  2.  Functional  Block  Diagram 
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displayed  over  nine  lines.  To  display  a  character  'A',  the  ASCII 
code  of  '41'  hexadecimal  selecting  the  letter  is  read  from  a  frame 
buffer  location.  This  character  code  will  point  to  the  base  location 
in  the  PROM.  This  base  location  holds  the  first  line  of  a  table  of 
nine  locations  which  hold  the  dot  information  for  each  scan  line. 
The  line  address  counter  first  selects  the  RO  line  address,  then  the 
next  line  address  R1  and  then  R2  to  R8,  until  the  entire  character 
is  displayed. 

The  character  column,  row  addresses  and  the  line  address  can 
be  generated  by  what  is  essentially  one  big  counter.  As  can  be 
seen  from  Figure  1 ,  there  are  eight  pixels  in  a  column,  80  columns 
in  a  line,  and  1 6  lines  in  a  row.  The  characters  are  counted  by 
dividing  the  pixel  clock  (DOTCLK)  by  eight,  which  is  the  number 
of  pixels  in  a  character  box.  The  character  column  addresses  are 
generated  by  a  seven  bit  counter  which  counts  up  to  80  character 
columns.  For  every  eighty  columns  of  character  addresses  a  line 
address  (a  four-bit  counter)  is  incremented.  And  finally  for  every 
16  line  addresses  a  character  row  address  is  incremented.  This 
counter  is  implemented  in  two  different  devices  which  also  gen- 
erate related  monitor  timing  signals. 

The  Implementation 

The  functions  of  the  display  controller  (Figure  4)  are  divided  into 
three  portions.  The  related  tasks  of  generating  the  system 
HSYNC  and  BLANK  signals  and  charactercolumn  addresses  are 
performed  by  a  line  sync  generator.  The  tasks  of  generating  the 
VSYNC  signal  and  the  line  address  and  character  row  address 
are  performed  by  the  frame  sync  generator.  Finally,  the  task  of 
generating  character  period  reference  from  the  DOTCLK  has 


been  conveniently  combined  with  the  video  shift  register,  which 
also  uses  this  reference  to  shift  out  appropriate  font  pixels.  The 
character  period  reference  is  also  used  by  line  sync  and  frame 
sync  generators  to  count  number  of  characters. 

The  sequential  frame  buffer  column  address  MA0-MA6  for  dis- 
playing these  80  characters  per  row,  is  generated  by  the  line  sync 
generator  (IC2  in  Figure  4).  The  line  sync  generator  is  a  counter 
clocked  by  DOTCLK  with  a  count  enable  function  under  the 
control  of  a  character  period  reference.  This  enables  it  to  count 
the  number  of  character  periods.  The  line  sync  generator  also 
generates  HSYNC  and  BLANK  signals  at  the  end  of  each  line. 

Thecharacter  display  is  divided  into  24  rows  of  16  lines  each.  The 
four-bit  line  address  (R0-R3)  is  used  by  the  character  PROM  to 
generate  the  appropriate  bits  of  font  for  the  different  lines  in  a 
character  row.  The  line  address  to  the  character  PROM  is 
generated  by  the  frame  sync  generator  (IC3),  using  the  BLANK 
signal  as  a  new  line  reference.  For  every  new  line  (BLANK  signal) 
the  line  address  counter  is  incremented.  The  frame  syncgenera- 
tor  also  generates  the  five-bit  row  address  (A4-A8)  for  the  24 
different  rows  of  characters  stored  in  the  frame  buffer.  This 
address  is  generated  by  a  counter  which  is  incremented  once  for 
every  16  counts  of  the  line  address  counter. 

The  final  component  is  the  video  shift  register  (IC1)  which 
serializes  the  character  PROM  seven-bitcharacter  line  data.  This 
function  is  accomplished  by  using  a  three-bit  counter  which 
divides  the  DOTCLKfrequency  by  eight  to  generate  the  character 
period  reference. 
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Figure  3.  Character  Generation 
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Line  Sync  Generator 


Addressing  the  Frame  Buffer 


The  PAL32VX1 0  used  for  line  sync  generation  (Figure  5)  has  a 
versatile  macrocell  structure  capable  of  transforming  a  basic  D- 
type  flip-flop  into  a  more  complex  T-type  flip-flop  which  efficiently 
implements  counter  designs.  The  seven-bit  character  column 
address  counter  uses  seven  of  these  T-type  flip-flops.  The  line 
sync  (HSYNC)  signal  is  generated  based  on  the  value  of  the 
counter.  The  HSYNC  signal  initiates  lineflyback  which  separates 
the  many  lines  of  which  a  visual  display  is  composed.  The  device 
also  generates  the  blanking  signal  for  monitor  control.  This 
design  has  been  developed  into  state  machine  logic  contained  in 
the  PAL32 


32VX10. 


The  frame  buffer  address  lines  MA0-MA6  are  shared  by  a  host 
processor  which  updates  the  memory  with  new  characters. 
These  addresses  are  generated  by  a  binary  character  period 
counter.  The  only  system  clock  conveniently  available  is  the 
DOTCLK.  Since  the  counter  has  to  keep  track  of  character 
periods.  One  easy  way  is  to  clock  the  device  with  DOTCLK,  but 
enable  count  only  when  all  three  bits  of  the  pixel  counter  S2,  S1 
and  SO  are  high.  These  three  bits  high  indicate  a  character 
boundary;  this  will  effectively  result  in  clocking  the  counter  every 
character  period. 
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Figure  4.  Small  System  Video  Controller 
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The  count  is  extended  up  to  1 07  to  provide  a  timing  reference  to 
generate  the  monitor  control  signals  (HSYNC  and  BLANK)  also. 
Seven  registers  in  the  PAL32VX10  provide  this  count.  The 
registers  are  programmed  as  T-type  flip-flops  (Figure  5)  for  opti- 
mizing the  number  of  product  terms  used.  For  each  scan  line  after 
reaching  1 07  the  T-type  counters  are  reset  to  provide  the  start  of 
the  next  active  line.  Of  the  ten  registered  cells  in  the  PAL32VX1 0, 
nine  registers  are  used  for  addressing  memory  (MA0-MA6),  line 
sync  generation  (HSYNC)  and  blanking  control  (BLANK). 

Since  the  processor  can  not  read  or  write  to  the  frame  buffer  at  the 
same  time  as  the  display  controller,  the  address  lines  (MA0-MA6) 
of  the  display  controller  are  three-stated  during  a  processor  cycle. 
When  processor  signal  /WR  is  asserted  LOW,  indicating  a  write 
cycle,  the  MA0-MA6  address  outputs  arethree-stated  toallowthe 
processor  address  bus  to  drive  the  memory  address. 

The  line  blanking  signal  (BLANK)  is  designed  to  go  active  low 
after  80  character  cycles.  This  is  followed  by  the  line  sync  signal 
(HSYNC),  which  goes  active  eight  character  cycles  later.  This 
gives  an  eight  character  back  porch  to  the  display  (see  Figures  6 
&  7).  The  line  flyback  time  (HSYNC  width)  of  1 6  blank  characters 
ensures  adequate  time  for  the  CRT  line  circuitry  to  stabilize 
before  starting  the  next  active  line  scan.  The  blanking  signal  is 
then  turned  off  three  character  periods  later  to  provide  a  three 
character  front  porch  to  the  display.  The  duty  cycle  for  one  scan 
line  is  (80+8+1 6+3)=1 07  character  periods. 

Selecting  a  DOTCLK  of  1 6  MHz,  and  a  character  of  eight  bits,  the 
resulting  2  MHz  character  cycle  can  be  divided  down  to  a  line  duty 
cycle  of: 


HSYNC  Line  frequency 


Character  freq/Line  duty  cycle 
2000  KHz  /107 
18.7  KHz 


Video  data  in  the  frame  buffer  can  be  updated  by  the  processor 
only  for  durations  when  video  is  not  displayed.  A  BLANK  signal 
is  a  good  indicator  to  the  processor  for  when  to  begin  and  stop 
updates.  Most  processors,  however,  need  an  advance  warning, 
preferably  a  few  microseconds  earlier. 

The  RDY  signal  is  used  to  provide  this  advance  warning  to  the 
processor.  When  HIGH,  the  RDY  signal  allows  data  modification 
in  the  frame  buffer.  This  flag  is  turned  off  beforethe  removal  of  the 
blanking  pulse  (BLANK)  allowing  a  margin  of  four  micro-seconds 
to  elapse  at  the  end  of  the  blanking  period.  It  is  assumed  that  four 
microseconds  is  ample  time  to  test  the  flag  and  perform  the  last 
write  operation.  However,  the  time  taken  for  a  processor  to  read 
this  flag  and  respond  can  be  programmed  into  the  PAL  device 


Figure  5.  Schematic  Diagram  of  Line  Address 
Generator  Circuit 
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depending  upon  the  software  requirements  of  the  system.  The 
RDY  signal  is  also  set  during  vertical  retrace  when  all  processor 
updates  are  allowed  and  no  display  controller  accesses  are 
required.  The  design  uses  the  system  VSYNC  for  accomplishing 
this. 

The  RDY  signal  is  also  disabled  unless  it  is  being  read  by  the 
processor.  This  is  done  because  the  RDY  signal  is  usually 
directly  connected  to  the  processor  data  bus  which  is  used  to 
carry  data  from  different  sources  (drivers).  The  disabling  is  done 
using  the  CS  chip-select  signal  asserted  by  the  processor.  If  CS 
is  driven  LOW,  the  RDY  output  buffer  is  enabled  and  the  RDYflag 
may  be  read  by  the  processor. 


Frame  Sync  Generator 

Another  PAL32VX10  (Figures  8  &  9)  is  programmed  to  perform 
the  function  of  a  frame  sync  generator  (VSYNC),  with  the  ability 
to  access  the  character  PROM  and  the  frame  buffer.  The  VSYNC 
signal  controls  frame  flyback  for  the  CRT  controller.  Each  frame 
of  displayed  data  is  terminated,  and  the  electron  beam  in  the  CRT 
performs  a  retrace  operation  prior  to  the  display  of  the  next  frame. 
Clocked  by  the  system  BLANK  signal,  it  uses  a  counter  to 
generate  the  row  address  for  the  character  being  displayed.  It 
also  generates  the  line  address  to  access  the  dot  pattern  for  each 
scan  line,  stored  in  character  PROM. 
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THE  PAL32VX10  HAS  BEEN  DESIGNED  AS  A  LINE  SYNC  GENERATOR  FOR  A 
THREE  PAL  SOLUTION  TO  A  VIDEO  DISPLAY  CONTROLLER.      THE  DEVICE 
HAS  BEEN  PROGRAMMED  TO  GENERATE  ADDRESSING  TO  ACCESS  A  CHARACTER 
RAM  MAO   -  MA6  ARE  CONFIGURED  TO  ACCESS  ASCCII   CHARACTERS  STORED  IN 
THE  RAM  FOR  AN  EIGHTY  CHARACTER  WIDE  DISPLAY.   THESE  ADDRESS  OUTPUTS 
MAY  BE  PUT  INTO  A  THREE  STATE  CONDITION  IF  THE  HOST  PROCESSOR 
REQUIRES  TO  CHANGE  THE  CONTENTS  OF  THE  DISPLAYED  MEMORY.  WITH 
THE  /WR  INPUT  ACTIVE  LOW  THE  HOST  PROCESSOR  MAY  DRIVE  THE 
ADDRESS   INPUTS  TO  ACCESS  AND  MODIFY  THE  CONTENTS  OF  THE  VIDEO  RAM. 
THE  INTERFACE  SIGNALS  TO  THE  VIDEO  SHIFT  REGISTER   (PAL  DESIGN  01 
OF  THE  THREE  PAL  DESIGNS)   ARE  S0,S1,S2,   AND  BLANK .   WHEN  S0*S1*S2 
ARE  HIGH  THE  ADDRESS  OUTPUTS  MAO  -  MA6   INCREMENT  SYNCHRONOUSLY 
AS  A  RESULT  OF  THE  DOT  CLOCK .    SO, SI  AND  S2  ARE  STATE  MACHINE 
OUTPUTS  FROM  THE  VIDEO  SHIFT  REGISTER  THAT  CONTROL  THE  LOADING 
AND  SHIFTING  OF  PIXEL  DATA.   THE  BLANK  OUTPUT  IS  ACTIVE  HIGH  AND 
IS  FED  TO  THE  VIDEO  SHIFT  REGISTER  TO  PROVIDE  A  BLANKING  SIGNAL 
DURING  LINE  SYNC  FLYBACK  WITH  A  MARGIN  FOR  FRONT  AND  BACK  PORCH. 
A  RDY  REGISTER   IS  AVAILABLE  TO  INDICATE  THAT  THE  SCREEN   IS  BLANK 
DURING  LINE  AND  FRAME  FLYBACK  PERIODS  AND  VIDEO  RAM  MAY  BE  UPDATED. 
THE  OUTPUT  IS   3-STATED  AND  IS  ENABLED  BY  AN  ACTIVE  /CS  INPUT. 
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Figure  7.  Design  File  of  Line  Sync  Generator 
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/Q7 

Q6*Q5*Q4*Q3*Q2*Q1*CEI 
Q7*HEX6B*CEI 
Q7 
GND 
/WR 

/Q8 

/Q8*HEX57*CEI 

08*HEX67*CEI 
/Q8 

GND 

/Q9 

/Q9*HEX50*CEI*/VSYNC 
/Q9*CEI*VSYNC 
Q9*HEX6  3*CEI*/VSYNC 

09 

GND 

CS 


SIMULATION 

TRACE_ON  CLK  RST  SO  SI   S2  MAO  MAI  MA 2 
MA3  MA4  MA5  MA6  HSYNC  BLANK 
RDY  /VSYNC 

SETF  WR  /VSYNC 

SETF         /CLK  RST  SO   SI   S2  /WR  CS 
CLOCKF  CLK 
SETF  /RST 

PRLDF       /QO  /Ql  /Q2  /Q3  Q4 
/Q5  /Q6  Q7  /Q8  /09 
FOR  I    :=   1   TO  60  DO 
BEGIN 

CLOCKF  CLK 
END 

SETF  VSYNC 

FOR   I    :=   1  TO  8  DO 

BEGIN  CLOCKF  CLK 

END 

/SO 
CLK 
/SI 
CLK 
/S2 


SETF 
CLOCKF 
SETF 
CLOCKF 
SETF 
CLOCKF 
TRACE  OFF 


PERIOD.    DE  MORGAN  OF 
EQU  Q7*Q6*/Q5*Q4*/Q3=K 
/Q9   =  RESET  FUNCTION. 
ASSIGN  Q3  TO  MA2 
REGISTERED  OUTPUT. 
CONTROL  O/P  WITH  /WR 
COUNT  EQUATION  Q4 
Q3*Q2*Q1*CEI   =  T. 
Q4*HEX6B*CEI   =  K. 
ASSIGN  Q4  TO  MA3 . 
ENABLE  REGISTER  O/P. 
THREE  -  STATE  CONTROL. 

COUNT  EQUATION  Q5 . 
Q4*Q3*Q2*Q1*CEI   =  T. 
Q5*HEX6B*CEI   =   K . 
ASSIGN  Q5  TO  MA4 . 
ENABLE  REGISTER  O/P. 
THREE  -  STATE  CONTROL. 

COUNT  EQUATION  Q6 . 
Q5*Q4*Q3*Q2*Q1*CEI   =  T. 
Q6*HEX6B*CEI   =  K. 
ASSIGN  Q6  TO  MAS. 
ENABLE  REGISTER  O/P 
THREE  -  STATE  CONTROL 

COUNT  EQUATION  Q7 . 
Q6*Q5*Q4*Q3*Q2*Q1*CEI  = 
Q7*HEX6B*CEI   =  K. 
ASSIGN  Q7  TO  MA6. 
ENABLE  REGISTER  O/P 
THREE  -  STATE  CONTROL 

J   -  K  EQUATION  FOR  Q8 
HEXS7*CEI   =  J. 
HEX67*CEI   =  K. 
ASSIGN  Q8  TO  HSYNC. 
ENABLE  REGISTER  O/P 
J  -  K  EQUATION  FOR  Q9 
RDY  STATUS 

FLYBACK. HEX50*CEI   =  Jl 
AND  FRAME  =  J 2 
HEX63*CEI   =  Kl  RESET 
AFTER  LINE  AND  FRAME 
ASSIGN  RDY  TO  Q9 
ENABLE  REGISTER  O/P 
WHEN  /CS  =  LOW  RDY 
OUTPUT   IS  ENABLED. 
START  OF  SIMULATION. 
TRACE  ESSENTIAL  I/P 
AND  O/P  SIGNALS. 


TEST  THREE  -  STATE 
CONTROL.  INITALISE 
INPUTS.  PERFORM 
RESET.   AND  PRELOAD 
REGISTERS . 

APPLY  60  CLOCK  PULSES 
TO  TRACE  THE  MEMORY 
ADDRESS  COUNTER,  HSYNC, 
BLANK  AND  CEO. 

TEST  THAT  AN  ACTIVE 
FRAME  WILL  SET  THE 
RDY  FLAG . 

DISABLE  COUNTER  BY 
SETTING  SO, SI  AND  S2 
LOW. 


END  OF  SIMULATION. 


Figure  7.  Design  File  of  Line  Sync  Generator  (Cont'd.) 


Memories  hr\ 


2-267 


Small  System  Video  Controller 


DOTCLK 
RST 
SO 
S1 
S2 
CEi 
WR 
TST 


RJSE 
ARRAY 

AND 
GLOBAL 
INTERCONN- 
ECT ITY 


THREE  STATE 
CONTROL 


GLOBAL 
RESET 


T 

0 

> 

Q 

T 

0 

> 

Q 

!|-CE'- 


T 

o 

> 

Q 

MEMORY 

ADDRESS 

OUTPUTS 

DRIVE  LINE 

DISPLAY 

RAM. 


T 

O 

> 

o 

tf>- 


3 


H>- 


R2 


T 

Q 

> 

Q 

H>- 


J 

0 

> 

K 

5 

The  PALdevice  is  programmed  as  a  counter.  The  least  significant 
four  bits  count  through  sixteen  line  addresses  R0-R3  for  each 
character  row.  The  higher  significant  bits  of  the  counter  track  the 
24  rows  of  characters.  The  frame  sync  pulse  is  generated  on  the 
25th  character  row  and  frame  flyback  is  turned  on  for  sixteen  line 
count  periods,  which  is  the  duration  of  the  twenty-fifth  row.  With 
24  character  rows,  each  having  sixteen  lines,  the  system  gave 
384  visible  lines.  Since  the  frame  flyback  occurs  over  sixteen 
lines,  the  total  number  of  lines  in  the  system  is  (384+16)=  400. 
The  frame  repetition  rate  is  derived  by  dividing  the  line  frequency 
by  the  total  number  of  lines  in  the  system. 


Frame  repetition  rate 


:  1 8.7  X  10J  HZ/400  (1). 

.  46.75  Hz 


The  video  terminal  used  for  testing  the  final  design  functioned 
best  at  this  frame  repetition  rate.  If  desired,  other  frame  repetition 
rates  can  also  be  selected.  For  example  a  thirteen  character  row 
with  nine  rows  active  and  the  same  7X9  character  box  would  give 
a  57  Hz  frame  rate.  Most  video  terminals  have  a  synchronizing 
circuit  that  determines  the  choice  of  frame  frequency.  This  is 
similar  to  a  phase-locked  loop,  providing  a  window  of  acceptable 
frame  oscillator  frequencies  to  which  the  display  will  be  locked. 
Outside  of  this  capture  range,  the  display  will  not  be  synchro- 
nized, and  the  result  will  look  like  a  rolling  unstable  picture. 

Signal  Assignments 

The  A/SYNC  output  is  an  active  LOW  signal.  When  displaying 
data  this  output  is  driven  HIGH.  As  with  the  line  sync  generator 
design,  the  /WR  input  is  driven  from  the  host  processor  system. 
When  LOW,  the  video  memory  address  outputs  MA7-MA11  are 
disabled.  The  host  system  may  enable  its  output  buffers  and  drive 
the  frame  buffer's  address  inputs  directly  to  modify  video  data. 

For  normal  counting,  the  CEI  input  should  be  driven  LOW.  A  TST 
input  was  provided  to  minimize  the  number  of  test  vectors 
required  to  test  the  device.  This  pin  is  driven  LOW  for  normal 
operation. 


PAL32VX1 0  USES  J-K  AND  T  FLIP-FLOP 
CONFIGURATIONS. 


Figure  8.  Schematic  of  Frame  Sync  Generator 
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TITLE  FRAME  SYNC  GENERATOR. 

PATTERN  02 

REVISION  02. 

AUTHOR  CHRIS  JAY. 

COMPANY  MMI   SANTA  CLARA,  CA. 

DATE  20TH  AUGUST  1986. 


CHIP  VIDEO_l   PAL32VX1 0 

; FRAME     SYNC  GENERATOR  PAL 

; THE  PAL  32VX10   IS  THE  THIRD  OF  A  THREE  PAL  SYSTEM  DESIGNED 
;AS  A  SIMPLE  VIDEO  CONTROLLER.   THE  DEVICE  GENERATES  ADDRESSING 
;FOR  A  CHARACTER  MEMORY  AND  CPU  RAM.   ALSO  A  FRAME  SYNC  PULSE 
;/VSYNC   IS  USED  TO  PROVIDE  FRAME  FLYBACK  AT  THE  END  OF  ONE 
; COMPLETE  FRAME.   THERE  ARE  FOUR  ROW  OUTPUTS  FROM  THE  PAL  WHICH 
; ADDRESS  THE  CHARACTER  MEMORY.   THESE  OUTPUTS  ARE  DRIVEN  FROM 
; AN   INTERNAL  COUNTER  WHICH   IS   INCREMENTED  AFTER  EACH  LINE 
;SCAN.   THE  R0,R1,R2  AND  R3   INCREMENT  AS  A  SIXTEEN  BIT  COUNTER 
; ADDRESSING  CHARACTER  MEMORY.    MA 7   -  MA11   ADDRESS  VIDEO  MEMORY 
; ACCESSING  THE  ASCII  CODES  OF  THE  CHARACTER  TO  BE  DISPLAYED 
;ON  EACH  ROW  OF  THE  VIDEO  DISPLAY.   THE  COUNTER  MA 7  -  MA11 
;IS   INCREMENTED  AFTER  ONE  COMPLETE  CHARACTER  SCAN  THAT  IS 
;SIXTEEN  LINES.   THE  /WR   INPUT  CAN  BE  DRIVEN  BY  THE  HOST  CPU 
;TO  3  -  STATE  MA 7  -  MA11  OUPUTS  AND  DYNAMICALLY  CHANGE  THE 
; CONTENTS  OF  THE  VIDEO  RAM.   THIS   INPUT  CAN  BE  MAPPED  INTO 
;HOST  PROCESSOR'S  MEMORY  ADDRESS  RANGE.   UPDATING  RAM  SHOULD 
; BE  DONE  BY  THE  HOST  DURING  LINE  AR  FRAME  FLYBACK.   THE  PAL 
; DEVICE  MAY  BE  RESET  AT  POWER  ON  AND  /CEI   SHOULD  BE  WIRED  LOW 
;TO  ENABLE  THE  ADDRESS  COUNTERS.   TST  IS  HARDWIRED  LOW,  THIS 
;PIN  WAS  USED  AS  A  TEST  INPUT  PULLED  HIGH  TO  DISABLE  SOME  OF 
; LOWER  ORDER  COUNTER  REGISTERS  WHEN  GENERATING  TEST  VECTORS 
;OF  THE  TEST  VECTORS.   THIS  AVOIDED  GENERATING  UNECESSARY 
; REPETATI VE  LOWER  ORDER  COUNTS  WHEN  TESTING  THE  COUNT 


; EQUATIONS  FOR 

HIGHER 

ORDER 

REGISTERS 

IN  THE 

COUNTER 

;CHAIN. 

;PINS 

1 

2 

3 

4 

5 

6 

CLK 

/RST 

/CEI 

NC 

NC 

NC 

;PINS 

7 

8 

9 

10 

11 

12 

NC 

NC 

NC 

/WR 

TST 

GND 

;PINS 

13 

14 

15 

16 

17 

18 

NC 

MA  7 

MA8 

MA9 

MA10 

MA11 

;PINS 

19 

20 

21 

22 

23 

24 

/VSYNC 

R0 

Rl 

R2 

R3 

VCC 

GLOBAL     Q0  Ql  Q2  Q3  Q4  Q5  Q6  Q7  08  Q9 


STRING  VARIABLE  DECLARATIONS  FOR  A  NON-INTERLACED  SYSTEM. 


STRING  HEX18NI 
STRING  HEX19NI 
STRING  ROWNI 


'Q9*Q8*Q7*Q6*Q4*Q3*/Q2*/Q1*/Q0 ' 
'Q9*Q8*Q7*Q6*Q4*Q3*/Q2*/Q1*Q0  ' 
'Q6*Q7*Q8*Q9 ' 


SET  VSYNC 
CLEAR  VSYNC 
ROW  COMPLETE 


EQUATIONS 


GLOBAL. SETF  =  RST 


/Q0 


MA  7 

MA7.CMBF  = 
MA7.TRST  = 

/Ql  :  = 

:  + : 

MA8 

MA8.CMBF  = 
MA8.TRST  = 


/QO 

CEI*ROWNI 

CEI*HEX19NI 

QO 

GND 
/WR 

/Ql 

Q0*CEI»ROWNI*/Q5 
Ql 
GND 
/WR 


RESET  AT  POWER  UP 

COUNT  EQUATION. 

TOGGLE  WHEN  CEI*ROWNI  TRUE 

RESET  AFTER  FRAME  SCAN. 

ENABLE  MA 7  OUTPUT 

SET  REGISTER  MODE 

THREE  -  STATE  O/P  WHEN  /WR 

ACTIVE. 

COUNT  EQUATION  Ql 

ENABLE  TOGGLE 

ENABLE  MA8  OUTPUT . 

SET  REGISTER  MODE. 

THREE  -  STATE  O/P  WHEN  /WR 

ACTIVE. 


Figure  9.  Design  File  of  Frame  Sync  Generator 
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MA9 

MA9.CMBF  = 
MA9.TRST  = 

/Q3  :  = 


MA10 

MA10.CMBF 
MAIO.TRST 


/Q4 


MA11 

MAU.CMBF 
MAI 1 . TRST 


/Q5 


/VSYNC 
/VSYNC.CMBF 

/Q6  :  = 

:  + : 

RO 

RO.CMBF  = 

/Q7  :  = 


Rl 

Rl .CMBF 


/Q8 


R2 

R2 . CMBF 


R3 

R3 .CMBF 


02 

GND 
/WR 

/Q3 

Q2*Q1»QO*CEI*ROWNI 
Q3*HEX19NI*CEI 
Q3 
GND 
/WR 

/Q4 

Q3*Q2*Q1*QO*CEI*ROWNI 
Q4*HEX19NI*CEI 
Q4 
GND 
/WR 

/QS 

/Q5*HEX18NI«CEI*ROWNI 
Q5*HEX19NI*CEI*ROWNI 

/OS 
GND 

/Q6 
CEI 
06 
GND 

/Q7 

Q6*CEI*TST 
/Q7*/TST 

07 

GND 

/QB 

Q6*Q7*CEI*TST 
/Q8*/TST 
Q8 

GND 

/Q9 

Q6*Q7*Q8*CEI*TST 
/Q9*/TST 
09 
GND 


SIMULATION 

TRACE_0N  CLK  RST  MA7  MA8  MA9  MA10 

MA11   /VSYNC  RO  Rl   R2  R3 
SETF         /CLK  RST  CEI  /WR 
CLOCKF  CLK 
SETF         /RST  /TST 
PRLDF       QO  Ql  Q2  /Q3  Q4 

A35  /Q6   /Q7  /Q8  /Q9 
I    :=  1  TO  40  DO 
CLOCKF  CLK 


FOR 
BEGIN 
END 


SETF  TST 

PRLDF       00  Ql  Q2  /Q3  Q4 

A35  /Q6  /Q7  Q8  Q9 
I    : =   1   TO  4  0  DO 
CLK 


FOR 
BEGIN 
END 
SETF 
FOR 
BEGIN 
END 

TRACE  OFF 


CLOCKF 

/CEI 
I   :=  1 
CLOCKF 


TO  8  DO 
CLK 


TltHSftr-rfJflSR    tlOillt  SI_»N 
ENABLE  MA9  OUTPUT. 
SET  REGISTER  MODE . 
THREE  -  STATE  O/P  WHEN  /WR 
ACTIVE. 

COUNT  EQUATION  Q3 

ENABLE  TOGGLE 

RESET  AFTER  FRAME  SCAN 

ENABLE  MA10  OUTPUT. 

SET  REGISTER  MODE 

THREE  -  STATE  O/P  WHEN  /WR 

ACTIVE. 

COUNT  EQUATION  Q4 

ENABLE  TOGGLE 

RESET  AFTER  FRAME  SCAN 

ENABLE  MA11  OUTPUT 

SET  REGISTER  MODE 

THREE  -  STATE  O/P  WHEN  /WR 

ACTIVE. 

J  -  K  VSYNC  O/P 

J  EQUATION  TO  SET  VSYNC 

K  EQUATION  TO  RESET  VSYNC 

ENABLE  Q5  TO  VSYNC. 

SET  REGISTER  MODE 

COUNT  EQUATION  FOR  Q6 
TOGGLE 

FRAME  SYNC.   ENABLE  Q6 
REGISTER  OUTPUT  FOR 
ROW  0. 

COUNT  EQUATION  FOR  Q7 
ENABLE  TOGGLE 
DISABLE  COUNT  DURING  TEST. 
ENABLE  Q7  REGISTER  OUTPUT 
FOR  ROW  1. 

COUNT  EQUATION  FOR  Q8 
ENABLE  TOGGLE 

DISABLE  COUNT  DURING  TEST. 
ENABLE  Q8  REGISTER  OUTPUT 
FOR  ROW  2. 

COUNT  EQUATION  FOR  Q9 
ENABLE  TOGGLE 
DISABLE  COUNT  DURING  TEST. 
ENABLE  Q9  REGISTER  OUTPUT 
FOR  ROW  3. 

SIMULATION  SECTION 
TRACE  ESSENTIAL  SIGNALS 

SET  INITIAL  CONDITIONS 
ENABLE  COUNT  ON 
DISABLE  ROW  Rl,   R2,  R3. 
PRELOAD  INITIAL  STATE 
TWO  COUNTS  BEFORE  FRAME 
SYNC.   CLOCK  INPUT  TO  VIEW 
WAVEFORMS  OF  ROW  0,  VSYNC 
AND  MEMORY  ADDRESSES  MA 7  TO 
MA11  . 

DISABLE  TEST  MODE,   THE  ROW 
COUNTERS  ARE  ENABLED. 
PROVIDE  CLOCK   INPUTS  TO 
GENERATE  TEST  VECTORS  TO 
VERIFY  THE  NORMAL  COUNT 
OPERATION  OF  THE  ROW 
ADDRESS  COUNTER. 
DISABLE  COUNT  MODE  AND 
APPLY   CLOCK   INPUT  TO 
VERIFY  THE  COUNTER 
HOLD  OPERATION. 


Figure  9.  Design  File  of  Frame  Sync  Generator  (Cont'd.) 
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Video  Shift  Register 

The  third  component  of  this  video  system  (Figures  10  &  11), 
provides  a  means  of  converting  parallel  data  into  a  high  speed 
serial  data  stream.  The  PAL32VX10  implements  one  such  video 
shift  register,  where  it  receives  parallel  data  from  a  character 
memory  and  clocks  the  serial  data  out  at  a  rate  determined  by  the 
pixel  clock  (DOTCLK).  The  conversion  of  parallel  character  data 
to  a  serial  stream  is  synchronized  to  the  character  period  of  the 
system.  The  character  period  is  generated  by  dividing  the 
DOTCLK  by  eight  and  this  circuit  has  been  conveniently  imple- 
mented in  the  same  device.  The  design  also  offers  blanking 
controls  and  a  reverse  video  option. 

The  PAL32VX10  contains  ten  registers,  three  of  which  are  used 
as  a  binary  counter  which  synchronously  counts  through  eight 
states  at  a  rate  determined  by  the  DOTCLK  of  the  video  system. 
This  count  to  eight  determines  a  character  period  reference  since 
there  are  eight  pixels  (dots)  in  every  character.  The  outputs 
S0-S2  are  used  by  the  rest  of  the  system  as  a  character  period 
reference. 

The  three-bit  output  S2,  S1  and  SO  of  the  character  period  counter 
are  also  used  internally  as  a  timing  reference  for  loading  new 
character  data.  When  all  three  S2,  S1  and  SO  are  high,  the 
remaining  seven  registers  of  the  device  Q1-Q7  are  loaded  with 
the  seven  bit  character  data  from  inputs  D0-D6.  The  registers 
used  as  a  counter  are  Q8,  Q9  and  Q1 0.  Q1 0,  the  least  significant 
bit,  is  assigned  to  the  output  pin  SO,  Q9  is  assigned  to  pin  S1 ;  and 
Q8  is  assigned  to  pin  S2. 

The  "new  character"  load  command  is  given  as  a  string  statement 
in  the  design  specification:  STRING  LOAD  'Q8'Q9*Q10'. 

This  parallel  load  is  followed  by  seven  shift  states  when  the  data 
is  transferred  from  one  register  to  another.  The  binary  sequence 
of  0-6  at  counter  S2-S0  enable  this  synchronous  data  shift 
through  registers  Q1 -Q7,  Q7  being  the  last  register  in  the  chain. 
The  VIDEO  output  of  the  PAL  device  is  therefore  assigned  to  the 
last  register  Q7. 

The  shift  command  is  developed  from  the  DeMorgan  expansion 
of  NOT(LOAD):  08"+  09"+  Q10. 

Additional  Control  Features 

The  video  shift  register  has  a  number  of  inputs  which  control  the 
serial  output  of  the  video  data  stream.  The  BLANK  input,  when 
asserted,  inhibits  the  video,  causing  it  to  go  LOW  independent  of 
activity  at  the  parallel  data  inputs.  It  is  necessary  to  use  the 
BLANK  signal  to  override  active  video  during  line  flyback  or 
retrace.  The  VSYNC  input  also  inhibits  the  loading  of  video  data 
and  suppresses  video  drive  during  frame  flyback. 


The  reverse  video  input  REV  can  be  used  to  create  an  inverse 
video  display.  For  normal  video  this  input  is  LOW.  The  characters 
displayed  on  the  video  screen  will  be  bright  on  a  dark  background. 
When  the  reverse  video  control  is  set  HIGH,  the  video  data  is 
inverted  when  it  is  loaded.  Thedisplay  background  isthen  bright, 
and  the  displayed  characters  are  dark.  The  BLANK  and  VSYNC 
controls  have  the  same  effect  of  driving  the  video  output  LOW 
during  line  and  frame  flyback  whether  or  not  the  video  is  reversed. 

THREE  BIT  BINARY  COUNTER 


LOAD  SHIFT 
DIN  Q1 
i  — 1>    POUT         RST  jp- 


X 


LOAP      SHIFT  — 
PIN  02 
>    POUT         RST  p- 


I 


LOAP  SHIFT 
DIN  Q3 
>     POUT  RST 


LOAP  SHIFT 
PIN    .  04 
i  ~-|>     POUT  RST 


LOAD 

SHIFT  I 

DIN  Q5 

>  DOUT 

LOAD      SHIFT  — 
PIN  Q6 
!  — 1>     POUT  RST  |>- 


I 


LOAP  SHIFT 
PIN  Q7 
I — [>  POUT 


-  so 

-  S1 
•  S2 


Figure  1 0.  Schematic  of  Video  Shift  Reg  ister 
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TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 


VIDEO  SHIFT  REGISTER. 

01 

02. 

CHRIS  JAY . 

MM I  SANTA  CLARA,  CA. 
20TH  AUGUST  1986. 


THE  PAL32VX10  HAS  BEEN  DESIGNED  AS  A  VIDEO  SHIFT  REGISTER 
TO  SUPPORT  A  THREE  PAL  SOLUTION  TO  A  SMALL  VIDEO  CONTROLLER 
CIRCUIT.    SEVEN   INPUTS  FROM  THE  CHARACTER  GENERATOR  ARE  LOADED 
INTO  THE  INTERNAL  SHIFT  REGISTER     THROUGH  INPUTS  Dl   -  D7 , 
AND  SUBSEQUENTLY  SHIFTED  THROUGH  IT.   AN  INTERNAL  THREE  BIT 
BINARY  COUNTER  CONTROLS  THE  LOAD  AND  SHIFT  ACTIVITY.  WHEN 
THE  BINARY  OUTPUTS  SO,  SI  AND  S2  ARE  ALL  HIGH  THE  SHIFT 
REGISTER   IS  LOADED  WITH  NEW  DATA  FROM  THE  CHARACTER  MEMORY, 
SHIFTING  TAKES  PLACE  FOR  ALL  THE  OTHER  BINARY  STATES. 
THE  DATA  IS  LOADED  AND  SHIFTED  SYNCHRONOUSLY,   AT  A  RATE 
DETERMINED  BY  THE  APPLIED  CLOCK   INPUT.     THE  DEVICE  MAY  BE 
ASYNCHRONOUSLY  RESET  BY  APPLYING  A  LOGIC  LOW  TO  THE  /RST  PIN. 
THERE  IS  A  REV  INPUT,   WHICH  CAUSES  THE  DATA  TO  BE  REVERSED 
FOR  REVERSE  VIDEO.   THIS   INPUT   IS  HELD  LOW  FOR  NORMAL  DATA. 
THE  BLANK  AND  /VSYNC  CONTROLS  ARE  FEEDBACK   INPUTS  FROM  THE 
LINE  SYNC  AND  FRAME  SYNC  PAL  DEVICE  OUTPUTS .   THESE  CAUSE  THE 
VIDEO  DATA  OUTPUT  TO  BE  INACTIVE  DURING  LINE  AND  FRAME  SYNC . 
PERIODS.   THE  SO,   SI  AND  S2  OUTPUTS  DRIVE  THE  LINE  AND  FRAME 
SYNC  PAL32VX10  AND  ACT  AS  COUNT  ENABLE   INPUTS  TO  THOSE 
DEVICES. 

CHIP  VIDEO  1  PAL32VX10 


; VIDEO 

SHIFT  REGISTER  PAL 

;PINS 

1 

2 

3 

4 

5 

6 

CLK 

/RST 

Dl 

D2 

D3 

D4 

;PINS 

7 

8 

9 

10 

11 

12 

D5 

D6 

D7 

BLANK 

/VSYNC 

GND 

;PINS 

13 

14 

15 

16 

17 

18 

REV 

NC 

NC 

NC 

NC 

NC 

;PINS 

19 

20 

21 

22 

23 

24 

NC 

VIDEO 

S2 

SI 

SO 

VCC 

GLOBAL 

Ql  02 

Q3  Q4  05 

Q6  Q7 

Q8  09  Q10 

STRING 

LOAD 

•08*09*010 ' 

; ENABLE 

LOADING 

OF 

EQUATIONS 

GLOBAL. SETF  =  RST 


/Q10 
SO 

50.  CMBF 

m 

SI 

51 .  CMBF 

/Q8 
S2 

52.  CMBF 


Q10 
Q10 
GND 

/Q9 

Q10 

Q9 
GND 

/QB 

Q10*Q9 

Q8 

GND 


ASYNCHRONOUS  RESET. 

INTERNAL  STATE  COUNTER 
LEAST  SIGNIFICANT  BIT  Q10. 
ENABLE  REGISTER  TO  PIN 
SO  . 

INTERNAL  STATE  COUNTER 
Q10  =  HIGH  =  TOGGLE. 
ENABLE  REGISTER  OUTPUT 
TO  PIN  SI . 

INTERNAL  STATE  COUNTER 
Q10*Q9   =  HIGH  =  TOGGLE . 
ENABLE  REGISTER  OUTPUT 
TO  PIN  S2. 


/Ql 


/Dl *  LOAD*/REV* /BLANK  */VSYNC 
Dl * LOAD* REV* /BLANK* /VSYNC 

/Q8*/REV 

/Q9*/REV 

/Q10*/REV 
BLANK 
VSYNC 


LOAD  DATA   INPUT  Dl . 
LOAD  DATA  INPUT  /Dl . 
SHIFT  HIGH  INTO  Ql . 


BLANK  DATA  DURING 
LINE  AND  FRAME  SYNC 
INPUTS . 


Figure  11.  Design  File  of  Video  Shift  Register 
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aMonoUtMaMMon.oMesa 


Small  System  Video  Controller 


/Q2 


A23 


/Q4 


/Q5 


/Q6 


/Q7 


/VIDEO 
/VIDEO. CMBF 


SIMULATION 
TRACE  ON 


SETF 

FOR  J    :  =   0  TO  5 
BEGIN 

IF  J   =   1  THEN 

BEGIN  SETF  VSYNC  /BLANK 

END 


/D2* LOAD* /REV* /BLANK */VSYNC 
D2  *  LOAD* REV*  /BLANK  */VSYNC 

/Ql*/Q8 

/Ql*/Q9 

/Ql*/Q10 
BLANK 
VSYNC 

/D3*LOAD*/REV*/BLANK*/VSYNC 
D3* LOAD* REV* /BLANK* /VSYNC 

/Q2*/Q8 

/Q2*/Q9 

/Q2*/Q10 
BLANK 
VSYNC 

/D4* LOAD* /REV*/BLANK* /VSYNC 
D4  *  LOAD*  REV  * /BLANK* /VSYNC 

/Q3*/Q8 

/Q3*/Q9 

/Q3*/Q10 
BLANK 
VSYNC 

/D5  *  LOAD* /REV* /BLANK* /VSYNC 
D5* LOAD* REV* /BLANK* /VSYNC 

/Q4*/Q8 

/Q4*/Q9 

/Q4*/Q10 
BLANK 
VSYNC 

/D6*LOAD*/REV*/BLANK*/VSYNC 
D6* LOAD* REV* /BLANK* /VSYNC 

/Q5*/Q8 

/Q5*/Q9 

/Q5*/Q10 
BLANK 
VSYNC 

/D7*LOAD*/REV*/BLANK*/VSYNC 
D7* LOAD* REV* /BLANK* /VSYNC 

/Q6*/Q8 

/06*/Q9 

/Q6*/Q10 
BLANK 
VSYNC 

=  /Q7 

=  GND 


CLK  SO   SI  S2 
VIDEO  /VSYNC  BLANK 
REV  Ql  Q2  03  Q4  Q5 
Q6  Q7   Dl   D2  D3  D4  D5 
D6  D7  RST 
/CLK  RST  REV 
/Dl  /D2  D3  D4 
D5  /D6  /D7 
BLANK  /VSYNC 
/RST 
DO 


LOAD  DATA  INPUT  D2  . 
LOAD  DATA  INPUT  /D2 . 
SHIFT  DATA  FROM  01 
INTO  Q2,    FOR  SIX 
BINARY  COUNT  PERIODS. 
BLANK  DURING  LINE 
AND  FRAME  PERIODS. 

LOAD  DATA  INPUT  D3 . 
LOAD  DATA   INPUT  /D3. 
SHIFT  DATA  FROM  Q2 
INTO  Q3,    FOR  SIX 
BINARY  COUNT  PERIODS. 
BLANK  DURING  LINE 
AND  FRAME  PERIODS. 

LOAD  DATA  INPUT  D4. 
LOAD  DATA   INPUT  /D4 . 
SHIFT  DATA  FROM  Q3 
INTO  Q4,    FOR  SIX 
BINARY  COUNT  PERIODS. 
BLANK  DURING  LINE 
AND  FRAME  PERIODS. 

LOAD  DATA  INPUT  D5. 
LOAD  DATA  INPUT  /D5 
SHIFT  DATA  FROM  Q4 
INTO  Q5,   FOR  SIX 
BINARY  COUNT  PERIODS. 
BLANK  DURING  LINE 
AND  FRAME  PERIODS. 

LOAD  DATA  INPUT  D6 . 
LOAD  DATA  INPUT  /D6 
SHIFT  DATA  FROM  Q5 
INTO  Q6,   FOR  SIX 
BINARY  COUNT  PERIODS. 
BLANK  DURING  LINE 
AND  FRAME  PERIODS. 

LOAD  DATA  INPUT  D7 
LOAD  DATA  INPUT  /D7 
SHIFT  DATA  FROM  Q6 
INTO  Q7,   FOR  SIX 
BINARY  COUNT  PERIODS. 
BLANK  DURING  LINE 
AND  FRAME  PERIODS. 
ENABLE  REGISTER 
OUTPUT  Q7  TO  VIDEO 
PIN. 


START  OF  SIMULATION 
TRACE  ALL  ESSENTIAL 
INPUTS  AND  OUTPUTS. 


SET  INITIAL  CONDITIONS. 
APPLY  ASYNCHRONOUS 
RESET.    SET  REVERSE 
VIDEO  INACTIVE  AND 
CREATE  ACTIVE  BLANK 
AND  REMOVE  RESET. 
INPUTS  GENERATE  FIVE 
LOOPS . 

SET  VSYNC  TO  TEST 
BLANKING. 


Figure  11.  Design  File  of  Video  Shift  Register  (Cont'd.) 
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; INPUT.   REMOVE  ACTIVE 
;VSYNC.   CLEAR  ALL 
; BLANKING  CONTROLS. 
; TEST  THE  LOADING 
; AND  SHIFTING  OF 
; DATA .   FOR  REVERSE 
; VI DEO  ACTIVE  AND 
; INACTIVE . 
; GENERATE  LOOPS 
;TO  LOAD  AND  SHIFT 
i DATA . 


END  OF  SIMULATION 


Figure  11.  Design  File  of  Video  Shift  Register  (Cont'd.) 


DOTCLK  £  V- 

J       3  Vcc 

DOTCLK  C  ^ 

^       H  Vcc 

DOTCLK  £  V- 

J       3  Vcc 

RST  £ 

□  RDY 

RST  □ 

D  R3 

RST  C 

□  so 

so  Z 

3  HSYNC 

CEI  C 

3  R2 

D1  C 

□  81 

S1  Z 

□  MA6 

NC  C 

3  R1 

D2  C 

□  S2 

S2  C 

□  MA5 

NC  C 

]  RO 

D3  C 

□  VIDEO 

VSYNC  C 

□  MA4 

NC  C 

3  VSYNC 

D4  C 

3  NG 

NC  C 

□  MA3 

NC  C 

D  MA11 

D5  fj 

□  NC 

NC  C 

□  MA2 

NC  C 

3  MA10 

D6  C 

□  NC 

NC  C 

□  MA1 

NC  C 

H  MA9 

D7  C 

I!  NC 

NC  Z 

□  MAO 

WR  C 

□  MAS 

BLANK  Z 

3  NC 

WR  C 

J  BLANK 

TST  £ 

3  MA7 

VSYNC  C 

3  NC 

GND  Z 

□  CS 

GND  C 

J  NC 

GND  C 

□  REV 

PAL32VX10 

PAL32VX10 

PAL32VX1 0 

LINE  SYNC  GENERATOR  FRAME  SYNC  GENERATOR  VIDEO  SHIFT  REGISTER 


END 

IF  J  =   3  THEN 

BEGIN  SETF  Dl  /D2  D3  /D4  D5 

/D6  D7  /BLANK  /REV 

END 

IF  J  =   4  THEN 
BEGIN  SETF  REV 
END 

FOR  I    :=   1  TO  8  DO 
BEGIN 

CLOCKF  CLK 

END 

END 

TRACE  OFF 


Figure  12.  Pinouts  for  Various  Components  of  a  Small  System  Graphics  Controller 


PAL32VX10  Uses  Buried  Register 
for  Input-Intensive 
State  Machine  Designs 


The  PAL=32VX10  offers  design  engineers  several  features  not 
previously  found  in  programmable  logic  devices  (PLDs).  Many 
of  these  features  are  found  in  the  output  macrocell,  which  al- 
lows over  thirty  possible  configurations.  This  application  note 
will  highlight  some  of  these  features,  and  show  how  they  are 
used  in  a  Dual  Port  Video  Shift  Register. 

The  PAL32VX10  is  a  24-pin  PLD  with  twelve  dedicated  inputs. 
The  ten  I/O  macrocells  have  dual  feedback,  providing  twenty 
additional  array  inputs.  Each  macrocell  has  the  architecture 
shown  in  Figure  1 . 


CLOCK  INPUT 


AP 

ASYNCHRONOUS 
PRESET 

Figure  1.  PAL32VX10  Macrocell 

Macrocell  Description 

A  D-type  flip-flop  is  clocked  by  a  dedicated  input  which  also 
feeds  the  programmable  array.  The  flip-flop  data  input  is  fed  by 
a  varied  number  of  product  terms,  from  eight  to  sixteen  per  out- 
put. The  sum  of  these  products  feeds  an  Exclusive-OR  (XOR) 
gate  controlled  by  an  additional  product  term.  Global  reset  and 
preset  signals  are  also  controlled  by  product  terms. 
A  fuse  controls  flip-flop  output  polarity.  A  product  term  allows 
the  flip-flop  to  be  bypassed,  and  another  product  term  enables 
the  output.  Feedback  signals  come  from  the  flip-flop  and  the 
I  /O  pin,  providing  dual  feedback. 


The  device  is  manufactured  with  Monolithic  Memories'  ad- 
vanced oxide-isolated  process,  providing  high  speed  and  low 
power  consumption.  The  propagation  delay  is  30  ns  maximum 
for  the  standard  version  and  25  ns  maximum  for  the  A-speed 
version.  Power  consumption  is  only  180  mA  maximum. 

J-K  Programmable  Function 

The  XOR  gate  in  each  of  the  ten  macrocells  provides  the  de- 
signer with  the  option  to  extend  the  functionality  of  the  D-type 
register  to  perform  J-K  and  subset  functions,  such  as  T-type 
toggle  flip-flops  for  binary  counter  applications.  This  option  can 
aid  product  term  economy  considerably.  In  many  state  ma- 
chine designs  large  numbers  of  product  terms  are  needed  as 
holding  functions  for  a  D-type  flip-flop.  The  hold  function  is  not 
inherent  to  the  truth  table  of  a  D-type  register,  but  in  a  J-K 
function  it  is,  so  a  logic  designer  can  invoke  the  hold  condition 
without  using  up  any  of  the  product  terms  in  the  macrocell. 
It  can  be  shown  that  there  exist  five  logic  equations  for  a  J-K 
function  using  a  D-type  flip-flop,  an  XOR  gate,  and  sum  of 
product  inputs; 
•t.Q>Q\J  +  Q*K 

2.  Q:=Q*J+Q*K 

3.  Q  :=  Q  :+:  (Q*J  +  Q*K) 

4.  Q:=Q:+:(Q*J  +  Q*K) 

5.  Q  :=  Q  :+:  (Q\T+  Q*K) 

See  Appendix  1  for  the  derivation  of  these  equations. 
The  key  feature  to  note  about  these  equations  is  that  the  first 
two  can  be  realized  from  the  sum  of  two  product  terms  but  ei- 
ther J  or  K  must  be  inverted.  A  DeMorgan  expansion  of  a  com- 
plex J  or  K  input  might  result  in  using  too  many  product  terms, 
and  may  not  fit  in  the  PAL32VX10.  This  limitation  can  be  cir- 
cumvented by  using  equation  3;  here,  J  and  K  inputs  need  no 
inversion  but  there  is  a  requirement  for  the  XOR  operator,  as 
shown  in  equations  3,  4  and  5.  Alternatively,  the  non-inverted  J 
and  K  functions  may  use  more  terms  than  the  inverted  ver- 
sions, pointing  to  the  use  of  equations  1 ,  2,  or  5.  The  macrocell 
structure  is  capable  of  making  use  of  any  one  of  these  five 
equations,  allowing  the  designer  to  choose  the  one  that  makes 
optimum  use  of  product  terms. 

The  fuse  array  was  extended  to  9,738  programmable  fuses  to 
accommodate  ten  additional  feedback  paths  from  the  ten  ma- 
crocells. These  are  the  ten  buried  feedback  paths  (from  the 
flip-flop)  that  make  it  possible  to  realize  buried  register  state 
machine  designs.  Any  macrocell  whose  output  pin  is  pro- 
grammed as  an  input  still  has  available  its  internal  feedback 
path,  and  the  registered  cell's  functionality  need  not  be  lost. 
The  cell  could  be  programmed  as  a  buried  register  for  use  in  a 
buried  state  machine.  This  feature  extends  the  application  of 
this  device  to  input-intensive  designs  such  as  the  Dual  Port 
Video  Shift  Register. 
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Figure  2.  Dual  Port  Video  Shift  Register  in  Video  System 


Dual  Port  Video  Shift  Register 

This  design  highlights  the  the  advantages  of  using  PAL  devices 
for  implementing  custom  designs.  This  design  is  shown  in  the  PAL 
device  Design  Specification  file  on  page  2-279.  Figure  2  shows 
how  the  device  would  be  configured  to  a  Microprocessor 
Unit  (MPU)  system  as  a  parallel-to-serial  converter  in  video 
applications. 

The  PAL32VX10  in  the  system  is  capable  of  supporting  one  of 
two  parallel  input  paths  from  character  storage  elements.  The 
selected  port  input  is  converted  to  a  serial  video  data  stream 
through  the  VIDEO  output  for  display  on  a  CRT  screen.  Events 
of  loading  and  shifting  are  synchronized  to  a  Dot  Clock  input 
which  is  applied  to  pin  1  of  the  device,  and  is  the  rate  at  which 
the  character  dots  are  displayed  on  the  CRT.  There  are  two  in- 
put ports,  DA1-DA7  and  DB1-DB7.  One  of  these  inputs  is  select- 


ed by  the  SELA  control.  When  SELA  is  High  the  DA1 -DA7  input 
is  loaded,  and  subsequently  shifted  to  the  VIDEO  output.  When 
SELA  is  Low  the  DB1-DB7  input  is  loaded  and  shifted. 

The  DA1-DA7  port  could  be  configured  to  a  character  PROM, 
the  content  of  which  is  a  fixed  character  font.  A  second  character 
memory  could  be  configured  to  the  DB1-DB7  port,  which  could 
be  a  RAM  circuit.  While  the  standard  character  font  of  the  PROM 
is  being  displayed  on  the  CRT,  the  RAM  memory  could  be  pro- 
grammed by  the  host  CPU  with  a  specialized  character  font.  The 
contents  of  the  RAM  could  be  changed  at  any  time,  so  a  wide 
variety  of  custom  fonts  could  be  loaded,  while  the  system  is  dis- 
playing the  regular  standard  font  stored  in  the  PROM.  Switching 
of  the  SELA  input  is  controlled  by  the  host  system  to  select  the 
required  character  memory. 
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Each  of  the  two  input  ports  is  seven  bits  wide  and  is  designed 
to  interface  with  character  matrix  sizes  of  7  by  9.  However,  if 
the  SELA  input  is  controlled  by  an  external  register  which  de- 
codes the  state  outputs  SO,  S1  and  S2,  the  two  ports  may  be 
configured  as  one  single  port,  fourteen  bits  wide. 
The  inputs  BLANK  and  VSYNC  are  used  to  force  a  video  blank- 
ing output  during  line  and  frame  flyback.  The  BLANK  input 
overlaps  the  line  sync  signal  in  a  practical  application  to  create 
front  and  back  porch  blanking  on  the  CRT  screen. 

Of  the  ten  registered  macrocells  in  the  PAL32VX10,  seven  are 
connected  as  a  parallel  load  serial  shift  register  to  the  VIDEO 
output.  The  three  remaining  registers  are  configured  as  a 
three-bit  binary  counter.  The  counter  outputs  SO,  S1  and  S2 
determine  whether  the  device  is  in  load  or  shift  mode.  When 
S0-S2  are  all  High  the  shift  register  loads  character  informa- 
tion; otherwise,  shifting  takes  place  for  the  seven  remaining 
states  of  the  binary  counter.  The  output  pins  SO,  S1  and  S2  are 
connected  to  the  buried  registers  Q10,  Q9  and  Q8  respective- 
ly, and  are  assigned  in  the  PAL  device  Design  Specification  by 
SO  ■  Q10,  etc.  Similarly,  the  VIDEO  output  pin  is  assigned  reg- 
ister Q7.  Registers  Q1  to  Q6  remain  buried;  the  output  pins 
normally  associated  with  these  register  cells  have  been  as- 
signed as  the  dedicated  DB  port  inputs  DB2-DB7. 
The  three-bit  binary  counter  has  been  designed  using  a  T-type 
toggle  register.  Choosing  Equation  4  and  equating  J  =  K  =  1  for 
the  Toggle  function  and  J  =  K  =  0  for  the  Hold  function,  in  gen- 
eral J  =  K  =  T.  The  new  input  for  a  Toggle  function  becomes  T 
and  the  equation  becomes: 

6.5:=Q:+:(Q  +  Q)'T 
but  since 

7.  (Q  +  Q)  =  1 

the  equation  becomes: 


The  oscilloscope  photograph  in  Figure  4  shows  the  SO,  S1  and 
S2  outputs,  running  from  the  top  trace  down,  and  the  serial 
VIDEO  data  on  the  bottom  trace.  The  Load  event  takes  place 
when  all  state  outputs  are  High.  For  the  first  section  of  the 
trace,  port  DB7-DB1  is  loaded  and  subsequently  shifted 
through  the  buried  register  file.  The  cursor  on  the  right  marks 
the  end  of  one  Load/Shift  cycle.  The  bit  pattern  loaded  into 
DB7-DB1  is  HLLHHHL.  In  the  second  half  of  the  the  trace,  port 
DA7-DA1  is  selected.  The  bit  pattern  of  LLHLHLH  is  clocked 
into  the  DA7-DA1  port  and  shifted  through  to  the  VIDEO  out- 
put. 


HLLHHHL  LLHLHLH 

Figure  4.  Video  Shift  Register  Waveforms 


8.  Q  >  Q  :+:  T. 

The  basic  function  in  a  binary  counter  is  that  a  flip-flop  must  re- 
main in  a  Hold  state  until  the  product  term  made  up  of  all  of  the 
less  significant  flip-flops  in  the  count  sequence  becomes  a  log- 
ic High.  So,  the  equation  for  Q8  in  the  design  specification  be- 
comes: 

9.  08  :=  Q8  :+:  Q9'Q10. 

When  Q9'Q10  =  0  flip-flop  Q8  holds  its  contents,  and  when 
Q9"Q10  =  1  Q8  changes  state. 

The  PAL  device  Design  Specification  shows  that  eighteen  in- 
puts and  four  outputs  are  required,  but  the  buried  register  fea- 
ture has  not  wasted  the  macrocells  whose  associated  pins 
have  been  used  as  inputs.  The  design  has  been  programmed 
with  two  other  PAL  devices  (not  shown  here),  one  as  a  line 
sync  generator  and  one  as  a  frame  sync  generator,  to  form  the 
basis  of  a  low-cost  video  system. 

A  simulation  section  has  been  included  with  the  design  to  veri- 
fy functional  operation  before  the  Design  Specification  is  com- 
mitted to  the  fuse  array  of  the  PAL  device.  It  is  recommended 
that  the  A-speed  part  is  used  in  this  specific  application  to  ac- 
commodate good  resolution  of  the  displayed  video  information. 
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The  complete  truth  table  of  a  J-K  function  is  shown  in  Table  1 . 
Inputs  J  and  K  are  tabulated  with  the  current  state  of  the  regis- 
ter (QN)  and  the  required  next  state  (QN+1).  Equations  1  and  2 
from  the  text  are  derived  by  directly  grouping  ones  (S  and  s)  or 
zeros  (R  and  r)  in  the  truth  table.  The  other  equations  for  the  J- 
K  function  are  derived  by  tabulating  active  'SET'  and  'RESET' 
functions  and  passive  'set'  and  'reset'  functions. 
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Q  =  /Q*J  +  QVK  EQ.  1 

/Q  =  /Q*/J  *  Q"K  EQ.  2 

Q  =  Q  :+:  (/Q*J  +  Q*K)  EQ.  3 

10  =  /Q  :+:  </Q*J  +  Q'K)  EQ.  4 

Q  =  Q     (/QVJ  +  QVK)  EQ.  5 

Tablel.  Truth  Table  of  J-K  Function  Using  Change  of  State. 


Identifying  Transition  and  Hold  conditions  as  well  as  logic  High 
and  Low  states  enables  the  designer  to  draw  a  Karnaugh  map 
for  an  XOR-gate  register  input.  The  XOR  gate  has  the  capabili- 
ty of  a  Hold  function  for  the  remain  in  'set'  and  'reset'  states, 
and  a  Change-of-State  function  for  active  'SET'  and  'RESET'. 
To  derive  Equation  3, 
Q  :=  Q  :+:  (Q*J  +  Q'K), 
the  following  equation  is  considered: 
Q:=s:+:(R  +  S), 

or,  "Q  holds  unless  an  active  transition  is  true."  The  entries  in 
the  equation  show  Hold  and  Transition  states.  When  both  'S' 
and  'R'  are  inactive  Low  the  register  Q  is  in  a  Hold  condition  by 
the 's'  entry  in  the  equation.  If 's'  and  'R'  become  High  simulta- 
neously the  result  will  be  an  active  Reset  after  the  following 
clock  edge;  for  this  reason  minimization  of  the 's'  equation  can 
include  any  'R'  entries  in  the  Karnaugh  Map.  Also,  the  states  of 
'S'  and 's'  are  mutually  exclusive,  so  with 's'  inactive  and  'S'  ac- 
tive the  output  will  transition  to  a  Set  condition. 

Minimize  all  the  Hold  'set'  states  but  include  the  active  'RESET' 
states  for  the  entry  to  the  left  of  the  XOR  function.This  gives  Q 
as  shown  in  group  2  of  the  Karnaugh  map.  Then  identify  the 
change-of-state  entries  shown  by  the  capital  letters  'S'  and  'R'. 
Minimization  achieved  in  groups  1  and  3  are  shown  to  the  right 
of  the  equation. 

With  regard  to  the  equation  for  Q  a  similar  procedure  applies: 
Q  :=  r :+:  (S  +  R) 

This  time  all  the  passive  'reset'  conditions  are  minimized  with 
active  'SET'  states,  and  XORed  with  minimization  of  active 
'SET'  and  'RESET'  states  (group  4).  This  leads  to  Equation  4. 

Equation  5  is  derived  by  examining  the  equation  for  "Q  will  tog- 
gle unless  a  Hold  is  true",  or 

Q:=Q:+:(s  +  r); 

the  derivation  is  similar  to  those  above. 
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TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 


DUAL  PORT  VIDEO  SHIFT  REGISTER. 

01B 

01 

CHRIS  JAY. 

MM I  SANTA  CLARA,  CA. 
15TH  MARCH  1987. 


THE  PAL32VX10  HAS  BEEN  DESIGNED  AS  A  VIDEO  SHIFT  REGISTER 
WITH  A  DUAL  SEVEN  BIT  PORT  INPUT.     PARALLEL  CHARACTER  DATA 
MAY  BE  LOADED  FROM  A  FIXED  CHARACTER  ROM  VIA  PORT  INPUTS 
DAI  -  DA7,  WHILE  THE  SELA  INPUT  IS  HIGH.     PORT  INPUTS  DB1-DB7 
MAY  BE  CONNECTED  TO  A  CHARACTER  RAM,   THE  CONTENTS  OF  WHICH 
CAN  BE  CHANGED  BY  THE  HOST  MICROPROCESSOR.     THIS  ENABLES  THE 
VIDEO  SYSTEM  DISPLAY  A  STANDARD  CHARACTER  FONT  WHILE  DYNAMICALLY 
CHANGING  CUSTOM  CHARACTER  FONTS  IN  THE  CHARACTER  RAM.     WHEN  A 
NEW  CHARACTER  DATA  IS  LOADED  INTO  THE  RAM  THE  SELA  INPUT  CAN  BE 
DRIVEN  LOW  TO  SELECT  THE  DB1-DB7  PORT  INPUT.     THE  ON  CHIP 
STATE  COUNTER  SO,    SI  AND  S2  FORMS  A  SYNCHRONOUS  BINARY  COUNT. 
STATES  0  TO  6  REPRESENT  SHIFT  ACTIVITY  IN  THE  SEVEN  BIT  VIDEO 
SHIFT  REGISTER.     WHEN  THE  COUNTER  REACHES  A  COUNT  OF  SEVEN  THE 
NEXT  CHARACTER  DATA  IS  LOADED  INTO  THE  REGISTER  VIA  PORT  DA  OR  DB. 
THERE  ARE  TWO  BLANKING  INPUTS  /VSYNC  AND  BLANK.     WHEN  /VSYNC  IS 
LOW  THE  CONTENTS  OF  THE  VIDEO  SHIFT  REGISTER  ARE  LOADED  WITH  LOGIC 
ZEROS.      THIS  IS  ALSO  THE  CASE  WHEN  BLANK  IS  ACTIVE  HIGH.  THE 
VSYNC  AND  BLANK  INPUTS  SHOULD  BE  USED  DURING  FRAME  AND  HORIZONTAL 
FLYBACK,  RESPECTIVELY. 

CHIP  VIDEO_2  PAL32VX10 

DUAL  PORT  VIDEO  SHIFT  REGISTER  PAL 


;PINS 

1  . 

2 

3 

4 

5 

6 

CLK 

DAI 

DA  2 

DA  3 

DA  4 

DA  5 

;PINS 

7 

8 

9 

10 

11 

12 

DA6 

DA7 

BLANK 

/VSYNC 

SELA 

GND 

;PINS 

13 

14 

15 

16 

17 

18 

DB1 

DB2 

DB3 

DB4 

DB5 

DB6 

;PINS 

19 

20 

21 

22 

23 

24 

DB7 

VIDEO 

S2 

SI 

SO 

VCC 

GLOBAL 

Ql  Q2 

Q3  Q4  Q5 

Q6  Q7 

Q8  Q9  Q10 

STRING 

LOAD 

"Q8*Q9* 

Q10' 

; ENABLE 

LOADING 

OF 

EQUATIONS 

/Q10  := 
SO 

50.  CMBF  = 

/Q9  :  = 

•  +  • 

SI 

51.  CMBF 


Q10 
Q10 
GND 

/Q9 

Q10 

Q9 
GND 


INTERNAL  STATE  COUNTER 
LEAST  SIGNIFICANT  BIT 
Q10.    ENABLE  REGISTER 
OUTPUT  FOR  PIN  SO. 
INTERNAL  STATE  COUNTER 
Q10  =  HIGH  =  TOGGLE. 
ENABLE  PIN  SI 
REGISTERED  OUTPUT. 
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/Q8 

•  = 

/Q8                            ; INTERNAL  STATE  COUNTER 

:  +  : 

Q10*Q9                     ;Q10*Q9  =  HIGH  = 

TOGGLE . 

S2 

= 

Q8                              ; ENABLE  PIN  S2  AS  A 

S2.CMBF 

= 

GND                            ; REGISTERED  OUTPUT. 

/Ql 

:  = 

LOAD  DATA  INPUT 

DAI 

+ 

/HR1  *T,OAn*  /<?FT,A*  /RT.ANK*  /U^YNf 

LOAD  DATA  INPUT 

DB1 

+ 

SHIFT  HIGH  INTO 

Ql. 

+ 

/Q9 

WHEN  Q10,Q9  AND 

Q8 

+ 

/Q10 

ARE  NOT  ALL  HIGH 

+ 

otinfln 

SIMULTANEOUSLY. 

+ 

Vol  IMV- 

LINE/ FRAME  BLANKING 

/Q2 

:  = 

/r^a  *5  *  t  r\a  r\*  c  ipt  a*  /dt  a  m  iv  a  /wcvwr1 
/  DAZ  KLUAD*  bEiLA*/  BLANK  */  vbi  NL 

LOAD  DATA  INPUT 

DA  2 

+ 

/nP9*rnan*  /gt?t  a*  /rt  awv*  /t/ovkip 

/  UDZ     1j\JHU"  /  OCjIjA    /  QljANft    /VOX  NL 

LOAD  DATA  INPUT 

DB2 

/QL*/Q° 

SHIFT  DATA  FROM 

Ql 

+ 

/Q1VG9 

INTO  Q2. 

+ 

/Ql*/Q10 

+ 

BLANK 

LINE  AND  FRAME 

+ 

tjcvwp 

BLANKING . 

/Q3 

:  = 

/DA J  * LOAD* SELA*/ BLANK* /VSYNC 

LOAD  DATA  INPUT 

DA  3 

+ 

/r^n  t  *  r  r\  a  r\  Jt  /opt  a*  /dt  ami/*  /ttcvmp 
/  Ub  J  *  LUAD  x  /  ztbiL  A  *  /  BLANK  "/Vbi  NL. 

LOAD  DATA  INPUT 

DB3 

/Q2*/Q8 

SHIFT  DATA  FROM 

Q2 

+ 

/Q2*/Q9 

INTO  Q3. 

+ 

/Q*2*/Q10 

+ 

BLANK 

LINE  AND  FRAME 

+ 

VbYNL 

BLANKING. 

/Q4 

:  = 

/ DA4 * LUAD* SbLA*/ BLANK* /VSYNC 

LOAD  DATA  INPUT 

DA4 

+ 

/  DB4  * LUAU*/ bLLA*/ BLANK */ Vox NC 

LOAD  DATA  INPUT 

DB4 

/Q3*/Q8 

SHIFT  DATA  FROM 

Q3 

+ 

/Q3*/Q9 

INTO  Q4. 

+ 

/Q3*/Q10 

+ 

BLANK 

LINE  AND  FRAME 

+ 

VSYNC 

BLANKING. 

/Q5 

:  = 

/DAb "LOAD*  SELA*/ BLANK* /VSYNC 

LOAD  DATA  INPUT 

DA  5 

+ 

/  1JB  b  *  LUAU  */StLA*/  BLANK  "/VSYNC 

LOAD  DATA  INPUT 

DB5 

X 

/Q4*/Q8 

SHIFT  DATA  FROM 

Q4 

+ 

/Q4*/Q9 

INTO  Q5. 

+ 

+ 

BLANK 

LINE  AND  FRAME 

+ 

VSYNC 

BLANKING. 

/Q6 

:  = 

/DA6 * LOAD* SELA* /BLANK* /VSYNC 

LOAD  DATA  INPUT 

DA6 

+ 

/DB6* LOAD */SELA* /BLANK* /VSYNC 

LOAD  DATA  INPUT 

DB6 

+ 

/Q5*/Q8 

SHIFT  DATA  FROM 

Q5 

+ 

/Q5*/Q9 

INTO  Q6. 

+ 

/Q5*/Q10 

+ 

BLANK 

LINE  AND  FRAME 

+ 

VSYNC 

BLANKING. 
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/Q7  :=  /DA7* LOAD* SELA* /BLANK* /VSYNC 

+  /DB7* LOAD* /SELA* /BLANK* /VSYNC 

+  /Q6*/Q8 

+  /Q6*/Q9 

+  /Q6*/Q10 

+  BLANK 

+  VSYNC 

/VIDEO  =  /Q7 

/VIDEO.CMBF  =  GND 


LOAD  DATA  INPUT  DA6 . 
LOAD  DATA  INPUT  DB6 . 
SHIFT  DATA  FROM  Q6 
INTO  Q7. 

LINE  AND  FRAME 
BLANKING. 

OUTPUT  Q7  TO  VIDEO. 


SIMULATION 

TRACE_ON  CLK  SO  SI  S2 

VIDEO  /VSYNC  BLANK  SELA 
Ql  Q2  Q3  Q4  Q5  Q6 
DAI  DA 2  DA 3  DA 4  DA 5  DA6  DA7 
DB1  DB2  DB3  DB4  DB5  DB6  DB7 

PRLDF  Ql  Q2  Q3  Q4  Q5     Q6  Q7  Q8  Q9  Q10 

SETF  /CLK 

DAI  /DA2  DA 3  /DA4  DA 5  /DA6  DA7 
/DB1  DB2  /DB3  DB4  /DB5  DB6  /DB7 
SELA  BLANK  /VSYNC 

FOR  I    :=  0  TO  7  .DO 

BEGIN  CLOCKF  CLK 

END 

SETF  /SELA 
FOR  I   :=  0  TO  7  DO 
BEGIN  CLOCKF  CLK 
END 

SETF         /BLANK  SELA 
FOR  I   :=  0  TO  7  DO 
BEGIN  CLOCKF  CLK 
END 

SETF  /SELA 
FOR  I   :=  0  TO  7  DO 
BEGIN  CLOCKF  CLK 
END 

SETF  VSYNC 
FOR  I    :=  0  TO  7  DO 
BEGIN  CLOCKF  CLK 
END 

TRACE  OFF 


START  OF  SIMULATION 
TRACE  ALL  ESSENTIAL 

INPUTS  AND  OUTPUTS. 


PRELOAD  REGISTERS 

SET  INITIAL  CONDITIONS. 

ON  DATA  A  INPUT  PORT 

AND  DATA  B  INPUT  PORT. 

ENABLE  A  PORT  AND 

BLANK  INPUT. 

BEGIN  THE  DOT  CLOCK 

INPUT  FOR  EIGHT  CLOCK 

CYCLES.    SELECT  B  INPUT 

ENABLE  CLOCK  FOR  EIGHT 

CLOCK  CYCLES. 

REMOVE  BLANKING  INPUT 
SELECT  A  PORT  CLOCK 
FOR  EIGHT  CYCLES. 

ENABLE  B  INPUT  PORT 
CLOCK  FOR  EIGHT  CLOCK 
CYCLES. 

ENABLE  VSYNC  INPUT 
CLOCK  FOR  EIGHT 
CLOCK  CYCLES. 

END  OF  SIMULATION 
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Digital  Signal 
Processing 

The  field  of  Digital  Signal  Processing  (DSP)  was  originally  limited 
to  the  digital  filtering  and  processing  of  analog  signals.  Today,  the 
field  encompasses  a  wide  range  of  numeric  processing  applica- 
tions, primarily  due  to  the  versatility  of  the  Fast  Fourier  Transform 
algorithm  with  its  accompanying  repetitive  addition  and  multipli- 
cation operations. 

The  ability  to  perform  multiplication  and  addition  in  a  single  clock 
cycle  is  the  key  to  achieving  high-speed  mathematics  in  DSP 
hardware.  When  executing  math-intensive  algorithms,  even  the 
slowest  DSP  components  are  at  least  ten  times  faster  than  con- 
ventional microprocessors.  When  a  DSP  solution  is  used,  it  is 
usually  for  speed  considerations.  Conventional  microprocessors 
are  not  optimized  for  arithmetic,  and  have  to  simulate  math 
through  software.  This  simulation  tends  to  require  several  clock 
cycles.  DSP  systems  use  specialized  hardware,  including  a 
multiplier  and  adder,  that  allow  them  to  execute  numeric  opera- 
tions in  a  single  cycle.  A  typical  DSP  system  is  shown  in 
Figure  1. 


<: 


MAIN  SYSTEM  BUS 


ADDRESS 
CONTROL 


ROM 
Z 


ARITHMETIC 
PROCESSOR 


MICROPROGRAM  CONTROL 


Figure  1.  HIgh-Performance  DSP  System 

In  addition,  the  architecture  must  try  to  provide  data  to  the 
multiplier  and  adder  at  every  clock  cycle,  for  the  highest  efficiency. 
This  can  be  done  by  using  an  architecture  where  data  and 
instructions  are  stored  separately  and  have  their  own  buses,  one 
for  data  and  one  for  instructions.  If  two  data  buses  are  used,  two 
data  elements  can  be  moved  to  the  multiplier  or  adder  in  parallel 
for  execution,  while  the  CPU  is  fetching  an  instruction.  Another 
efficient  architecture  uses  pipelining,  allowing  the  processor  to 
work  on  subsections  of  the  operation  in  parallel. 


Application  Areas 

The  algorithms  implemented  in  DSP  components  are  being 
applied  to  a  wide  variety  of  systems.  Examples  include  standard 
signal  processing  for  applications  from  compact  disc  players  to 
artificial  intelligence,  matrix-manipulation  algorithms  for  graphics 
systems,  filtering  algorithms  used  in  image-processing  systems, 
and  floating-point  algorithms  used  in  workstations  and  scientific 
computers.  Many  of  these  applications  are  often  categorized 
under  "numerical  processing"  or  "number  crunching"  rather  than 
DSP,  but  any  algorithm  that  makes  extensive  use  of  repetitive 
multiplication  and  addition  operations  can  benefit  from  DSP 
hardware.  The  primary  application  area  is  communications,  with 
military  applications  also  a  major  part  of  the  market.  The 
application  areas  are  listed  below. 

•  Communications 
-Codecs 
-Modems 

-Multiplexers/Demultiplexers 
-Echo  Cancellation 
-Filtering 

-Gain  equalization 

•  Military 
-Radar/Sonar 
-Guidance  and  Control 
-Telemetry 

-Electronic  Warfare/Countermeasures 

•  Computers 

-PC-Board  Add-Ons  for  PCs 
-Minicomputers/Workstations 
-Parallel/Array  Processors 

•  Graphics  and  Image  Processing 
-CAD/CAE 

—Image  Enhancement  and  Restoration 

-Image  Subtraction,  Compression,  Decompression 

-CAT  Scanning 

•  Industrial  Automation 
-CAM 

-Vision  Systems/Robotic  Vision  and  Other  Senses 
-Artificial  Intelligence 

•  Voice  Data  Entry/Synthesis 

•  Test  Equipment 

•  Process  Control 

•  Consumer  Products 
-Television 

-Compact  Disc  Players 


Digital  Signal  Processing 


ANALOG 


Figure  2.  Typical  General-Purpose  DSP  Processor  System 


Speed  versus  Other  Requirements 


As  mentioned,  most  DSP  designs  are  chosen  because  of  speed. 
However,  speed  is  not  always  a  critical  element  once  a  DSP 
solution  is  selected:  the  slowest  DSP  design  will  still  be  fasterthan 
the  fastest  conventional  microprocessor  design. 

Speed  remains  a  critical  element  in  most  communications  appli- 
cations. An  exception  is  modems,  which  require  single-chip 
solutions  for  size  reduction  and  do  not  require  high  speed  to 
handle  voice  frequencies.  Speed  is  also  critical  in  computer 
applications,  but  price  and  performance  advantages  are  also 
important.  DSP  add-in  boards  require  small  size,  and  therefore 
single-chip  solutions.  In  military  applications,  however,  price  is 
less  critical,  and  designs  are  typically  customized  for 
performance. 

Consumer  applications  use  custom  single-chip  solutions  for  both 
size  reduction  and  cost  reduction;  these  are  usually  produced  by 
a  captive  supplier.  Many  of  the  other  applications  require  a 
flexible  solution  and  a  short  development  time,  both  efficiently 
provided  by  programmable  logic  devices. 


ALGORITHM  #1 
Am29540  FFT 
ADDRESS  SEQUENCER 


ALGORITHM  #3 


ADDRESS 
PIPELINE 
Am 29520/21 


Single  Chip  versus  Building  Block 

Similar  to  the  beginnings  of  the  conventional  microprocessor, 
dedicated  DSP  processors  are  being  introduced  along  with 
development  software.  Single-chip  DSP  solutions  offer  easier 
software  development,  smaller  size,  and  lower  cost.  Several 
DSP  processors  are  commercially  available,  and  are  taking  the 
place  of  conventional  microprocessors  in  DSP  systems  (Figure 
2).  Most  are  general-purpose  processors,  but  application-spe- 
cific processors  are  also  being  developed. 

In  addition  to  these  dedicated  DSP  processors,  custom  proces- 
sors can  be  built  with  an  ASIC  methodology.  Custom  processors 
are  especially  efficient  for  high-volume  applications. 

The  alternative  to  a  single-chip  DSP  processor  is  a  building-block 
solution  using  multiple  components.  These  components  would 
typically  involve  bit-slice,  multiplier/accumulators,  ALUs,  and 
program  memory,  as  performed  in  the  application  note  on  page 
2-307.  These  solutions  provide  higher  speed  and  are  efficient  in 
speed-critical  or  low-volume  applications.  An  example  is  medical 
image  processing,  where  only  a  few  dedicated  image  filtering 
routines  are  performed  repeatedly  on  the  images — the  building 
blocks  can  be  designed  to  perform  quickly  a  small  set  of  functions. 
Many  applications  use  a  mix  of  single-chip  solutions  with  accom- 
panying building  blocks  for  added  functionality  (see  Figure  3). 

HOST  COMPUTER 

 I 


IMAGINARY  DATA 


REAL 
PROCESSOR 
Am 29501 


ADDRESS 
PIPELINE 
Am29825 


SINE/COSINE 
COEFFICIENTS 
Am29526/7 


IMAGINARY 

PROCESSOR 
Am  29501 


MULTIPLIERS 
Am29516A/17A 


MICROPROGRAMMED 

CONTROL  UNIT 

Figure  3.  Am29500  Processor  Family  Architecture. 
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PLD  Applications 

PLDs  are  useful  both  in  single-chip  and  building  block  designs,  tf 
DSP  components  are  used  as  accelerators  in  conjunction  with 
standard  microprocessors,  the  interface  logic  can  be  placed 
into  PLDs. 

When  supporting  DSP  processors,  PLDs  perform  functions 
similar  to  those  supporting  a  general  purpose  microprocessor. 
These  include  address  decoding,  interrupt  control,  DMA  control, 
and  bus  control.  See  the  section  on  Microprocessor-Based 
Systems  for  more  detail. 

Programmable  logic  devices  are  also  useful  in  the  building-block 
applications.  PLDs  often  perform  functions  similar  to  their  use  in 
standard  bit-slice  designs,  as  counters  or  sequencers. 

The  need  for  a  continuous  supply  of  data  to  the  numeric  compo- 
nents of  the  DSP  system  provides  opportunities  for  PLDs  in  both 
building-block  and  single-chip  designs.  Handshaking  between 
components,  especially  between  the  numeric  chips  and  the 


memory  or  sequencer,  is  also  required,  and  with  multiple  buses 
in  the  system,  complex  bus  interface  logic  is  needed. 

The  application  note  on  page  2-307  shows  PAL  devices  providing 
several  functions  for  a  building-block  DSP  system  which  imple- 
ments an  audio  spectrum  analyzer.  The  PAL  devices  create  the 
sequencerfor  program  execution,  multiplex  the  dataoutputs to  an 
oscilloscope,  and  provide  several  miscellaneous  control  func- 
tions. 

PLDs  provide  an  effective  solution  in  any  type  of  DSP  architec- 
ture. General-purpose  DSP  processors  need  PLDs  to  perform 
typical  processor  support  functions,  while  ASIC  processors  may 
need  additional  logic  to  provide  the  proper  interface  to  other 
chips.  In  building  block  systems,  PLDs  perform  both  the  basic 
control  functions  and  interface  functions.  With  limited  availability 
of  software  development  tools  for  most  DSP  applications,  pro- 
grammable logic  devices  offer  the  flexibility  to  implement  last- 
minute  design  changes  to  adapt  to  the  needs  of  the  software  and 
application. 
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Waveform  Generator 


A  simple  waveform  generator  can  be  built  with  a  PAL  device  ad- 
dressing locations  within  a  PROM.  Using  a  PAL20X1 0  device  to 
address  locations  in  a  registered  PROM  (Monolithic  Memories 
63RA1 681 ),  digital  data  may  be  accessed  and  fed  to  a  Digital  to 
Analog  Converter  (DAC).  The  locations  in  the  memory  are  pro- 
grammed with  appropriate  data  to  provide  a  sinusoidal,  triangu- 
lar, sawtooth  or  step  function  output.  A  square  wave  output  may 
be  generated  directly  from  the  PAL  device  itself. 

Digital  vs.  Analog  Waveform 
Generation 

There  are  advantages  and  disadvantages  in  generating  analog 
functions  by  this  method  when  considered  against  a  dedicated 
analog  circuit  design.  The  main  advantages  favoring  the  digital 
method  of  waveform  generation  involve  controllability.  In  this 
design  example  phase  shift  generation  can  be  controlled  within  a 
tolerance  of  one  degree.  The  PALdevice  has  been  programmed 
as  a  180  degree  up/down  counter.  An  initial  count  and  up/down 
control  may  be  loaded  into  the  counter  so  precise  phase  shifts  can 
be  generated.  When  considering  the  phase  shift  of  a  sinusoidal 
waveform  in  an  analog  circuit,  complex  Resistor-Capacitor  (RC) 
networks  might  need  to  be  added  to  generate  phase  shifting. 

Another  advantage  is  the  ability  of  the  system  to  generate  a  wide 
range  of  waveform  types,  and  the  convenience  of  generating  new 
waveforms  by  simply  re-programming  a  PROM,  or  programming 
blank  locations  in  the  existing  PROM.  Also,  if  it  were  a  require- 
ment to  generate  a  burst  of  any  number  of  cycles,  this  could  be 
controlled  by  a  digital  interface  from  a  microprocessor. 

The  disadvantages  associated  with  the  digital  circuit  would  be  the 
generation  of  quantization  noise  due  to  the  encoding  of  a  digital 
code  into  a  close  approximation  of  the  analog  conversion.  For 
example,  an  eight -bit  digital  code  can  be  converted  to  256  differ- 
ent voltage  levels.  Supposing  a  1 0  volt  signal  were  to  be  evenly 
encoded  into  256  discrete  levels,  the  difference  between  each 
level  would  be  about  39  mV.  The  approximation  of  the  digital 
signal  to  its  analog  equivalent  gives  rise  to  quantization  noise,  in 
this  example  39  mV. 


A  way  of  reducing  the  quantization  noise  is  to  generate  a  larger 
digital  code.  For  a  1 0-volt  dynamic  range  encoded  by  a  fourteen- 
bit  digital  code,  the  quantization  noise  is  reduced  to  0.61  mV.  Of 
course  there  are  problems  in  increasing  the  resolution  of  the 
digital  code  in  the  respect  that  a  much  higher  sampling  frequency 
would  be  required  to  generate  the  analog  output  of  a  given  period. 

Additional  to  the  problem  of  quantization  noise  is  the  noise 
generated  by  the  super-imposition  of  elements  of  the  master 
sampling  clock  on  the  analog  wave  output.  Fortunately  the 
frequency  of  this  clock  is  much  higher  than  the  analog  output,  for 
a  resolution  of  eight  bits,  and  may  be  filtered  out  with  a  simple  RC 
filter. 

In  summary,  the  advantages  in  a  digital  waveform  generator  are 
in  the  controllability  of  phase  shifting,  and  the  number  of  cycles 
generated.  Also,  the  ability  to  guard  against  drift  might  be  an 
advantage.  With  an  analog  system,  purity  of  signal  output  and  the 
ability  to  generate  higher  frequencies  are  the  main  advantages. 

PAL  Device  Implementation 

The  diagram  in  figure  1  shows  the  circuit  used  to  create  the 
waveforms.  Photographs  of  the  waveforms  are  provided  in  figure 
2.  The  different  functions  generated  were  selectable  by  a  unique 
digital  code  applied  to  the  higher  order  address  inputs  A8,  A9,  and 
A1 0  of  the  PROM.  The  lower  order  addresses  were  driven  from 
the  PAL20X10;  outputs  Q0-Q7  were  fed  to  address  inputs 
A0-A7.  The  design  specification  of  the  1 80  Degree  Up/Down 
counter  (page  2-288)  was  encoded  into  the  PAL  device. 

Look-Up  Table  Generation 

To  generate  the  hexadecimal  look-up  table,  which  was  pro- 
grammed into  the  PROM,  a  small  TURBO  PASCAL  program  was 
written  to  provide  a  conversion  from  the  angle  (degrees)  to  the 
sine  of  the  angle,  in  a  hexadecimal  value  (page  2-290).  The 
program  generated  the  nearest  hexadecimal  value  to  100 
(sine(x)  +1),  for  a  range  of  180  degrees,  at  a  resolution  of  one 
degree.  The  sawtooth  and  triangular  waveforms  were  simple 
identity  functions;  the  PROM  location  address  was  programmed 
into  its  own  contents.  The  step  waveform  was  programmed  as 
constant  values  in  a  number  of  successive  adjacent  locations. 
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D5 
D6 
D7 
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AS 
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AO 
A1 
A2 


A9 


A10 
00 
Q1 
02 
Q3 


A3  63RA1681 

A4 

A5 

A6 

A7 


DIGrTAL  GROUND 
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Figure  1. 
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Figure  2. 
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Waveform  Generator 


TITLE   18  0DEGREE  COUNT 
PATTERN  01 
REVISION  01 
AUTHOR  CHRIS  JAY 
COMPANY  MMI  SANTA  CLARA 
DATE  17  OCTOBER  1986 

CHIP  DEGREE  PAL20X10 


;PIN 

1 

2 

3 

4 

5 

6 

CLK 

/LD 

DO 

Dl 

D2 

D3 

;PIN 

7 

8 

9 

10 

11 

12 

D4 

D5 

D6 

D7 

DOWN 

GND 

;PIN 

13 

14 

15 

16 

17 

18 

/OE 

Q0 

Ql 

Q2 

Q3 

Q4 

,-PIN 

19 

20 

21 

22 

23 

24 

Q5 

Q6 

Q7 

CUP 

CDN 

VCC 

;THE 

PAL20X10 

HAS 

BEEN 

DESIGNED 

AS     A  180 

DEC 

.•LOADABLE     UP/DOWN     COUNTER.      THE     COUNTER  OUTPUTS 
;  DRIVE     THE     ADDRESS     INPUTS     OF     A     PROM     WHICH  IS 
;  PROGRAMMED     WITH     A      'LOOK     UP'      SINE     TABLE.      A  COUNT 
;UP     FROM     ZERO     TO     180     WILL     GENERATE     ONE  HALF 
.•CYCLE     FROM     THE     PROM     CONTENTS.      THE     COUNTER  WILL 
; AUTOMAT I CALLLY     SWITCH     TO     A     DOWN     COUNT     AND  GENERATE 
;THE     SECOND     HALF     OF     ONE     COMPLETE     CYCLE.      IF  THE 
,-PROM     OUTPUTS     ARE     REGISTERED     AND     FED     TO     A  D/A 
; CONVERTER     A     CONTINOUS     SINUSOIDAL     SIGNAL     WILL  BE 
; GENERATED.      COUNTING     CAN     BE     INTERRUPTED     WITH  A 
,-LOAD.      PHASE     SHIFTING     OF     THE     SINUSIODAL  WAVEFORM 
;MAY     BE     ACHIEVED     WITH     A     RESOLUTION     OF     1  DEGREE. 

EQUATION 


/QO  :=  Q0*/LD 

+  /D0*LD 

/Ql  :=  /Q1*/LD 

+  /D1*LD 
:+:  Q0*/LD*CUP 
+  /Q0*/LD*/CUP 

/Q2  :=  /Q2*/LD 

+  /D2*LD 
i+i  Q0*Q1*/LD*CUP 
+  /Q0*/Q1*/LD*/CUP 

/Q3  :=  /Q3*/U> 

+  /D3*LD 

:+:  Q0*Q1*Q2*/LD*CUP 

+  /Q0*/Q1*/Q2*/LD*/CUP 

/Q4  :=  /Q4*/LD 

+  /D4*LD 

:+:  Q0*Q1*Q2*Q3*/LD*CUP 

+  /Q0*/QWQ2*/Q3*/LD*/CUP 


;  COUNT 
;  LOAD 

;HOLD 
,-LOAD 
;UP  COUNT 
;DOWN  COUNT 

;HOLD 
;  LOAD 
;UP  COUNT 
;DOWN  COUNT 
I 

;HOLD 
;  LOAD 
;UP  COUNT 
;DOWN  COUNT 

;  HOLD 
;  LOAD 
;UP  COUNT 
;DOWN  COUNT 

; 
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/Q5  :=  /Q5*/LD 

+  /D5*LD 

:+:  Q0*Q1*Q2*Q3*Q4*/LD*CUP 

+  /Q0*/Q1*/Q2*/Q3*/Q4*/LD*/CUP 

/Q6  :=  /Q6*/LD 

+  /D6*LD 

:+:  Q0*Q1*Q2*Q3*Q4*Q5*/LD*CUP 

+  /Q0*/Q1*/Q2*/Q3*/Q4*/Q5*/LD*/CUP 

/Q7  :=  /Q7*/LD 

+  /D7*LD 

:  +  :  Q0*Q1*Q2*Q3*Q4*Q5*Q6*/U3*CUP 

+  /Q0*/Q1*/Q2*/Q3*/Q4*/Q5*/Q6*/LD*/CUP 

/CUP  :=  /CUP*/LD 

+  DOWN*LD 

:+:  /Q7*/Q6*/Q5*/Q4*/Q3*/Q2*/Q1*Q0*/LD*/CUP 
+  Q7*/Q6*Q5*Q4*/Q3*/Q2*Q1*Q0*/LD*CUP 

/CDN  :=  CUP*/LD 

+  /DOWN*LD 

:+:  /Q7*/Q6*/Q5*/Q4*/Q3*/Q2*/Q1*Q0*/LD*/CUP 
+  Q7*/Q6*Q5*Q4*/Q3*/Q2*Q1*Q0*/LD*CUP 

SIMULATION 

TRACE_ON  CLK  /DOWN  LD  CUP  CDN 

QO  Ql  Q2  Q3  Q4  Q5  Q6  Q7 
SETF         /CLK  LD  OE  DO  Dl  D2 

/D3  /D4  /D5  /D6  /D7  DOWN 

CLOCKF 
SETF 
CLOCKF 
FOR  I 

BEGIN  CLOCKF 
END 


/LD 


0  TO  15  DO 


SETF 


/LD 


CLOCKF 
SETF 
CLOCKF 
FOR  I 
BEGIN  CLOCKF 
END 

TRACE  OFF 


/CLK  LD  /DO  /Dl  /D2 
/D3  D4  D5  /D6  D7  /DOWN 


0  TO  15  DO 


;HOLD 
;  LOAD 
;UP  COUNT 
;DOWN  COUNT 

;HOLD 
;LOAD 
;UP  COUNT 
;DOWN  COUNT 

»HOLD 
,-LOAD 
;UP  COUNT 
DOWN  COUNT 

COUNT  UP 
REGISTER 


COUNT  DOWN 
REGISTER 


; LOAD  03H 
;  AND  DOWN 
; COUNT.  SET 
;INTO  COUNT 
;MODE  AND 
; GENERATE 
;  VECTORS  TO 
;TEST  DOWN 
;TO  UP  TRANS- 
;-ITION.  LOAD 
;BOH  AND  UP 
; COUNT . 
; GENERATE 
; VECTORS  TO 
;TEST  UP  TO 
;DOWN  TRANS - 
;  -ITION. 
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{ *  * 

(* 
(« 
(* 
( * 
I* 
t* 
<* 
( * 
(* 
( < 
I* 
( 


The  program  'sinetable'  has  been  designed  to  generate  *) 

the  closest  possible  HEXIDECIMAL  value  to  100  X  [sine(x)  +1  *) 

for  -90  <=  x  =>  +90,   in  steps  of  delta  x  =  1  degree.  The  *) 

HEXIDECIMAL  PROM  address  locations  have  been  evaluated  as  *) 

as  n  =  x  +  90.  The  program  will  generate  a  SINE. DAT  file  *) 

which  will  contain  a  table  of  PROM  address  locations  from  *) 

from  00  to  B4  HEX,  and  the  contents  of  each  location  as  the  *) 

closest  possible  sine  value  as  computed  in  the  above  equation.  *) 

Program  written  by:  *) 

Chris  Jay  and  Ser-Hou  Kuang.  MMI  Santa  Clara.  *) 


const 

hexsize  = 
type 

hextype 


10; 


packed  array     [1.. hexsize]  of  char; 


var 

SinFile:  text; 
i , k : integer ; 
s , x , f : real ; 
hexar,hexcon  : 


hextype ; 


*************************************** 


**************************  ) 


( 

i* 

(*  Procedure  to  convert  decimal  values  into  hexadecimal  values 
(* 
(  1 


**************** 


Procedure  DecToHex  (dec  :  integer;  var  tohex:  hex_type); 
var 

i,  dig,   idx,  next  :  integer; 
Function  hex   (i   :   integer):  char; 


begin 

case 

i  of 

0  : 

hex 

'0'  ; 

1  : 

hex 

'  l '  ; 

2  : 

hex 

'2'  ; 

3  : 

hex 

'3'  ; 

4  : 

hex 

1  4  '  ; 

5  : 

hex 

'5'  ; 

6  : 

hex 

'6'  ; 

7  : 

hex 

'7'  ; 

8  : 

hex 

•8'  ; 

9  : 

hex 

•9'  ; 

10  : 

hex 

•A'  ; 

11  : 

hex 

'B'  ; 

12  : 

hex 

•C  ; 

13  : 

hex 

'D'  ; 

14  : 

hex 

•E'  ; 

15  : 

hex 

,F, 

end; 
end; 
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begi  n 

for  i   : =  1  to  hexsize  do 

tohex  I i ]   :=  '    1  ; 
idx     :=  hexsize; 
dig     :=  dec; 
next  :=  dig  div  16; 

while  (next  <>  0)  and     (idx  >  0)  do 
begin 

ToHex  [idx)   :=  hex  (dig  -  (next  *  16)); 
idx   :=  idx  -1; 
dig  :=  next; 
next  :=  dig  div  16; 
end ; 

ToHex  [idx)   :=  hex  (dig); 
end; 


begin 

Assign (SinFile, 1 B : SINE . DAT 1 ) ; 
Rewr i te ( SinFile ) ; 
x   :=  0; 
f   :=  0; 
i    :=  -90; 
s   :=  pi/180; 
while  i  <  91  do 
begin 

k   :=  i  +  90; 
x   :=  (i*s); 
E  !=  1  +  sin(x) ; 
£   :=  100*f; 

DecToHex  (round  (f),  hexar); 
DecToHex  (k.hexcon); 
if  i  =  -90  then 
begin 

writeln   (  1  DEGREES     LOC     100*SIN  X  +   1         HEXLOC  HEX'); 
end ; 

write  ( i : 3, '         ' ,k : 3, '    '  )  ; 
writeln  (f:10:5,'     '.hexcon,'       1    , hexar); 
if  i  =  -90  then 
begin 

writeln(SinFile, 'DEGREES  100*SIN(X)+1  HEXLOC  1 00*SIN ( X ) +1 [ HEX ] ' ) ; 
end; 

writeln(SinFile,k:5, '     ',f:10:5,    '    '.hexcon,'  '.hexar); 
i   :=  i   +  1; 
end; 

Close(SinFile) ; 


end . 
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PAL32VX10  Uses  Buried  Register 
for  Input-Intensive 
State  Machine  Designs 


The  PAL®32VX10  offers  design  engineers  several  features  not 
previously  found  in  programmable  logic  devices  (PLDs).  Many 
of  these  features  are  found  in  the  output  macrocell,  which  al- 
lows over  thirty  possible  configurations.  This  application  note 
will  highlight  some  of  these  features,  and  show  how  they  are 
used  in  Dual  Binary  Rate  Multipliers. 

The  PAL32VX10  is  a  24-pin  PLD  with  twelve  dedicated  inputs. 
The  ten  I/O  macrocells  have  dual  feedback,  providing  twenty 
additional  array  inputs.  Each  macrocell  has  the  architecture 
shown  in  Figure  1 . 


A  fuse  controls  flip-flop  output  polarity.  A  product  term  allows 
the  flip-flop  to  be  bypassed,  and  another  product  term  enables 
the  output.  Feedback  signals  come  from  the  flip-flop  and  the 
I/O  pin,  providing  dual  feedback. 

The  device  is  manufactured  with  Monolithic  Memories'  ad- 
vanced oxide-isolated  process,  providing  high  speed  and  low 
power  consumption.  The  propagation  delay  is  30  ns  maximum 
for  the  standard  version  and  25  ns  maximum  for  the  A-speed 
version.  Power  consumption  is  only  180  mA  maximum. 


CLOCK  INPUT 


33— 


FUSE  SYNCHRONOUS 
ARRAY  RESET 


OUTPUT  ENABLE 


-D- 


VARIED 
PRODUCT  TERMS; 
8,  10,  12,  14  OR  16 


33- 


REGISTER 
BYPASS  CONTROL 


REGISTER 
BYPASS 


REGISTER 
±-  POLARITY 
CONTROL 


BURIED  REGISTER 
FEEDBACK 


FEEDBACK  FROM  INPUT  PIN 


AP 

ASYNCHRONOUS 
PRESET 


Figure  1.  PAL32VX10  Macrocell 

Macrocell  Description 

A  D-type  flip-flop  is  clocked  by  a  dedicated  input  which  also 
feeds  the  programmable  array.  The  flip-flop  data  input  is  fed  by 
a  varied  number  of  product  terms,  from  eight  to  sixteen  per  out- 
put. The  sum  of  these  products  feeds  an  Exclusive-OR  (XOR) 
gate  controlled  by  an  additional  product  term.  Global  reset  and 
preset  signals  are  also  controlled  by  product  terms. 


Dual  Binary  Rate  Multipliers 

The  availability  of  the  clock  input  in  the  fuse  array  avoids  the 
addition  of  external  logic  in  applications  where  it  is  necessary 
to  use  the  system  clock  as  a  logic  input.  In  many  synchronous 
PLDs  the  system  clock  input  is  connected  only  to  the  register 
file.  The  rising  edge  of  the  clock  input  synchronizes  data  flow 
through  the  D-type  registers,  but  the  clock  is  not  available  as  a 
Boolean  signal  in  the  fuse  array. 

The  application  of  the  Dual  Binary  Rate  Multipliers  uses  the 
buried  register  capability,  the  XOR  function  to  build  counters 
with  a  T-type  register,  and  the  presence  of  the  clock  input  in 
the  fuse  array.  The  circuit  requires  sixteen  data  inputs.  There 
are  two  data  byte  inputs  DA0-DA7  and  DB0-DB7.  The  two  serial 
outputs  are  QA  associated  with  the  DA  port  and  QB  with  the  DB 
port.  The  clock  input  is  divided  by  sixteen  and  is  available  at 
the  QA  or  QB  outputs  multiplied  by  the  binary  data  inputs  DA0- 
DA3  or  DA4-DA7,  and  DB0-DB3  or  DB4-DB7. 

The  output  SELOP  (Select  Output)  selects  one  half  of  each  A 
or  B  byte  as  the  multiplying  vector.  When  SELOP  is  High  the 
lower  half  of  each  input  byte  is  selected  to  multiply  the  clock  in- 
put. When  Low,  the  upper  half  of  each  input  is  used  to  multiply 
the  clock  input. 

For  example,  if  data  inputs  DA4-DA7  and  DB4-DB7  were  binary 
encoded  as  six  and  three,  respectively,  then  six  and  three 
clock  pulses  would  be  gated  through  to  the  outputs  QA  and  QB 
respectively  while  SELOP  is  Low.  When  SELOP  goes  High  the 
data  inputs  DA0-DA3  and  DB0-DB3  are  selected  as  the  multipli- 
ers. If  DA0-DA3  contained  the  binary  encoded  input  of  eight, 
then  eight  clock  pulses  would  be  gated  to  QA.  At  the  same 
time,  if  DB0-DB3  contained  fifteen,  fifteen  clock  pulses  would 
appear  at  the  QB  output.  These  waveforms  are  shown  in  the 
oscilloscope  photograph  in  Figure  2. 
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SELOP 


PAL32VX10 


Figure  2.  Dual  Binary  Rate  Multipliers  Waveforms 


The  two  additional  outputs,  LSB  and  MSB,  are  byte-select  out- 
puts. This  internal  state  machine  has  four  unique  states,  per- 
forming a  binary  count  zero  to  three.  Just  as  the  SELOP  output 
can  be  used  to  select  the  multiplying  half  of  each  byte,  LSB 
and  MSB  may  be  decoded  externally  to  select  a  one-of-four 
byte  data  input  to  be  applied  to  parallel  inputs  A  or  B.  A  pro- 
grammable controlled  pulse  train  from  four  byte  inputs  can  then 
be  available  at  the  QA  and  QB  outputs. 


cu  rr 

24]vCC 

DAG  [T 

33]  MSB 

DA1  Q 

22]  LSB 

DA2  Q 

7j\  SELOP 

DA3  Q 

io]DB7 

DA4[7 

is]  55 

DA5 

Ts]5a 

DAS  [> 

Ti]DB6 

OA7[[ 

]s]  DBS 

DBDpj! 

15]  OBI 

obi  Pj 

30B3 

GND[i! 

ij]DB2 

Dual  Binary  Rate  Multipliers 
Figure  3.  Pin  Layout 
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PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 


05 
01 

CHRIS  JAY 

MMI  SANTA  CLARA,  CA 
AUGUST  6,  1986 


CHIP  BINRATE  PAL32VX10 

THE  PAL32VX10  HAS  BEEN  PROGRAMMED  TO  PERFORM  BINARY 
RATE  MULTIPLICATION.     THE  OUTPUTS  AT 
QA  AND  QB  ARE  ONE-SIXTEENTH  OF  THE  CLOCK  INPUT 
MULTIPLIED  BY  THE  BINARY  VALUE  OF  THE  INPUTS 
DA0-DA3  AND  DB0-DB3 ,   RESPECTIVELY,   WHEN  SELOP 
IS  HIGH;   OR,   ONE-SIXTEENTH  OF  THE  CLOCK  INPUT 
MULTIPLIED  BY  THE  BINARY  VALUE  OF  THE  INPUTS  DA4-DA7 


; AND  DB4-DB7, 

WHEN  THE 

SELOP 

SIGNAL  IS 

LOW. 

;PINS 

1 

2 

3 

4 

5 

6 

CLK 

DAO 

DAI 

DA  2 

DA  3 

DA  4 

;  PINS 

7 

8 

9 

10 

11 

12 

DA  5 

DA6 

DA  7 

DBO 

DB1 

GND 

;PINS 

13 

14 

15 

16 

17 

18 

DB2 

DB3 

DB4 

DB5 

DB6 

QA 

;PINS 

19 

20 

21 

22 

23 

24 

QB 

DB7 

SELOP  LSB 

MSB 

VCC 

GLOBAL 

Q0  Ql 

Q2  Q3  NC 

NC  NC 

SEL  LSBI 

MSBI  ; 

BURIED 

STRING  Kl    'Q0*  Ql*  Q2*/Q3' 
STRING  K2    'Q0*  Ql*/Q2' 
STRING  K3  'Q0*/Q1' 
STRING  K4    'Q0*/Q1*  Q2*  Q3 ' 
STRING  K5    'Q0*/Q1*/Q2*/Q3 ' 
STRING  CKNL    ' /CLK*  SEL' 
STRING  CKNH  '/CLK*/SEL' 

EQUATIONS 

/Q0  :=  Q0 


STRING  STATEMENTS 
TO  DECODE  USABLE 
STATES  OF  INTERNAL 
DIVIDE  BY  SIXTEEN 
BINARY  COUNTER. 
CLOCK  SELECTION 
STRING  STATEMENTS 


; CLOCK  DIVIDE  BY  TWO. 
; BURIED  REGISTER. 


/Ql  :=  /Ql   :+:  Q0 


/Q2  :=  /Q2   :+:  Q0*Q1 


/Q3  :=  /Q3   :+:  Q0*Q1*Q2 


; CLOCK  DIVIDE  BY  FOUR. 
; BURIED  REGISTER. 

; CLOCK  DIVIDE  BY  EIGHT. 
; BURIED  REGISTER. 

; CLOCK  DIVIDE  BY  SIXTEEN. 
; BURIED  REGISTER. 
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/SEL 
SELOP 


QA 


QA. 


/LSBI 
LSB 

/MSB  I 

MSB 


• 

/SEL  : 

+ :  QC 

Q-L  U-* 

!  = 

SEL 

DAO 

DA  3 

/yu   v_- r\ Di i_i 

DAI 

+ 

DA2* 

DA3 

i\i  ti\"L 

1 

T 

DA2*/DA3 

+ 

DAI* 

DA  2 

+ 

/DAI* 

DA2* 

+ 

/DAI* 

DA2* 

nnT  *      T/  C  Ip[/K1T 

DAj*  KbxLKNL 

+ 

DA  4 

w  K1KLKNH 

+ 

DA  7 

/UU  Kv_KNH 

+ 

DA  5 

*  K2*CKNH 

+ 

DA6* 

DA7 

+ 

DA6*/DA7 

*  K3*CKNH 

+ 

DA5* 

DA6 

/DA  5* 

DA6* 

+ 

/DA5* 

DA6* 

DA / *    Kb  XLKNH 

= 

VCC 

— 

DBO 

*  K1"CKNL 

+ 

DB3 

*/Q0*CKNL 

+ 

DB1 

*  K2*CKNL 

+ 

DB2* 

DB3 

*  K^*CKNL 

+ 

DB2*/DB3 

*  K3*CKNL 

+ 

DB1  * 

DB2 

*  KJ*CKNL 

+ 

/DB1  * 

DB2  * 

r\o1*      y /I  iPVHT 

UBJK  K4KOKNL 

+ 

/DB1  * 

DB2* 

DBJ*  Kb*LKNL 

+ 

DB4 

K1XLKNH 

+ 

DB7 

*  /r\ A  * /""I/MO 

+ 

DB5 

*  K2*CKNH 

+ 

DB6* 

DB7 

*  K2*CKNH 

T 

DB6*/DB7 

*  K3*CKNH 

+ 

DB5* 

DB6 

*  K3*CKNH 

+ 

/DB5* 

DB6* 

DB7 *  K4*CKNH 

+ 

/DB5* 

DB6* 

DB7*  K5*CKNH 

VCC 

•  — 

/LSBI 

:+:  SEL*Q3*Q2*Q1* 

LSBI 

=  /MSBI 

+:  LSBI*SEL*Q3*Q2*Q1*Q0 
=  MSBI 


SELECT  0/P  HIGH  TO  SELECT 
DATA  0-3,   LOW  TO 
SELECT  DATA  4-7. 


COMBINATORIAL  OUTPUT  OF 
CLOCK  MULTIPLIED  BY  THE 
DA0-DA3   INPUT  WHEN  SEL 
IS  HIGH,   AND  DA4-DA7 
WHEN  SEL  IS  LOW. 


ENABLE  COMBINATORIAL 
OUTPUT  FUNCTION. 

COMBINATORIAL  OUTPUT  OF 
CLOCK  MULTIPLIED  BY  THE 
DB0-DB3   INPUT  WHEN  SEL 
IS  HIGH,   AND  DB4-DB7 
WHEN  SEL  IS  LOW. 


ENABLE  COMBINATORIAL 
OUTPUT  FUNCTION. 


m 
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PAL32VX1 0  Uses  Buried  Register  for  Input-Intensive  State  Machine  Designs 


SIMULATION 

TRACEON  CLK  QA  QB  QO  Ql  Q2  Q3 

SELOP  LSB  MSB  DAO  DAI 
DA 2  DA3  DA 4  DA 5  DA 6  DA7 
DBO  DB1  DB2.  DB3  DB4  DB5 
DB6  DB7 

PRLDF       /QO  /Ql  /Q2  /Q3  SEL  /LSBI  /MSBI 
SETF         /CLK     /DAO  /DAI  /DA2  /DA3 
DBO  /DB1  /DB2  /DB3 
/DA4     DA 5  /DA6  /DA7 
DB4     DB5  /DB6  /DB7 
FOR     J   :=     0  TO  4  DO 
BEGIN 

IF     J  =  1  THEN 
BEGIN  SETF     DA 2  DB2  DA 6  DB6 
END 

2  THEN 


IF  J  = 
BEGIN  SETF 


3  THEN 


END 
IF  J 

BEGIN  SETF 

END 
FOR  I 

BEGIN 

SETF  CLK 

SETF  /CLK 

END 
END 
TRACE  OFF 


/DA2  DA 3  /DB2  DB3 
/DA6  DA 7  /DB6  DB7 


DA 2  DB2  DA 6  DA 6 


1  TO  32  DO 


SIMULATION  SECTION. 
TRACE  ESSENTIAL 
WAVEFORMS . 


PRELOAD  INITIAL  VALUES. 
SET  INPUT  DA0-DA3  =  0. 
SET  INPUT  DB0-DB3  =  1. 
SET  INPUT  DA4-DA7  =  2. 
SET  INPUT  DB4-DB7  =  3. 

GENERATE  FOUR  LOOPS. 
ON  SECOND  LOOP  SET 
DA0-DA3  =  4,   DB0-DB3  =  5, 
DA 4 -DA 7   =  6,   DB4-DA7  =  7. 
ON  THIRD  LOOP  SET 
DA0-DA3  =  8,   DB0-DB3  =  9, 
DA4-DA7  =  10,  DB4-DB7  =  11. 
ON  FOURTH  LOOP  SET 
DA0-DA3  =  12,  DB0-DB3  =  13, 
DA4-DA7  =  14,   DB4-DB7  =  15. 


ON  EACH  LOOP  APPLY 

32  CLOCK  PULSES. 

OUTPUTS  QA  AND  QB 

WILL  GENERATE  THE 

CLK  DIVIDED  BY  16  MULTIPLIED 

BY  THE  BINARY  CODE  AT  THE 

SELECTED  INPUT. 
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Analog  to  Digital  Conversion 


It  is  often  necessary  to  interface  analog  inputs  into  a  digital  sys- 
tem. In  digital  instruments  the  signal  being  measured  is  usually 
an  analog  level.  Transducers  are  used  to  provide  a  linear  relation- 
ship between  a  measured  input  signal  and  a  voltage  output.  An 
example  of  a  transducer  would  be  a  strain  gauge.  In  this  case  the 
amplitude  of  the  voltage  output  would  be  a  measure  of  mechani- 
cal stress.  If  the  measuring  device  were  microprocessor-based 
or  minicomputer-based,  the  voltage  level  would  have  to  be 
converted  to  adigital  binary  code  before  it  could  be  read  and  proc- 
essed. An  analog  to  digital  converter  would  be  required  to  accom- 
plish this  task. 

Conversion  Techniques 


Successive  Approximation 

A  medium  to  high  conversion  speed  can  be  realized  with  the 
Successive  Approximation  technique  of  conversion,  and  many 
circuits  exist  to  perform  this  type  of  conversion.  Figure  1  shows 
a  diagram  of  an  A  to  D  converter  that  uses  a  PAL20RS10  as  a 
Successive  Approximation  Register.  The  advantage  of  using  the 
successive  approximation  technique  of  conversion  is  that  the 
circuit  is  relatively  simple  when  using  a  PAL  device,  and  the  cycle 
of  conversion  is  constant.  Conversion  using  counters  is  no  more 
simple,  and  does  not  give  a  constant  conversion  time  becausethe 
time  required  to  produce  the  closest  possible  binary  output  is 
dependent  on  the  amplitude  of  the  input  signal. 


There  are  a  number  of  techniques  used  to  provide  analog  to 
digital  (A  to  D)  conversion.  The  slowest  methods  of  A  to  D 
conversion  are  based  on  counters,  and  would  probably  be  used 
in  instruments  such  as  multimeters.  An  example  would  be  the 
self-calibrating  Dual  Slope  Integrator.  The  fastest  A  to  D  convert- 
ers are  based  on  look-up  tables  and  might  be  used  in  video 
applications.  In  the  case  of  a  Flash  Converter  high  conversion 
speed  is  obtained  at  the  expense  of  a  very  large  number  of  input 
comparator  circuits.  An  eight-bit  digital  output  code  would  require 
2a-1  comparator  circuits  at  the  input.  The  comparator  outputs 
would  drive  a  look-up  table  from  which  the  closest  binary  weight- 
ing to  the  analog  input  would  be  referenced.  Techniques  to 
simplify  the  input  circuitry  would  reduce  the  number  of  compara- 
tors required.  A  Half  Flash  converter  would  accomplish  this  but 
with  some  degradation  in  conversion  speed. 


The  PAL20RS10  has  been  programmed  as  a  state  machine  to 
perform  the  approximation  cycle.  The  Design  Specification  title 
SAR8  (page  2-300)  refers  to  the  eight-bit  Successive  Approxima- 
tion Register.  To  provide  more  bits  of  resolution,  a  fourteen-bit 
SAR14  has  been  programmed  into  a  PAL32R16  (page  2-303). 

Quantization  Error 

Why  would  a  system  require  additional  bits  of  data  in  the  SAR 
circuit?  Suppose  a  two-digit  code  is  used  to  represent  a  dynamic 
voltage  range  of  0  V  to  9  V.  A  linear  mapping  would  be: 


BINARY  CODE 

00 

01 

10 

11 

Input  Voltage  (Volts) 

0.0 

3.0 

6.0 

9.0 

EIGHT  BIT  DIGITAL  OUTPUT 


SYSTEM  CLOCK  INPUT- 


r 

GND 

VCC 


CLK 
CMP 
POL 


EN 


CLK 

OCTAL  REGISTER 

LD 


CC 
°7 
Q6 
05 

°1 

Qo 

EOC  \-  EOC  (END  OF  COUNT) 


COMPARATOR 
CIRCUIT 


ANALOG  INPUT 


Figure  1.  Eight-Bit  Successive  Approximation  Analog  to  Digital  Converter 


KM 
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unary  Bits  to  represent  any  voltages  between  each  separate 
avel.  The  difference  between  the  actual  value  and  the  approxi- 
nation  is  the  quantization  error.  The  resolution  can  be  increased 
md  quantization  error  reduced  by  adding  more  bits  to  encode  the 
lata.  With  a  three-digit  code  resolution  improves: 


BINARY  CODE 

VOLTAGE  LEVELS  (VOLTS) 

000 

0.0000 

001 

1.2857 

010 

2.5714 

011 

3.8571 

100 

5.1429 

101 

6.4286 

110 

7.7143 

111 

9.0000 

However,  there  remain  voltage  levels  that  cannot  be  accurately 
represented.  In  general,  if  we  assume  an  input  dynamic  range 
(Vdr)  is  to  be  encoded  into  equal  discrete  levels  (Vq)  by  a  number 
(n)  of  bits  of  binary  data,  the  relationship  becomes: 


Vdr/(2"  -  1 )  =  Vq, 

If  a  dynamic  range  of  1 0  V  were  represented  by  eight  bits  of  digital 
data  the  input  voltage  level  could  be  resolved  to  an  accuracy  of 
approximately 

10/(2B- 1)  =  approximately  40  mV. 

For  some  systems  this  might  be  accurate  enough.  To  decrease 
quantization  error  even  further  the  PAL32R16  has  been  pro- 
grammed to  perform  successive  approximation  with  14  bits  of 
binary  information.  With  the  given  figures  the  resolution  is 

10/(2"- 1)  =  approximately  0.6  mV. 

PAL20RS10  Implementation 


An  active-LOW  RESET  input  has  been  added  to  the  design  so  a 
synchronous  reset  may  be  applied  to  the  circuit.  When  LOW,  the 
clock  input  will  set  Q7  HIGH,  CC,  EOC,  and  Q6-Q0  LOW,  shown 
in  Figure  2  at  t,. 

The  SAR  output  (Q7  HIGH  and  Q6-Q0  LOW)  provides  a  binary 
input  to  the  D  to  A  converter,  generating  a  corresponding  analog 
signal.  The  comparator  will  compare  this  output  with  the  current 
value  of  the  analog  input.  If  voltage  level  of  the  converted  signal 
is  greater  than  the  current  level  of  the  analog  input  the  binary 
weighting  of  the  SAR  contents  is  too  large.  The  feedback  from  the 
comparator  output  to  the  CMP  input  will  cause  the  contents  of  Q7 
to  be  RESET  on  the  next  clock  rising  edge,  t,  in  Figure  2.  If  the 
level  of  the  converted  signal  is  less  than  the  input,  the  binary 
weighting  of  the  contents  of  the  SAR  is  not  large  enough.  The 
comparator  feedback  will  cause  the  Q7  register  contents  to 
remain  SET  after  the  arrival  of  the  clock  rising  edge  t2.  Simulta- 
neously Q6  will  be  SET  HIGH  to  perform  the  next  lesser  significant 
approximation. 

At  t  the  result  of  the  next  CMP  input  is  registered  at  Q6  while  Q5 
is  SET  for  the  next  lesser  significant  compare.  The  entire  cycle 
is  completed  with  the  contents  of  Q7-Q0  containing  the  closest 
approximation  of  an  eight-bit  digital  code  to  the  current  analog 
input.  The  end  of  one  complete  cycle  finishes  at  t10  and  starts 
again  at  t„. 

Afterthe  CMP  input  has  been  loaded  into  a  register,  the  contents 
of  that  register  must  be  held  for  the  duration  of  the  approximation 
cycle.  When  the  digital  code  is  set  up  in  the  register  file  of  the 
PAL20RS10  it  may  be  synchronously  loaded  into  an  octal  re- 
gister, as  shown  in  Figure  1.  CC  will  provide  an  enabling  HIGH 
(Figure  2)  and  the  digital  data  is  registered  at  time  t10  in  the  cycle. 
The  input  to  the  comparator  should  be  held  constant  during  the 
conversion  cycle  in  a  SAMPLE  and  HOLD  circuit.  The  EOC  signal 
may  be  used  as  an  interrupt  input  to  the  host  processor,  which  can 
read  the  data  over  an  eight-bit  data  bus  by  enabling  the  output  of 
the  octal  register. 


The  design  shown  in  Figure  1  is  an  eight-bit  A  to  D  system.  The 
PAL20RS10  has  a  clock  input  which  synchronously  clocks  the 
state  machine  programmed  into  the  PAL  device.  The  Q0-Q7 
outputs  will  contain  the  closest  digital  binary  weighting  to  the 
analog  input  (to  a  resolution  of  40  mV  for  a  10-volt  range)  after 
each  approximation  cycle.  Q7  is  the  most  significant  bit. 


The  CC  signal  is  an  active-HIGH  Conversion  Complete  signal. 
CC  can  be  converted  to  provide  an  additional  bit  of  code  to  make 
a  nine-bit  SAR  if  increased  system  resolution  were  required.  The 
EOC  output  is  an  End  Of  Count  or  Conversion  signal. 


The  input  CMP  is  driven  by  the  output  of  a  very  high  tolerance 
comparator  circuit.  The  TTL  level  output  is  fed  to  the  CMP  input 
and  clocked  into  the  selected  register  during  each  compare 
operation.  The  POL  input  is  hard-wired  HIGH  or  LOW;  this 
influences  the  polarity  of  the  CMP  input.  The  CMP  input  can  be 
either  active-HIGH  or  LOW  as  selected  by  the  POL  control. 

The  EN  input  is  the  ENable control.  If  HIGH  the  SAR  is  configured 


The  PAL20RS10  was  selected  because  some  of  the  outputs 
require  more  than  the  eight  product  terms  available  in  standard 
PAL  devices.  The  PAL20RS10  features  product  term  steering, 
whereby  sixteen  product  terms  are  shared  between  two  outputs. 
To  fit  in  the  device,  the  outputs  are  arranged  such  that  an  output 
using  more  than  eight  product  terms  is  adjacent  to  an  output  using 
less  than  eight  product  terms. 

The  oscilloscope  trace  (Figure  3)  shows  one  complete  cycle  of 
successive  approximation.  A  PAL20RS10  device  was  pro- 
grammed as  an  SAR  and  configured  with  an  AD558  D  to  A 
converter  and  a  comparator  circuit.  The  top  trace  shows  the 
output  of  the  comparator  and  the  bottom  trace  shows  the  output 
of  the  D  to  A  converter.  Initially  Q7  is  set  HIGH,  and  the  CMP 
feedback  holds  the  Q7  contents  HIGH  as  Q6  is  set  HIGH  for  the 
next  compare.  Q6  is  SET  HIGH  as  a  result  of  the  next  compare. 
When  Q5  is  set  HIGH  the  D  to  A  output  is  greater  than  the  input 
voltage  level  so  the  content  of  Qs  is  reset  LOW.  When  the  cycle 
is  complete  the  result  HHLH  HLLH  isthe  closest  binary  weighting 
to  the  level  of  the  analog  input. 


601  02 


PAL20RS10 

Figure  2.  Eight-Bit  Successive  Approximation  Register  Output  Waveforms 


COMPARATOR 
OUTPUT 


CONVERTER 
OUTPUT 


Figure  3. 
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Analog  to  Digital  Conversion 


; EIGHT  BIT  SUCCESSIVE  APPROXIMATION  REGISTER 
;WITH  CONVERSION  COMPLETE  AND  END  OF  CONVERSION. 

TITLE  SAP.  8 
PATTERN  01 
REVISION  01 
AUTHOR  CHRIS  JAY 
COMPANY  MMI   SANTA  CLARA 
DATE  29  JANUARY  1985 

CHIP  SAR  8  PAL20RS10 


;PIN 

1 

2 

3 

4 

5 

6 

CLK 

/RST 

CMP 

POL 

NC 

NC 

;PIN 

7 

8 

9 

10 

11 

12 

NC 

NC 

NC 

NC 

EN 

GND 

;PIN 

13 

14 

15 

16 

17 

18 

/OE 

EOC 

CC 

03 

02 

Q4 

;PIN 

19 

20 

21 

22 

23 

24 

Ql 

05 

00 

Q6 

07 

VCC 

EQUATIONS 

/Q7    :=  /RST*Q7*CMP*/Q6*/Q5*/Q4*/Q3 
*/Q2*/Ql*/Q0*/CC*/EOC*POL 

+  /RST*Q7*/CMP*/Q6*/Q5*/Q4 

*/Q3*/Q2*/Ql*/Q0*/CC*/EOC*/POL 

+  /RST*/Q7*/EOC 

Q6      :=  /RST*Q7*/Q6«/Q5*/Q4 

*/Q3*/Q2*/Ql*/Q0*/CC*/EOC 

+  /RST*Q6*/CMP*/Q5*/Q4 

*/Q3*/Q2*/Ql*/Q0*/CC*/EOC*POL 

+  /RST*Q6*CMP*/Q5*/Q4*/Q3 
*/Q2*/Ql*/Q0*/CC«/EOC*/POL 

+  /RST*Q6*Q5*/EOC 

+  /RST*Q6*Q4*/EOC 

+  /RST*Q6*Q3*/EOC 

+  /RST*Q6*Q2*/EOC 

+  /RST*Q6*Ql*/EOC 

+  /RST*Q6*Q0*/EOC 

+  /RST*Q6*CC*/EOC 

QO      :=  /RST*Q1*/QO*/CC*/EOC 

+  /RST*Q0*/CMP*/CC*/EOC*POL 

+  /RST*Q0*CMP*/CC*/EOC*/POL 

+  /RST*Q0*CC*/EOC 

Q5      :=  /RST*Q6*/Q5*/Q4*/Q3 

*/02*/Ql*/Q0*/CC*/EOC 

+  /RST*Q5*/CMP*/Q4*/Q3 

*  /Q2  *  /Ql  *  /QO  *  /CC  *  /EOC  *  POL 

+  /RST*Q5*CMP*/Q4*/Q3 

VQ2*/Q1*/QO*/CC*/EOC*/POL 

+  /RST*Q5*Q4*/EOC 

+  /RST*Q5*Q3*/EOC 

+  /RST*Q5»Q2*/EOC 

+  /RST*Q5*Q1*/E0C 

+  /RST*Q5*Q0*/EOC 

+  /RST*Q5*CC*/EOC 

Ql      :=  /RST*Q2*/Q1*/QO*/CC*/EOC 

+  /RST*Q1*/CMP*/QO*/CC«/EOC*POL 

+  /RST*Q1*CMP*/QO*/CC*/EOC*/POL 

+  /RST*Q1*QO*/EOC 

+  /RST*Q1*CC*/E0C 


ACTIVE  LOW  RESET  WILL  SET 
Q7  HIGH.   THE  CMP  OR  /CMP 
INPUT  IS  REGISTERED 
DEPENDING  ON  POL  INPUT. 
Q7   IS  HELD  UNTIL  THE  END 
OF  CONVERSION. 
ACTIVE  LOW  RESET  WILL 
RESET  Q6  LOW.   Q6  GOES 
HIGH  AFTER  Q7  AND  THE 
CMP  OR  /CMP  INPUT  IS 
REGISTERED. POL  SELECTS 
CMP  POLARITY.   THE  DATA 
REGISTERED  IN  Q6  IS 
HELD  THROUGH  THE  WHOLE 
APPROXIMATION  CYCLE  Q5 
DOWN  TO  CONVERSION 
COMPLETE.   THE  EOC  LOW 
SIGNAL,   END  OF  CONVER- 
-SION  CLEARS  THE 
REGISTERS. 
SET  REGISTER  QO 
REGISTER  CMP 
OR  /CMP  INPUT 
HOLD  QO  DATA 

SET  REGISTER  Q5 
AFTER  Q6.  DURING 
COMPARE  CYCLE. 
REGISTER  CMP 
OR  /CMP  INPUT 

HOLD  Q5  DATA 
DURING  THE 
APPROXIMATION 
CYCLE  FROM  Q4 
TO  CONVERSION 
COMPLETE. 

SET  REGISTER  Ql  HIGH 
AFTER  Q2  REG  CMP  OR  /CMP 
INPUT  SELECTED  BY  POL 
HOLD  Ql  DATA  FOR 
REST  OF  CONVERSION 


! 


Analog  to  Digital  Conversion 


Q4      :=  /RST*Q5*/Q4*/Q3*/Q2 

*/Ql*/Q0*/CC*/EOC 
+  /RST*Q4*/CMP*/Q3*/Q2 

*/Ql*/Q0*/CC*/EOC*POL 
+  /RST*Q4*CMP*/Q3*/Q2 

*/01 */Q0*/CC*/EOC*/POL 
+  /RST»Q4*Q3«/EOC 
+  /RST*Q4*Q2*/EOC 
+  /RST*Q4*Q1*/E0C 
+  /RST*Q4*Q0*/EOC 
+  /RST*Q4*CC*/E0C 

Q2      :=  /RST*Q3*/Q2*/Q1*/QO*/CC*/EOC 

+  /RST*Q2*/CMP*/Q1*/QO*/CC*/EOC*POL 

+  /RST*Q2*CMP*/Q1*/QO*/CC*/EOC8/POL 

+  /RST*Q2*Q1*/E0C 

+  /RST*Q2*Q0*/EOC 

+  /RST*Q2*CC*/E0C 

Q3      :=  /RST*Q4*/Q3*/Q2*/Q1*/Q0 

*/CC*/EOC 
+  /RST*Q3*/CMP*/Q2*/Ol*/Q0 

*/CC*/EOC*POL 
+  /RST*Q3*CMP*/Q2*/Q1*/Q0 

*/CC*/EOC*/POL 
+  /RST*Q3*Q2*/EOC 
+  /RST*Q3*Ql*/EOC 
+  /RST*Q3*Q0*/EOC 
+  /RST*Q3«CC*/E0C 

CC      :=  /RST*Q0*/CC*/EOC 

+  /RST*CC*/CMP*/EOC*/EN*P0L 
+  /RST*CC*CMP*/EOC*/EN*/POL 


EOC   :=  /RST*CC*/EOC 


SET  REGISTER  Q4 
AFTER  Q5  THEN 
COMPARE  CYCLE 
REGISTER  CMP  OR 
/CMP  INPUT. 

HOLD  Q4  DATA 
DURING  THE 
APPROXIMATION 
CYCLE  OF  Q3 
DOWN  TO  CC 
RESET  REGISTER  Q3 
SET  REGISTER  Q2  HIGH 
AFTER  Q3  REG  CMP  OR 
/CMP  INPUT  SELECT  W/POL 
HOLD  Q2  DATA 
FOR  APPROXIMATION 
CYCLE  TO  CC. 
RESET  REGISTER  Q3 
THEN  SET  Q3. 
REGISTER  THE  CMP 
OR  /CMP  INPUTS 
AND  HOLD  CONTENTS 
OF  Q3  DURING  THE 
REST  OF  THE  APP- 
ROXIMATION CYCLE 
DOWN  TO  CC 


RESET  CC.   CC  GOES  HIGH 
AFTER  QO  TO  INDICATE 
THAT  CONVERSION  IS  NOW 
COMPLETE.    IF  EN  IS  LOW 
THEN  CC  CAN  PROVIDE  ONE 
ADDITIONAL  BIT  OF 
RESOLUTION . 

END  OF  COUNT 
GOES  HIGH  TO 
RESET  THE  SAR . 


SIMULATION 

TRACE  ON  CLK  /RST  EN  CMP  POL 

Q7  Q6  Q5  Q4  03  02  Ql  QO  CC  EOC 


SETF  OE  /CLK  RST  EN  CMP  /POL 

CLOCKF 

SETF  /RST 

CLOCKF 

FOR   I    :=   1  TO  9  DO 

BEGIN  CLOCKF 

END 

SETF  /CMP 

FOR  J   :=   1  TO  9  DO 
BEGIN  CLOCKF  CLK 
END 

SETF  POL 

FOR  K   :=   1  TO  9  DO 
BEGIN  CLOCKF  CLK 
SETF  CMP 

FOR  L   : =   1  TO  9  DO 
BEGIN  CLOCKF  CLK 
END 

CLOCKF 
END 

TRACE  OFF 


SET  INITIAL  CONDITIONS 
AND  RESET  Q6  -  QO ,  CC 
AND  EOC,   SET  Q7 
FOR  NINE  CLOCK  CYCLES 
PERFORM  ONE  PASS  OF 
SUCCESSIVE  APPROXIMATION 
WITH  CMP  HIGH  AND  POL 
LOW.   PERFORM  SECOND 
PASS  WITH  CMP  LOW 

SET  POLARITY  PIN  ACTIVE 
TEST  SUCCESSIVE  CYCLE  OF 
APPROXIMATION  WITH  THE 
INVERSION  INPUT  ACTIVE 
FOR  CMP  HIGH  AND  CMP 
LOW 
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PALASM  SIMULATION,   V2.21A  MARKET  VERSION  (07-24-86) 
(C)    -  COPYRIGHT  MONOLITHIC  MEMORIES  INC,  1986 
PALASM  SIMULATION  SELECTIVE  TRACE  LISTING 


Title        :  SAR 
Pattern     :  01 
Revision  :  01 


Author       :  CHRIS  JAV 
Company     :  MMI  SANTA  CLARA 
Date  :    29  JANUARY  1985 


SAR 

Page  :  1 
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Analog  to  Digital  Conversion 


; FOURTEEN  BIT  SUCCESSIVE  APPROXIMATION  REGISTER 

TITLE  SARI 4 
PATTERN  01 
REVISION  01 
AUTHOR  CHRIS  JAY 
COMPANY  MMI   SANTA  CLARA 
DATE  29  JANUARY  1985 

•THE  PAL32R16  HAS  BEEN  DESIGNED  AS  A  FOURTEEN  BIT  SUCCESSIVE 
; APPROXIMATION  REGISTER  FOR  MEDIUM  TO  HIGH  SPEED  ANALOG  TO 
; DIGITAL  CONVERSION.   THE  FOURTEEN  OUTPUTS  ARE  CONNECTED  TO 
; THE  INPUTS  OF  A  FOURTEEN  BIT  DAC,   WITH  Q13  AS  THE  MOST  SIG- 
NIFICANT BIT  DOWN  TO  00,   WHICH  IS  THE  LEAST  SIGNIFICANT 
;BIT.   THE  CC  OUTPUT  IS  THE  CONVERSION  COMPLETE  SIGNAL,  WHEN 
;  HIGH   IT   INDICATES  THAT  THE  CONTENTS  OF  THE  FOURTEEN  BIT  REGISTER 
; HOLDS  THE  CLOSEST  BINARY  WEIGHTING  TO  THE  CURRENT  VERSION  OF 
; THE  ANALOG  SIGNAL  UNDERGOING  CONVERSION.   THE  DAC  OUTPUT  AND 
; THE  ANALOG   INPUTS  ARE  FED  TO  A  HIGH  QUALITY  COMPARATOR  CIRCUIT, 
; THE  OUTPUT  OF  WHICH  IS  A  TTL  LEVEL  THAT  DRIVES  THE  CMP  COMPARE 
; INPUT  OF  THE  PAL32R16. 

CHIP  SARI 4  PAL32R16 


PIN 

1 

2 

3 

4 

5 

6 

7 

8 

013 

NC 

Q12 

CC 

/OE1 

/RST 

CMP 

NC 

PIN 

9 

10 

11 

12 

13 

14 

15 

16 

NC 

VCC 

NC 

NC 

NC 

NC 

/PLD2 

/CLK2 

PIN 

17 

ie 

19 

20 

21 

22 

23 

24 

Qll 

Q0 

Q10 

Ql 

Q9 

02 

08 

03 

PIN 

25 

26 

27 

28 

29 

30 

31 

32 

/OE2 

NC 

NC 

NC 

NC 

GND 

NC 

NC 

PIN 

33 

34 

35 

36 

37 

38 

39 

40 

NC 

NC 

/PLD1 

/CLK1 

Q7 

04 

06 

05 

STRING 

H13 

'  Ql  3  */CC 

;  HOLD 

EQUATION 

FOR 

Q13 

STRING 

H12 

'Q12*/CC*/RST' 
'Q11*/CC*/RST' 

;  HOLD 

EQUATION 

FOR 

Q12 

STRING 

Hll 

;HOLD 

EQUATION 

FOR 

Qll 

STRING 

H10 

'Q10*/CC*/RST' 

;  HOLD 

EQUATION 

FOR 

Q10 

STRING 

H9 

'Q9*/CC*/RST' 

;  HOLD 

EQUATION 

FOR 

Q9 

STRING 

H8 

'Q8*/CC*/RST' 

;  HOLD 

EQUATION 

FOR 

Q8 

STRING 

H7 

'Q7*/CC*/RST' 
'Q6*/CC*/RST' 
'Q5*/CC*/RST' 
'Q4*/CC*/RST' 
'Q3*/CC*/RST' 
|Q2*/CC*/RST' 

;  HOLD 

EQUATION 

FOR 

Q7 

STRING 

H6 

;  HOLD 

EQUATION 

FOR 

Q6 

STRING 

H5 

;  HOLD 

EQUATION 

FOR 

Q5 

STRING 

H4 

;  HOLD 

EQUATION 

FOR 

Q4 

STRING 

H3 

;  HOLD 

EQUATION 

FOR 

Q3 

STRING 

H2 

;  HOLD 

EQUATION 

FOR 

02 
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EQUATIONS 


Q13    :=  Q13*CMP*/Q12*/Q11*/Q10*/Q9*/Q8*/Q7 
*/Q6*/Q5*/Q4*/Q3*/Q2*/Ql*/Q0*/CC 

+  H13*Q12   +  H13*Q11   +  H13*Q10 

+  H13*Q9     +  H13*Q8     +  H13*Q7 

+  H13*Q6     +  H13*Q5     +  H13*Q4 

+  H13*Q3     +  H13*Q2     +  H13*Q1 

+  H13*Q0     +  RST  +  CC 

Q12    :=  Q13*/Q12*/Q11*/Q10*/Q9*/Q8*/Q7*/Q6 
*/Q5*/Q4*/Q3*/Q2*/Ql*/Q0*/CC*/RST 
+  Q12*CMP*/Q11*/Q10*/Q9*/08*/Q7*/Q6 

*/Q5*/Q4*/Q3*/Q2*/Ql*/Q0*/CC*/RST 
+     H12*Q11   +  H12*Q10   +  H12*Q9 
+     H12*Q8     +  H12*Q7     +  H12*Q6 
+     H12*Q5     +  H12*Q4     +  H12*Q3 
+     H12*Q2     +  H12*Q1     +  H12*Q0 

Qll    :=  Q12*/Q11*/Q10*/Q9*/Q8*/Q7*/Q6*/Q5 
*/Q4*/Q3*/Q2*/Ql*/Q0*/CC*/RST 
+  Q11*CMP*/Q10*/Q9*/Q8*/Q7*/Q6*/Q5 

*/Q4*/Q3*/Q2*/Ql*/Q0*/CC*/RST 
+     H11*Q10  +  H11*Q9   +  H11*Q8 
+     H11*Q7     +  H11*Q6   +  H11*QS 
+     H11*Q4     +  H11*Q3  +  H11*Q2 
+     H11*Q1     +  H11*Q0 

Q10    :=  Qll*/Q10*/Q9*/Q8*/Q7*/Q6*/Q5*/Q4 
*/Q3*/Q2*/Ql*/Q0*/CC*/RST 
+  Q10*CMP*/Q9*/Q8*/Q7*/Q6*/Q5*/Q4 

*/Q3*/Q2*/Ql*/Q0*/CC*/RST 
+  H10*Q9  +  H10*Q8  +  H10*Q7 
+  H10*Q6  +  H10*Q5  +  H10*Q4 
+  H10*Q3  +  H10*Q2  +  H10*Q1 
+  H10*Q0 

09      :=  Q10*/Q9*/Q8*/Q7*/Q6*/Q5*/Q4*/Q3 
*/Q2*/Ql*/Q0*/CC*/RST 
+  Q9*CMP*/Q8*/Q7*/Q6*/Q5*/Q4*/Q3 

*/Q2*/Ql*/00*/CC*/RST 
+     H9*Q8  +  H9*Q7  +  H9*Q6 
+     H9*05   +  H9*04   +  H9*Q3 
+     H9*Q2  +  H9*Q1 
+  H9*Q0 

Q8      :=  Q9*/Q8*/Q7VQ6*/Q5*/04*/Q3*/Q2 

*/Ql*/QO*/CC*/RST 
+  Q8*CMP*/Q7*/Q6*/Q5*/Q4*/Q3*/Q2 

*/Ql*/QO*/CC*/RST 
+     H8*Q7   +  H8*Q6  +  H8*Q5 
+     H8*Q4   +  H8*Q3   +  H8*Q2 
+     H8*Q1   +  H8*Q0 

07      :=  Q8*/Q7*/Q6*/Q5*/Q4*/Q3*/Q2*/Q1 

*/Q0*/CC*/RST 
+  Q7*CMP*/Q6*/Q5*/Q4*/Q3*/Q2*/Q1 

*/CC*/RST 
+     H7*Q6   +  H7*Q5   +  H7*Q4 
+     H7*Q3  +  H7*Q2  +  H7*Q1 
+  H7*Q0 


; REGISTER  CMP  I/P 

HOLD  RESULT  OF  COMPARE 
THE  APPROXIMATION  CYCLE 
Q12  TO  QO  CLEAR  ON  RESE 
CYCLE  Q12  TO  QO 
AND  CONVERSION  COMPLETE 

CLEAR  Ql  2.  DURING 
FIRST  APPROXIMATION 
REGISTER  COMPARE 
INPUT. 

HOLD  CONVERSION  RESULT 
DURING  APPROXIMATION  CY 
Qll  TO  QO  CLEAR  ON  RESE 
AND  CONVERSION  COMPLETE 

HOLD  Qll  LOW 
BEFORE  COMPARE 
CYCLE.  REGISTER 
COMPARE  INPUT. 
HOLD  RESULT  OF  COMPARE 
DURING  APPROXIMATION 
CYCLE  Q10  TO  QO .  RESET 
AND  CC  CLEARS  REGISTER. 

HOLD  Q10  LOW 
BEFORE  COMPARE 
CYCLE.  REGISTER 
RESULT  OF  COMPARE 
HOLD  RESULT  ON 
APPROXIMATION 
CYCLE  Q9  TO  QO . 
RESET  AND  CC 
CLEARS  REGISTER. 
HOLD  Q9  LOW 
BEFORE  COMPARE. 
REGISTER  RESULT 
OF  COMPARE. 
HOLD  RESULT  ON 
APPROXIMATION 
CYCLE  Q8  TO  QO . 
RESET  AND  CC 
CLEARS  REGISTER. 
HOLD  Q8  LOW 
BEFORE  COMPARE 
REGISTER  RESULT 
OF  COMPARE. 
HOLD  RESULT  ON 
APPROXIMATION 
CYCLE  Q7  TO  QO 
HOLD  Q7  LOW 
BEFORE  COMPARE 
REGISTER  RESULT 
OF  COMPARE. 
HOLD  CONTENTS 
OF  RESULT  ON 
APPROXIMATION 
CYCLE  Q6  TO  QO 
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Q6      :=  Q7*/Q6*/Q5*/Q4*/Q3*/Q2*/Q1*/Q0*/CC*/RST 

+  Q6*CMP*/Q5*/Q4*/Q3*/Q2*/Q1»/Q0*/CC*/RST 

+  H6*Q5   +  H6*Q4  +  H6*Q3 

+  H6*Q2  +  H6*Q1  +  H6*Q0 

Q5      :=  Q6*/Q5*/Q4*/Q3*/Q2*/Q1*/Q0*/CC*/RST 

+  Q5*CMP*/Q4*/Q3*/Q2*/Q1*/Q0*/CC*/RST 

+  H5*Q4  +  H5*Q3  +  H5*Q2 

+  H5*Q1   +  H5*Q0 

Q4      :=  Q5*/Q4*/Q3*/Q2*/Q1*/Q0«/CC*/RST 

+  Q4  *CMP*/Q3  *  /Q2  *  /Ql  *  /QO  *  /CC  *  /RST 

+  H4*Q3  +  H4*Q2  +  H4*Q1 

+  H4*Q0 

Q3      :=  Q4*/Q3*/Q2*/Q1*/Q0*/CC*/RST 

+  Q3*CMP*/Q2*/Q1*/00*/CC*/RST 

+  H3*Q2  +  H3*Q1  +  H3*Q0 

Q2      :=  Q3*/Q2*/Q1*/Q0*/CC*/RST 

+  Q2*CMP*/Q1*/Q0*/CC*/RST 

+  H2*Q1   +  H2*Q0 

Ql      :=  Q2*/Q1*/Q0*/CC*/RST 

+  Q1*CMP*/Q0*/CC*/RST 

+  Q1*Q0*/CC*/RST 

QO      :=  Q1*/Q0*/CC*/RST 

+  QO*CMP*/CC*/RST 


HOLD  Q6  LOW  BEFORE  COMP 
REGISTER  RESULT  OF 
COMAPRE .   HOLD  RESULT 
ON  APPROXIMATION 
Q5  -  QO. 

HOLD  Q5  LOW  BEFORE  COMP 

REGISTER  RESULT  OF  COMP 

HOLD  RESULT  ON 

APPROXIMATION 

CYCLE  Q4  -  QO. 

HOLD  Q4  LOW 

REGISTER  COMPARE 

HOLD  RESULT  ON 

Q3   -  QO. 

HOLD  Q3  LOW 
REGISTER  COMPARE 
HOLD  RESULT  ON 
CYCLE  Q2  -  QO. 
HOLD  Q2  LOW. 
REGISTER  COMPARE 
HOLD  RESULT 
ON  Q2  -  QO. 
HOLD  Ql  LOW. 
REGISTER  COMPARE. 
HOLD  RESULT. 

; 

; HOLD  QO  LOW . 
.•REGISTER  COMPARE. 


Q0*/CC*/RST 


; HIGH  CONVERSION 
; COMPLETE . 


SIMULATION 


TRACE_ON  CLK1  CMP  RST  Q13  Q12  Qll  Q10  Q9 
Q8  Q7  Q6  Q5  Q4  Q3  Q2  Ql  QO  CC 


SETF 

CLOCKF 

CHECK 


SETF 
CLOCKF 
FOR 

BEGIN 

END 
SETF  CMP 
FOR 

BEGIN 

END 
TRACE  OFF 


/CLK1  /CLK2  OE1  OE2  /PLD1  /PLD2  RST  /CMP 
CLK1  CLK2 

Q13  /Q12  /Qll  /Q10  /Q9  /Q8  /Q7  /Q6  /Q5 
/Q4  /Q3  /Q2  /Ql  /QO 
/RST 
CLK1  CLK2 
I    : =  1  TO  1 4  DO 
CLOCKF  CLK1  CLK2 


I  :=  1  TO  14  DO 
CLOCKF  CLK1  CLK2 


;SET  INITIAL 

; CONDITIONS 

; SYNCHRONOUS 

; RESET  CHECK 

; INITIAL  REGISTER 

; STATE . 

;APPLY   14  CLOCK 
; PULSES  WITH  CMP 
; INPUT  LOW. 

;SET  CMP  INPUT 
; HIGH  AND  APPLY 
;14  CLOCK  PULSES. 


PALASM  SIMULATION ,   V2.21  P-  PROGRAMMER  VERSION  (21-OCT-1986) 
(C)    -  COPYRIGHT  MONOLITHIC  MEMORIES  INC,  1986 
PALASM  SIMULATION  SELECTIVE  TRACE  LISTING 


Title        :  SARI 4 
Pattern    :  01 
Revision  :  01 


Author      :  CHRIS  JAY 
Company    :  MMI  SANTA  CLARA 
Date  :    29  JANUARY  1985 
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PAL  Devices,  PROMs,  FIFOs,  and  Multipliers 
Team  up  to  Implement  Single-Board 
High-Performance  Audio  Spectrum  Analyzer  an  100 


Introduction 

This  application  note  illustrates  a  high-performance  audio  spec- 
trum analyzer.  This  circuit  can  analyze  high-fidelity  audio  sig- 
nals with  a  resolution  of  20  Hz  and  an  input  bandwidth  of  20  kHz. 
It  is  useful  in  production  test,  performance  evaluation,  or 
adjustment  of  high-fidelity  audio  equipment.  The  analyzer  pro- 
vides a  sweep  generator  output  for  rapid  analysis  of  audio  filter 
frequency  response. 

The  design  techniques  used  to  implement  the  analyzer  are  quite 
general,  and  can  be  applied  to  a  wide  variety  of  DSP  tasks.  An 
understanding  of  the  approach  used  will  suggest  solutions  to  a 
number  of  DSP  problems.  The  architecture  chosen  for  the 
spectrum  analyzer  is  controlled  by  a  microprogram  stored  in 
PROM.  Many  other  applications  can  be  accommodated  by 
changing  the  microprogram.  The  high  performance  of  this 
architecture  provides  an  attractive  price/performance  alterna- 
tive to  other  DSP  approaches. 

Spectrum  Analyzer  Functions 

The  spectrum  analyzer  requires  many  of  the  functions  com- 
monly used  in  DSP.  Figure  1  shows  the  analyzer  functions.  An 
input  signal  is  mixed  with  a  swept  audio  sinewave  oscillator 
(below). 


Frequency  Sweep 

The  frequency  sweep  acts  as  a  sampler,  starting  from  DC  and 
increasing  to  its  maximum  frequency. 

Mixing  is  accomplished  by  multiplying  the  input  signal  by  the 
sinewave.  From  basic  trigonometry: 

cos  w-|t  X  cos  W2t  =  1/2  cos  (wi  +  W2)t  +  1/2  cos  (wi  -W2)t  (1) 

The  mixing  process  generates  two  new  sinewaves  whose  fre- 
quencies are  the  sum  and  difference  of  the  input  sinewave 
frequencies.  When  the  sinewave  oscillator  matches  the  fre- 
quency of  an  input  signal  component,  a  DC  term  is  generated  in 
proportion  to  the  amplitude  of  that  component: 

cos  w-|t  x  cos  w-|t  =  cos2  w-jt  =  1+2  cos  w-|t 

The  DC  term  is  extracted  by  a  narrow  lowpass  filter.  Due  to  the 
finite  bandwidth  of  this  lowpass  filter,  mixer  output  signals 
whose  frequencies  fall  within  the  filter  passband  also  appear  at 
the  filter  output.  As  a  result,  the  analyzer  output  will  represent 
the  energy  contained  in  a  range  of  frequencies,  from  the  sine- 
wave frequency  minus  the  filter  cutoff  frequency,  to  the  sine- 
wave frequency  plus  the  filter  cutoff  frequency.  The  effective 
bandwidth  of  the  analyzer  is  twice  the  lowpass  filter  bandwidth. 


INPUT 
SIGNAL 
DC-20  kHz 


)— 

LOWPASS 

FILTER 

OUTPUTS 
—  O  VERTICAL 


SWEPT 
SINE  WAVE 
OSCILLATOR 


-o  HORIZONTAL 


SWEEP 
GENERATOR 


Figure  1.  Spectrum  Analyzer  Functions 

A  detector  converts  the  lowpass  filter  output  to  a  DC  voltage 
representing  the  total  energy  in  the  filter  passband.  If  this  DC 
voltage  is  plotted  on  a  vertical  axis  with  the  sinewave  oscillator 
frequency  (represented  by  the  sweep  voltage)  controlling  the 
horizontal  axis,  a  spectrum  of  the  input  signal  results. 

Other  mixing  schemes  can  be  used  to  extract  the  spectrum. 
However,  this  "direct  conversion"  approach  has  two  significant 
advantages.  As  shown  in  Figure  2,  the  swept  oscillator  output  can 
be  used  to  plot  the  frequency  response  of  an  audio  filter.  Other 
schemes  require  additional  mixing  to  achieve  the  same  results. 


OSCILLOSCOPE 


SPECTRUM 
ANALYZER 
IN  HORIZ 

vco 

FILTER 
UNDER  TEST 

VERT 
HORIZ 


Figure  2.  Filter  Test  Mode  Setup 

The  direct  conversion  scheme  confines  the  frequencies  of  all 
signals  following  the  mixer  to  the  lowpass  filter  bandwidth. 
Limiting  the  signal  bandwidth  has  great  benefit  when  the 
analyzer  is  implemented  digitally.  This  benefit  can  be  better 
understood  with  a  brief  review  of  DSP  theory. 
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Digital  Signal  Processing 
Theory  Review 

Digital  signal  processing  is  accomplished  by  first  converting  the 
continuous  analog  input  signal  to  a  series  of  digital  numbers. 
The  digital  numbers  are  then  manipulated  to  perform  the 
required  signal  processing.  The  processed  digital  numbers  are 
then  converted  back  to  a  continuous  analog  signal,  completing 
the  processing.  The  functions  required  for  DSP  are  shown  in 
Figure  3. 

 ANALOG  -•  DIGITAL  ~  ANALOG  


ALIASING 
FILTER 


® 


®  © 


DIGITAL 
PROCESSING 
H(I) 


SMOOTHINGIPUT 
FILTER 


®  © 


© 


Figure  3.  DSP  Functions 


Sampling 

Representing  a  continuous  input  signal  would  require  an  infinite 
array  of  digital  numbers.  A  finite  collection  of  digital  numbers 
can  be  obtained  by  considering  the  signal  amplitude  at  discrete, 
periodic  points  in  time.  This  process  is  called  sampling,  and  is 
equivalent  to  multiplying  the  input  signal  by  a  periodic  train  of 
impulses  of  unit  amplitude.  The  sampling  theorem  states  that 
the  input  signal  can  be  reconstructed  without  distortion  if  the 
input  is  bandlimited  to  contain  no  frequency  components 
greater  than  half  the  sampling  frequency.  The  sampling  theorem 
means  that  the  discrete  samples  completely  represent  the  input 
signal,  as  long  as  the  bandwidth  constraint  is  met. 

Aliasing 

What  is  really  happening  during  the  sampling  process?  Consider 
the  Fourier  series  representation  of  a  periodic  unit  impulse  train. 
It  can  be  shown  that: 


The 


f(t) 


k  =  • 

2 


=  0,  1,  2,  3, 


(2) 


where  f.  =   

b    sample  period 


;  periodic  impulse  train  is  equivalent  to  a  series  of  sinusoids 
consisting  of  all  harmonics  of  the  sampling  frequency,  including 


FREQUENCY— "-DC  ls>2  Is  2  ■  Is  3  Is 

Figure  4.  Aliasing  Spectra  of  Figure  3  DSP  Functions 


a  DC  term.  Recalling  Eq.  (1)  all  possible  sum  and  difference 
frequencies  will  be  generated  when  the  impluse  train  and  the 
input  signal  are  multiplied.  This  process  is  shown  graphically  in 
Figure  4.  Observe  that  if  the  input  contains  frequencies  greater 
than  half  the  sampling  frequency,  the  spectra  in  Figure  4  will 
overlap.  This  overlap  phenomenon  is  known  as  aliasing  distor- 
tion, and  introduces  noise  in  the  signal. 

Another  consequence  of  the  sampling  process  is  that  high- 
frequency  signal  components  near  a  harmonic  of  the  sampling 
frequency  will  be  mixed  to  produce  new  signal  components  near 
DC.  These  new  components  have  frequencies  within  the  desired 
signal  passband,  but  are  really  "alias"  high-frequency  compo- 
nents. The  phenomenon  is  called  aliasing. 

To  eliminate  the  undesirable  effects  of  aliasing,  a  continuous 
analog  lowpass  filter  is  placed  before  the  sampler.  This  aliasing 
filter  removes  frequency  components  beyond  the  fs/2  limit. 

Quantizing 

The  input  samples  are  converted  to  a  series  of  digital  numbers  by 
an  analog-to-digital  (A/D)  converter.  The  A/D  converter  operates 
by  quantizing  the  continuous  sample  amplitude  into  a  finite 
number  of  amplitude  ranges,  and  then  assigning  a  digital 
number  to  represent  the  quantized  amplitude  value.  As  might  be 
expected,  this  process  introduces  noise  in  the  signal,  known  as 
quantization  distortion.  The  quantization  distortion  is  in  the  form 
of  a  "white"  or  broadband  random  noise,  whose  RMS  ampli- 
tude is: 


1_?-2b 
12 


(3) 


where  b  is  the  number  of  bits  in  the  output 
digital  word,  excluding  the  sign  bit 

The  effect  of  aliasing  on  quantization  noise  is  to  alias  high 
frequency  noise  components  to  the  DC  to  fs/2  range.  The  result- 
ing noise  spectral  density  is  equivalent  to  a  white  noise  of  ampli- 
tude a2,  bandlimited  to  fs/2. 

Dynamic  Range 

The  A/D  output  contains  a  finite  number  of  bits.  Dynamic  range 
is  defined  as  the  ratio  of  the  maximum-to-minimum  signal 
amplitude  that  can  be  represented  by  the  digital  numbers. 
Dynamic  range  is  determined  by  the  number  of  bits  in  the  digital 
numbers,  and  by  the  noise  "floor." 

For  a  digital  number  containing  b  bits  plus  a  sign  bit,  the 
dynamic  range  would  be: 


Dynamic  range  (dB)  =  10  log1Q  2 


■2b 


(4) 


The  noise  floor  is  the  sum  of  all  noise  components  that  can 
appear  at  the  DSP  output.  The  primary  noise  factors  are 
quantization  noise  and  limit  cycle  noise  (to  be  discussed 
shortly).  Digital  filtering  will  affect  the  noise  floor  by  eliminating 
components  of  the  noise  signal.  For  example,  the  quantization 
noise  at  the  DSP  output  is: 

NQ(dB)  =  10log10pI|]  (5) 

where  BW  is  the  net  bandwidth  of  the  digital  filters 

The  noise  components  are  uncorrelated,  and  are  therefore 
combined  by  adding  the  power  of  each  noise  component. 
Remember  that 


Power  (absolute)  =  log10  1  £  Power  (dB)/10 


(6) 
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The  resulting  dynamic  range  is: 


0.5 


Dynamic  range  (dB)  =  10  log,n   ;  

lu  5,  Noise  Power 


(7) 


where  0.5  =  the  maximum  mean-squared  amplitude 

The  overall  dynamic  range  is  the  lesser  of  the  result  given  by  Eq. 
(4)  or  Eq.  (7).  In  a  practical  system,  the  width  of  the  digital 
numbers  can  vary.  The  dynamic  range  is  usually  calculated  for 
all  critical  points  in  a  digital  system,  with  the  overall  dynamic 
range  being  the  worst  case  value. 

Digital  Processing 

The  digital  numbers  from  the  A/D  converter  are  manipulated  to 
process  the  signal.  Carrier  generation,  filtering,  and  nonlinear 
operations  are  performed  by  appropriate  "number  crunching". 

Generation  of  sinusoidal  carriers  is  easily  accomplished  using  a 
linear  ramp  function  (digital  up/down  counter)  and  converting 
the  results  to  sinusoidal  samples  using  ROM  lookup  tables. 
Alternately,  recursive  equations  can  produce  the  desired  carriers. 

Nonlinear  operations  on  the  digital  numbers  must  be  handled 
with  care.  Since  aliasing  is  always  present  in  the  sampled 
domain,  harmonics  generated  by  nonlinear  operations  can  alias 
to  lower  frequencies.  The  aliasing  occurs  "immediately,"  since  it 
can  be  shown  that  performing  a  nonlinear  operation  in  the 
sampled  domain  is  equivalent  to  first  performing  the  nonlinear 
operation  on  a  continuous  signal  and  then  sampling  the  result 
without  bandlimiting  the  sampler  input. 

The  sampling  rate  can  be  changed  to  improve  the  efficiency  of 
the  digital  processing.  For  example,  discarding  every  other 
digital  number  would  reduce  the  effective  sampling  rate  by  a 
factor  of  two.  If  the  processing  at  the  higher  sample  rate 
includes  digital  aliasing  filters  to  remove  components  greater 
than  half  the  lower  sample  rate,  the  requirements  of  the 
sampling  theory  are  still  met.  The  sampling  rate  can  be 
increased  by  repeating  digital  sample  values.  This  repetition  is 
equivalent  to  a  "sample  and  hold"  operation,  and  modifies  the 
signal  spectrum  by 


F  (jw)  =  F(jw)xsin(wT/2) 
wT/2 


(8; 


where  w  =  27r  X  freq 

T  =  input  (longer)  sample  period 
The  effects  of  changing  the  sampling  rate  are  best  determined 
by  plotting  the  resulting  aliasing  spectra. 

Digital  Filtering 

Digital  filtering  is  accomplished  using  multiplication,  addition, 
and  delay.  For  example,  consider  the  biquadratic  filter  section 
in  Figure  5.  If  z"1  is  defined  to  be  a  unit  sample  period  delay 
operator,  then  the  input-to-output  transfer  function  of  the 
biquadratic  section  is: 


H(z)  = 


1  +  a^z 


+  a2z 


1  +  b^z~'[  +b2z"2 


(9) 


The  biquadric  sections  can  be  cascaded  to  implement  higher- 
order  filters. 

The  Laplace  transform  of  a  unit  delay  is  e~sT,  where  T  is  the 
delay  period.  Remember  that  z~'  represents  an  inverse  opera- 
tor, so  that  z  x  z_1  =  t.Thus, 

cT 

z  =  e    ,wheres  =  a  +  jw  (10) 


x'  N-2 

X'  N  =  XN  -bi  X-  N_1  -  b2  X  N_2 
YN  =  X'N  +S1X'  N_!  ♦  a2X'N_2 

Figure  5.  Digital  Biquadratic  Filter  Section 

Digital  filter  poles  and  zeroes  (in  the  z-plane)  can  be  mapped 
into  the  s-plane  to  determine  the  equivalent  analog  filter 
function,  and  vice-versa.  The  digital  filter  section  of  Eq.  (9) 
corresponds  to  an  analog  biquadratic  filter  section  with, 


H(s) 


s2  +  o0w0s  +  w02 


-o,w,  s  +  w, 


(11) 


However,  the  periodic  nature  of  the  esT  function  causes  the 
digital  filter  passband  to  repeat  periodically.  The  effect  is  the 
same  as  aliasing.  The  analog  filter  response  is  mixed  with  the 
sampling  frequency  harmonics  to  generate  the  true  digital  filter 
response. 

Designing  Digital  Filters 

How  does  one  go  about  designing  a  digital  filter?  One  approach 
is  to  perform  a  least  mean  squared  error  optimization  using  a 
computer.  The  desired  function  is  specified,  and  the  computer 
adjusts  the  an  and  bn  values  until  the  desired  response  is 
achieved. 

A  second  approach  is  to  design  an  equivalent  analog  filter  and 
then  convert  that  design  to  a  digital  filter.  This  approach  has 
great  merit,  since  analog  filter  design  theory  is  well  developed. 
However,  the  digital  passband  will  be  distorted  if  the  analog 
equivalent  filter  has  significant  response  to  frequencies  greater 
than  fs/2.  The  aliased  passbands  overlap  at  that  point. 

To  circumvent  this  problem,  the  analog  filter  function  can  be 
modified  to  compensate  for  the  aliasing  effects.  The  analog 
transfer  response  is  modified  using  several  transforms  to 
compensate  for  aliasing.  Unfortunately,  the  nature  of  the 
s-plane  to  z-plane  mapping  is  such  that  no  transform  can 
compensate  for  all  aliasing  effects  without  introducing  other 
forms  of  distortion. 

The  standard  (or  impulse  invariant)  z-transform  represents  a 
direct  mapping  to  the  z-plane.  No  frequency,  amplitude,  or 
phase  distortion  is  introduced,  but  aliasing  effects  are  not 
compensated.  This  transform  should  be  used  when  the  analog 
filter  has  negligible  response  to  frequencies  greater  than  f2/2. 

The  bilinear  z-transtorm  preserves  the  filter  amplitude  response 
in  the  presence  of  aliasing.  However,  the  bilinear  transform 
introduces  a  distortion  or  warping  of  the  frequency  axis.  As  a 
result,  only  the  filter  cutoff  frequency  can  be  accurately  trans- 
formed, using  a  pre-warping  technique.  Frequencies  within  the 
filter  passband  remain  warped,  introducing  phase  distortion  in 
the  digital  filter  response.  The  bilinear  transform  is  used  when 
the  filter  amplitude  response  is  more  critical  than  the  phase 
response. 
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amplitude  distortion,  unlike  the  aliasing  distortion,  can  be 
corrected  by  placing  additional  zeroes  in  the  transfer  function. 
The  matched  transform  is  used  when  the  filter  phase  response  is 
ritical,  and  either  the  amplitude  response  is  not  critical  or  the 
ditional  compensation  zeroes  can  be  accommodated, 
forming  the  transforms  by  hand  is  quite  tedious.  Fortunately, 
imputer  programs  are  widely  available  which  handle  the 
omplete  filter  synthesis  procedure,  including  z-transforms  and 
pre-warping. 


imit  Cycle  Noise 

An  effect  of  using  digital  numbers  with  a  finite  number  of  bits  is 
the  generation  of  quantization  noise.  When  implementing  digital 
filters,  the  quantization  noise  introduces  oscilaltions  that  are 
analogous  to  ringing  in  analog  filters.  These  oscillations  are 
called  limit  cycles.  The  limit  cycle  generates  a  noise  which  peaks 
at  frequencies  corresponding  to  the  filter  pole  frequencies.  The 
noise  power  is  roughly  proportional  to  pole  Q.  Limit  cycle  noise 
for  a  second  order  filter  section  of  Equation  (11)  is  given  by: 

,2 


2  -2b  1  +  r 
L(dB)  =  10log10(-2  — 


1 


2r4 


+  1-2r':cos  2w 


(12) 


where  b  =  number  of  digital  number  bits  (excluding  sign  bit) 
pole  freq.  =  w1 
pole  Q  =  1/«i 


w  =  In 


pole  freq. 

T 


exp( 


2  •  pole  Q 


The  limit  cycle  noise  must  be  calculated  for  each  complex  pole 
pair,  and  adjusted  to  reflect  the  response  of  subsequent  filter 
as  to  the  limit  cycle  frequency.  Computer  programs  can 
calculate  limit  cycle  noise  power,  including  all  of  these  considerations. 

Output  Signal  Reconstruction 

Once  manipulation  of  the  digital  sample  numbers  is  complete, 
resulting  digital  numbers  must  be  converted  back  to  a 


series  of  analog  output  pulses. 

A  sample-and-hold  (S&H)  circuit  eliminates  transients  that  are 
introduced  during  the  D/A  conversion  process.  The  spectrum  of 
the  S&H  output  is  modifed  as  follows: 

t  sin  (wt/2) 
S&H  F  (M  =  F(jw)7  w</2 

where  t  =  hold  time      T  =  sample  period 

An  output  smoothing  filter  completes  the  reconstruction  by 
removing  all  components  with  frequencies  greater  than  fs/2. 
The  smoothing  filter  is  often  optional,  depending  on  the  impor- 
tance of  removing  the  high-frequency  output  components. 

The  spectral  effects  of  reconstruction  are  shown  in  Figure  4. 

Implementing  the  Spectrum  Analyzer 

The  architecture  used  for  the  spectrum  analyzer  is  shown  in 
Fig.  6.  Input  signals  are  digitized  and  buffered  with  FIFOs  before 
interface  with  a  common  16-bit  data  bus.  The  16-bit  arithmetic 
unit  (AU)  provides  multiply  and  accumulate  operations.  A  16-bit 
wide  RAM  stores  intermediate  results.  A  16-bit  temporary 
register  facilitates  z_1  delays  and  data  movement.  Outputs 
are  provided  using  a  D/A  converter  and  S&H  circuits. 

The  VCO  output  is  buffered  using  FIFOs  to  provide  a  uniform 
high-speed  sample  rate.  The  VCO  output  is  12  bits  wide, 
providing  a  signal-to-quantization  noise  ratio  of  91  dB,  using 
Equations  (5)  and  (7).  The  calculation  assumes  a  500-Hz  band- 
width. A  smoothing  filter  at  the  VCO  output  is  not  necessary.  The 
filter  test  configuration  of  Figure  2  allows  the  input  aliasing  filter 
to  remove  the  effects  of  VCO  high-frequency  components,  as 
long  as  the  filter  under  test  is  a  linear  analog  circuit. 

The  vertical  and  horizontal  outputs  are  intended  to  interface  an 
oscilloscope  or  X-Y  plotter.  The  sampling  of  these  outputs  can 
be  non-uniform,  as  long  as  the  outputs  track  each  other.  The 
elastic  storage  at  the  input  and  VCO  interfaces  permits  arbitrary 
non-uniform  processing  of  the  analyzer  functions. 

The  16-bit  resolution  of  the  internal  data  word  provides  90-dB 
dynamic  range  according  to  Equation  (4),  or  115-dB  dynamic 
range  according  to  Equations  (5)  and  (7),  assuming  500-Hz 
bandwidth  and  no  limit  cycle  noise  and  aliasing. 
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VCO    HORIZ  VERT 


Figure  6.  Analyzer  Architecture 


Microprogram  control  was  selected  for  the  analyzer.  PAL  devi- 
ces can  efficiently  implement  sequential  state  machines.  It  is 
possible  to  encode  all  control  information  in  PAL  devices,  but 
only  three  packages  would  be  saved  (one  PROM  and  two 
buffers).  Distributing  the  control  among  several  PAL  devices 
would  reduce  flexibility  and  make  corrections  very  difficult.  The 
few  extra  packages  required  for  microprogram  control  provide 
an  extremely  flexible  architecture  and  greatly  simplify  the  PAL 
device  functions. 


With  the  theoretical  background  and  architecture  in  mind,  the 
spectrum  analyzer  functions  can  be  defined  in  detail.  The  objec- 
tive is  to  realize  a  circuit  capable  of  high-resolution  analysis  of 
audio  signals  in  the  DC  to  20-kHz  range.  Selectable  bandwidth 
and  linear/logarithmic  output  display  are  highly  desirable. 
Detailed  functions  are  shown  in  Figure  7. 
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Input  Aliasing  Filter 

An  analog  lowpass  filter  removes  high-frequency  components 
from  the  input  signals.  With  a  sample  frequency  of  50  kHz  and  a 
maximum  input  frequency  of  20  kHz,  the  lowest  aliasing  fre- 
quency is  (50-20)  =  30  kHz. 

An  eighth-order  Chebychev  filter  with  0.1-dB  passband  ripple 
will  provide  44  dB  of  attenuation  at  30  kHz,  and  86  dB  attenuation 
at  50  kHz.  It  is  desirable  to  provide  at  least  60-dB  overall  dynamic 
range  for  high-performace  analysis.  To  eliminate  spurious 
responses  above  the  -60-dB  "floor,"  the  input  signal  should  have 
all  components  above  30  kHz  suppressed  by  at  least  (60-44)  = 
16  dB.  Most  input  signals  will  meet  this  requirement.  If  not, 
additional  filtering  must  be  provided. 


H  and  A/D  Converter 

The  input  S&H  maintains  a  constant  sampled  signal  level  while 
the  A/D  conversion  is  in  progress.  No  sin  X/X  correction  is  made 
for  this  S&H  since  the  net  effect  of  the  S&H  plus  A/D  action  is  an 
impulse  sample  at  the  start  of  the  "hold"  period. 

The  A/D  conversion  time  should  be  less  than  16  mS.  The  A/D 
converter  output  digital  number  should  "saturate"  when  the 
input  signal  exceeds  the  maximum  level.  The  digital  numbers 
should  be  in  inverted  two's  complement  form.  The  S&H 
acquisition  time  should  be  less  than  4  jis. 
For  a  full  60-dB  overall  dynamic  range,  a  12-bit  A/D  is  required. 

Mixer 

The  mixer  multiplies  the  A/D  output  by  the  swept  sinewave 
osc  llator  value.  The  multiplication  produces  sum  and  difference 
frequencies,  according  to  equation  (1). 

Two's  complement  fractional  arithmetic  is  used  throughout  the 
anayzer.  Multiplication  cannot  overflow,  since  all  numbers  are 
less  than  1  in  magnitude. 


Swept  Sinusoidal  Oscillator  (VCO) 

A  precision  swept  sinusoid  from  DC  to  20  kHz  must  be 
generated  to  mix  with  the  input  signal.  A  technique  particularly 
well  suited  to  this  application  is  solving  the  two  equations: 


sin  (x+y)  =  sin  x  cos  y  +  cos  x  sin  y 
cos  (x+y)  =  cos  x  cos  y  -  sin  x  sin  y 


(13) 
(14) 

These  two  trigonometric  identities  generate  a  new  sin  and  cos 
value  with  y  representing  the  phase  shift  per  sample  period.  The 
technique  is  a  "CORDIC"  algorithm,  based  on  coordinate 
rotation.  Exact  results  are  produced,  but  truncation  and  round- 
off errors  due  to  the  finite  digital  word  length  can  cause  a  slow 
change  or  "drift"  of  carrier  amplitude.  Fortunately,  the  swept 
sinusoid  is  periodically  reset  in  the  spectrum  analyzer,  arresting 
this  amplitude  drift. 

The  VCO  frequency  is  swept  by  varying  the  value  of  y.  However, 
since  equations  (13)  and  (14)  requiresinyandcosy.an  identical 
CORDIC  algorithm  is  used  to  obtain  these  values.  To  sweep  the 
VCO,  then,  sinA  and  cosA  are  placed  in  RAM,  selected  by  the 
bandwidth  setting.  These  are  two  fixed  numbers  originally  stored 
in  PROM,  and  represent  the  frequency  shift  per  sample  time. 
Equations  (13)  and  (14)  are  then  applied  to  calculate  sin  y  and 
cos  y,  which  represent  the  desired  phase  shift  per  sample  time. 
Equations  (13)  and  (14)  are  executed  again  to  generate  the 
actual  sinusoidal  output. 


The  calculation  of  sin  y  and  cos  y  can  take  place  at  a  reduced 
sample  rate  to  save  processing  time.  Only  the  last  execution  of 
equations  (13)  and  (14)  must  be  performed  at  the  full  50-kHz 
sample  rate. 

A  linear  ramp  is  generated  to  provide  horizontal  drive  for  an 
oscilloscope  or  X-Y  plotter.  The  ramp  is  incremented  each  time 
the  sin  y  and  cos  y  values  are  updated,  tracking  the  VCO  sweep. 
When  the  ramp  value  overflows,  the  analyzer  sweep  cycle  is  re- 
initialized. 


«M  H(I,=i  ♦b1i-nb2*-2 


Figure  8.  All-Pole  Digital  Filter  Section 


Digital  Filters 

Fig.  8  shows  the  implementation  of  the  all-pole  digital  filter 
sections.  Because  of  the  low  pole  Q  values  in  all  filters,  the 
second  order  sections  can  be  simply  cascaded  to  implement 
high-order  filters.  Fig.  8  shows  a  technique  for  handling 
coefficients  greater  than  1  with  fractional  number  representa- 
tion. 

Scaling  must  be  performed  to  ensure  maximum  dynamic  range. 
Filter  sections  with  high-Q  poles  will  show  peaking  of  signals 
near  the  pole  frequencies.  The  input  to  such  sections  must  be 
scaled  down  to  prevent  overflow  of  the  arithmetic.  For  a  second- 
order  all-pole  section,  this  peaking  factor  is  exactly  the  Q  of  the 
poles.  Thus,  when  a  given  second-order  section  has  a  pole  Q  of 
2,  the  input  signal  to  that  section  must  be  multiplied  by  0.5  to 
prevent  overflow.  When  the  Q  is  less  than  or  equal  to  1.  no 
scaling  is  performed. 

Saturation  arithmetic  is  not  provided  in  this  architecture.  Careful 
scaling  eliminates  the  need  for  saturation  arithmetic,  since  the 
A/D  will  saturate  at  a  precisely  known  value. 


Aliasing  Filters 

Two  4th-order  Chebyschev  filters  permit  reduction  of  the 
sample  rate  following  the  mixer.  Each  filter  provides  0.3  dB 
passband  ripple  and  at  least  68  dB  attenuation  of  aliasing 
components.  The  slightly  high  passband  ripple  is  acceptable, 
since  subsequent  filters  will  dominate  the  composite  passband 
shape. 

The  first  filter  permits  a  sample  rate  reduction  factor  of  16.  It  is 
designed  with  a  passband  cutoff  frequency  of  479  Hz  and  a 
sample  rate  of  50  kHz.  Eq.  (12)  predicts  the  limit  cycle  noise  for 
this  filter  to  be  -58  dB. 
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The  second  filter  permits  a  second  sample  rate  reduction  factor 
of  two.  Its  cutoff  frequency  is  219  Hz,  with  a  sample  rate  of 
3.1 25  kHz.  Equation  (12)  predits  the  limit  cycle  noise  for  this  filter 
to  be  -83  dB.  This  filter  also  provides  an  additional  14-dB 
attenuation  of  the  limit  cycle  noise  generated  in  the  first  aliasing 
filter,  reducing  the  limit  cycle  noise  from  the  first  filter  to  -72  dB. 

These  two  filters  permit  an  overall  fs  reduction  factor  of  32 
before  processing  the  Bessell  filter,  detector,  and  Iinear-to-log2 
functions.  This  results  in  a  very  substantial  throughput  improve- 
ment. Net  execution  time  is  determined  by  the  time  to  execute  a 
given  function  multiplied  by  the  sample  rate  for  that  function. 
Thus  32  instructions  at  the  reduced  rate  will  increase  the  net 
execution  time  by  an  amount  equivalent  to  only  1  instruction  at 
the  full  sample  rate. 

Fig.  9  shows  the  aliasing  spectra  of  the  sample  rate  reduction 
process. 
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Figure  9.  Aliasing  Spectra  for  fs  Reduction 


Lowpass  Filter 


A  4th-order  Bessel  lowpass  filter  determines  the  overall  band- 
width of  the  analyzer.  The  overall  bandwidth  is  twice  the 
bandwidth  of  this  filter.  Overall  bandwidths  of  20,  50, 100  or  500 
Hz  are  provided  by  loading  the  proper  set  of  filter  coefficients 
into  RAM  when  the  bandwidth  is  selected. 

The  Bessel  filter  provides  an  optimal  transient  response  for  the 
analyzer.  Good  transient  response  is  important,  especially  at 
narrow  bandwidths,  since  the  spectral  peaks  are  swept  with 
respect  to  the  titer  passband.  The  net  effect  is  similar  to  pulsing 
the  filter  input.  Because  the  phase  response  is  critical,  the 
matched-z  transform  is  used  to  convert  the  analog  Bessel  design 
to  the  z-domain. 

The  second  aliasing  filter  provides  3  dB  of  attenuation  at  250  Hz. 
When  cascaded  with  the  Bessel  filter,  which  also  provides  3  dB 
attenuation  at  250  Hz  in  the  500  Hz  bandwidth  mode,  the 
response  at  this  bandwidth  is  modified.  However,  since  the 
overall  bandwidth  is  relatively  broad,  good  transient  response  is 
still  achieved.  Cascading  these  two  filters  provides  a  "transitional" 
filter  with  a  Bessel  response  at  low  attenuation  and  a 
Chebyschev  response  at  high  attenuation.  At  bandwidths  less 
than  500  Hz,  the  combination  produces  an  optimal  tradeoff 
between  transient  response  and  resolution. 

Analysis  of  Equation  (12)  reveals  that  limit  cycle  noise  increases 
exponentially  as  the  pole  frequency  is  reduced.  Operating  the 
lowpass  filter  at  the  lowest  possible  sampling  frequency  (1 .5625 
kHz)  minimizes  limit  cycle  noise,  in  addition  to  improving 
throughput.  Limit  cycle  noise  for  the  lowpass  filter  will  be  -95  dB 


at  the  500-Hz  bandwidth,  increasing  to  -67  dB  at  the  20-Hz 
bandwidth. 

Detector 

A  square-law  detector  provides  a  DC  signal  corresponding  to 
the  energy  at  the  lowpass  filter  output.  From  trigonometry: 


(A  cos  Wtf 


A2 


cos  2wt) 


(15) 


The  detector  output  contains  the  desired  DC  term  and  a  single 
undesired  term  at  frequency  2w.  If  the  square  law  is  ideal  (easy 
in  the  digital  domain),  no  additional  terms  are  produced.  The 
elimination  of  harmonics  ensures  the  accuracy  of  the  detector 
with  fs/32  =  1.5625  kHz.  The  highest  component  is  always  less 
than  fs/64  with  a  250  Hz  maximum  lowpass  filter  cutoff 
frequency.  However,  2w  can  be  anywhere  from  DC  to  500  Hz  as 
the  VCO  sweeps  past  the  spectral  component. 
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Figure  10.  Detector  Sweep  Filtering 

Fig.  10  illustrates  a  technique  to  render  the  effects  of  the  2w 
terms  negligible.  The  analyzer  passband  is  divided  into  n  equal 
intervals.  The  VCO  sweep  rate  is  controlled  so  that  the  VCO 
sweeps  BW/n  perfs/32  interval.  The  detector  is  followed  by  a 
single-pole  lowpass  filter  with  a  3  dB  frequency  of  BW/n.  As  the 
VCO  sweeps  a  component  through  the  passband.  the  DC  term  is 
present  in  all  n  intervals,  but  the  2w  term  can  affect  only  one 
interval.  The  worst-case  DC  error  is  1/n  for  an  ideal  cutoff,  and 
is  multiplied  by  (2°  +  2-1  +  2"3  +2^  +  2~5  +2"5-5  +  2~6  + 
2-6.5  +  2-7  +  2-7  25  +  ....)  =  1 .85  due  to  the  finite  6  dB/octave 
rolloff  of  the  single-pole  filter.  Further  analysis  reveals  that: 


1.85 


10 


e/10 


(16) 


where  e  represents  the  resulting  error  in  dB.  For  e  =  0.1  dB, 
n  =  80. 

In  the  filter  test  mode,  the  signal  frequency  and  VCO  frequency 
are  the  same,  forcing  w  =  0.  The  detector  has  no  error  in  this 
mode,  but  has  a  3  dB  gain  due  to  the  second  DC  term. 
The  detector  output  represents  signal  energy.  Each  bit  in  the 
detector  output  word  thus  represents  only  3  dB,  and  21  bits  are 
required  to  reflect  a  60  dB  dynamic  range.  Double  precision 
arithmetic  is  required  for  the  detector  ouput  and  the  sinqle-pole 
filter.  The  67C7560  multiplier  will  handle  double  precision  calcu- 
lations with  a  time  penalty.  Fortunately,  the  calculations  to  be 
performed  are  simple  and  the  operations  take  place  at  the 
minimum  sample  rate,  reducing  the  impact  on  throughput. 

Linear-to-Log  Conversion 

The  architecture  of  Figure  6  is  customized  to  provide  an  efficient 
algorithm  for  linear-to-logarithmic  output  conversion.  The  RAM 
address  generator  monitors  the  8  MSBs  of  the  data  bus,  and  can 
provide  a  number  indicating  the  MSB  position  of  a  positive 
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i  nis  value  is  used  to  scale  the  data  word  to  quickly  left-justify 
the  MSB.  A  second  4-point  lookup  table  is  then  used  to  improve 
the  accuracy  of  the  resulting  log2  conversion.  A  log2  con- 
version is  adequate,  since: 

log,  x 

loginx  = — —  (17) 
10  log210 

Equation  (17)  demonstrates  that  the  output  can  be  displayed  in 
decibels  by  setting  the  oscilloscope  or  X-Y  plotter  Y-axis  gain  to 
the  proper  value. 

Two  lookup  tables  provide  .027  dB  accuracy  for  output  values 
from  0  to  -45  dB.  and  3  dB  accuracy  from  -45  to  -84  dB.  The 
logarithmic  accuracy  is  limited  by  the  10-bit  output  word  length 
to  the  D/A.  This  output  can  represent  0  to  -84  dB  in  .082  dB 
increments.  The  accuracy  of  the  4-point  lookup  is  therefore 
sufficient. 

The  logarithmic  conversion  procedure  is  as  follows: 
1)  If  the  MSB  of  the  data  word  is  not  among  the  8  MSBs  into  the 
address  generator,  multiply  the  word  by  27  =  128.  and 
increment  the  output  number  by  7.  Repeat  until  the  data 
word  is  greater  than  2-7,  but  no  more  than  three  times. 
Set  a  flag  if  this  step  is  executed  more  than  once. 


log2  of  the  scale  factor  to  the  output  word.  The  conversion 
is  now  accurate  to  3  dB. 

3)  If  the  flag  was  not  set  during  step  1,  multiply  the  data  word 
by  the  scale  factor  to  left-justify  the  MSB  position. 

4)  If  the  flag  was  not  set  during  step  1,  retrieve  an  intercept 
and  slope  value  from  the  8-word  lookup  table  (four  pairs 
available).  Perform  a  linear  interpolation  using: 

x'  =  a  x  +  b 

where  a  is  the  slope  value  (18) 
bis  the  intercept  value 

The  conversion  is  now  accurate  to  .027  dB. 

5)  Scale  the  result  to  provide  84-dB  output  range  with  a 
10-bit  word. 

6)  Subtract  2~1  from  the  output  to  convert  it  to  two's  com- 
plement form  for  the  D/A. 

The  calculations  are  double  precision  for  steps  1 .  2,  and  3,  and 
single  precision  thereafter.  The  conversion  sequence  can  be  by- 
passed using  a  strap  option  to  provide  a  linear  amplitude  output 
from  0  to  -30  dB. 


HORIZ  VERT 


Figure  11.  Simplified  Schematic  Hi-Fi  Audio  Spectrum  Analyzer 
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Control  Logic 

Figure  11  shows  a  simplified  schematic  of  the  analyzer.  All 
critical  components  are  shown.  Bypass  capacitors  and  some 
component  input  connections  are  omitted  for  clarity. 

The  microprogram  is  stored  in  three  63S281  PROMs.  The 
microcode  word  formats  are  shown  in  Figure  12.  A  wide,  highly- 
parallel  instruction  word  ensures  maximum  flexibility  and  pro- 
gram efficiency. 

Eight  PAL  devices  interpret  the  instruction  word  and  control  the 
analyzer.  Two  additional  PAL  devices  generate  a  50-kHz  strobe 
from  the  8-MHz  master  clock,  and  implement  the  output  D/A 
multiplexer.  The  control  PAL  devices  function  as  follows: 

Sequencer:  A  PAL20X8  implements  an  8-bit  instruction  sequen- 
cer. The  sequencer  performs  the  following  operations: 


CI  Ctf  CX  Operation 

0  0  X   Increment  by  1  (execute  next  instruction) 

0  1  0   Increment  by  2  (skip  next  instruction) 

1  0  0  Jump  to  address 

1  1  0   No  increment  (repeat  current  instruction) 

The  CX  input  conditions  the  sequencer.  Conditional 
branches  or  skip  operations  can  be  implemented.  The 
sequencer  will  increment  if  the  conditional  requirement  is 
not  met. 

Condition  detector:  A  PAL16C1  monitors  up  to  twelve  status 
flags,  and  generates  CX.  The  microcode  word  includes  a  4-bit 
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Figure  12.  Microinstruction  Word  Formats 
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condition  word,  CXO  through  CX3.  CX  will  be  zero  under  the 
following  conditions: 

CX3  CX2  CX1  CXO  Condition  for  CX  =  0 


Always  (unconditional  operation) 

Sample  strobe  (SS)  =  1 

AU  overflow  (OVF)  =  1 

AU  busy  (AUB)  =  1 

Input  sample  ready  (INR)  =  0 

Loop  counter  timeout  (LLC)  =  0 

LLC  =  1 

Address  control  (AC3-AC0)  =  0 
ACO  =  0 
1 
0 
1 
0 


ACO 
AC1 
AC1 
AC2 
AC2 
AC3 
AC3 


The  assignment  is  made  using  the  flexible  PAL  device  coding, 
and  is  optimized  for  the  analyzer.  The  user  can  select  a  different 
set  of  conditions  by  reprogramming  the  PAL  device. 
When  the  microcode  represents  a  constant  (Type  1  microinstruc- 
tion —  see  Figure  12)  the  CON  input  forces  CX  =  1  to  suppress 
conditional  operations.  CX  is  also  used  to  suppress  certain 
strobes  in  the  analyzer,  providing  conditional  arithmetic 
operations. 

Loop  counter:  A  PAL16R6  implements  a  6-bit  programmable 
down  counter.  This  counter  controls  iteration  loops  and  provides 
a  timeout  signal  to  the  condition  detector.  The  counter  is  preset 
via  a  Type  2  microinstruction,  and  can  be  decremented  by  other 
Type  2  microinstructions.  The  counter  will  halt  when  zero  count 
is  reached.  Up  to  64  iterations  can  be  accommodated  with 
minimal  overhead. 

Address  control:  A  PAL22V10  provides  indexed  addressing  for 
the  32x1 6  RAM,  and  analyzes  the  eight  MSBs  of  the  data  bus  for 
conditional  operations.  If  D15  represents  the  data  bus  sign  bit, 
then  OP1-OP3  will  provide  the  following  functions: 

OP3  OP2  OP1  AC3-AC0  Output  Function 

0  0  0  Clear  (0000) 

0  0  1  Increment 

0  1  0  Decrement 

0  1  1  Preset  to  D15-D12  (Sign  +  3  MSB) 

1  0  0  Preset  to  D14-D11  (4  MSB) 

1  0  1  Preset  to  D11-D8  (Address  load) 

1  1  0  MSB  position 

1  1  1  No  change 

The  ADE  input  enables  a  change  in  the  address  word.  The 
address  word  will  not  change  if  ADE  =  1. 

The  MSB  position  function  indicates  the  position  of  the  MSB  for 
positive  numbers.  AC3  represents  sign  bit  D15.  This  output 
should  be  zero.  AC2-AC1  represent  the  position  of  the  first  1 
following  the  sign  bit.  Code  0000  indicates  that  D15-D8  are  all  0. 

Input/RAM  control:  Miscellaneous  FIFO  input  and  RAM  control 
is  provided  by  a  PAL  10L8.  The  67401  FIFO  includes  input  ready 
(FIR)  and  output  ready  (FOR)  signals,  which  are  latched  using 
the  input/output  shift  clocks  to  generate  two  flags.  The  first  flag 
(FR)  resets  the  FIFO  when  input  ready  (latched)  goes  low. 
indicating  the  FIFO  capacity  is  exhausted.  The  latched  output 
ready  signal  flag  represents  input  sample  ready  (INR).  The  INR 


flag  is  used  as  a  sequencer  condition  to  synchronize  wait  loops. 
Use  of  the  FR  and  INR  flags  maintains  proper  fill  of  the  FIFO. 

The.  RAM  address  LSB  (AO)  and  read-write  line  (R/W)  are 
decoded  and  latched.  These  signals  are  provided  directly  by 
Type  IV  microinstructions. 

Notice  that  a  clocked  register  function  requires  two  PAL 
combinatorial  outputs  per  bit,  while  a  transparent  latch  function 
requires  only  one  PAL  output  per  bit. 

Arithmetic  unit  control:  The  variety  of  functions  listed  in  Table  3 
indicate  the  utility  of  the  arithmetic  unit  (AU).  A  PAL16R6  pro- 
vides simplified  control  of  the  AU. 

The  PAL  devices  and  AU  load  signal  provide  conditional  arith- 
metic operations.  Gating  the  load  input  will  suppress  the  start  of 
a  new  arithmetic  operation.  When  CXE  is  high,  the  operation  is 
performed  unconditionally.  When  CXE  is  low,  the  operation  is 
performed  only  if  CX  is  low.  Combining  conditional  jumps  and 
conditional  AU  operations  provides  a  high  degree  of  program 
flexibility. 

The  PAL  device  monitors  the  AU  instructions  and  generates  a 
busy  signal  (AUB).  A  counter  in  the  PAL  device  keeps  track  of 
variable-length  operations  to  provide  the  correct  output  for  any 
instruction  sequence.  The  AUB  signal  conditions  the  sequencer 
to  synchronize  the  microprogram  to  the  AU  operation.  Microp- 
rogramming is  simplified  as  a  result. 

The  PAL  device  also  gates  the  input  FIFO  shift  out  clock  (INS)  to 
eliminate  transients  while  providing  a  full  125-ns  pulse  for  proper 
FIFO  operation. 

Data  strobe  generator:  A  PAL10L8  provides  a  number  of 
transient-free,  gated  strobes.  These  strobes  provide  control  of  the 
analyzer  data  flow.  The  PAL  device  interprets  the  microinstruc- 
tion to  determine  the  proper  microinstruction  type,  and  gener- 
ates the  strobes  accordingly. 

The  PAL  device  also  generates  an  8-MHz  buffered  clock,  as 
shown  in  Figure  1 1 .  The  crystal  oscillator  circuit  provides  inde- 
pendent AC  and  DC  feedback,  permitting  reliable  operation  with 
the  PAL  device. 

Strap/output  sample  control:  A  PAL16L8  generates  additional 
control  strobes  for  the  output  sample-and-hold  circuits. 

The  PAL  device  also  provides  a  tristate  buffer  function,  connect- 
ing control  straps  to  the  data  bus  for  certain  conditional  jump 
operations.  Two  straps  select  the  desired  analyzer  bandwidth/ 
sweep  rate,  and  the  third  strap  selects  linear  or  logarithmic 
output. 

Signal  output 

The  VCO  output  must  be  sampled  at  precise  intervals  to  avoid 
phase  modulation  effects.  Three  67401  FIFOs  buffer  the  VCO 
samples,  which  are  generated  during  the  50-kHz  input  process- 
ing. A  12-bit  D/A  converter  provides  better  than  91  dB  signal-to- 
distortion  ratio.  The  S&H  circuit  provides  VCO  outputs  at  pre- 
cise 50-kHz  intervals,  and  removes  spikes  that  are  generated  in 
the  D/A  converter.  All  necessary  control  signals  are  generated 
by  the  strap/output  data  control  PAL  device. 

The  horizontal  and  vertical  outputs  normally  drivean  X-Y  plotter  or 
oscilloscope.  There  is  no  need  to  buffer  these  signals  as  long  as 
the  two  outputs  track  each  other.  The  D/A  used  for  the  VCO 
output  is  shared  by  adding  two  PAL12H6  chips  programmed  as 
multiplexers.  Use  of  PAL  devices  requires  fewer  packages  than  a 
TTL  multiplexer.  Additional  S&H  circuits  decode  the  multiplexed 
D/A  output  to  separate  the  output  signals. 
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Microprogram 

The  architecture  can  implement  a  variety  of  DSP  functions.  A 
microprogram,  stored  in  63S281  PROMs,  customizes  the  archi- 
tecture to  perform  the  spectrum  analyzer  tasks.  The  microin- 
struction formats  were  summarized  in  Figure  12.  The  algorithms 
to  be  implemented  were  discussed  in  the  previous  section.  The 
step-by-step  implementation  of  these  algorithms  is  converted  to 
a  sequence  of  microinstructions  to  form  the  microprogram.  The 
procedure  is  analogous  to  programming  a  microprocessor. 

Operation  of  the  microprogram  is  better  understood  by  consid- 
ering the  allocation  of  the  74S218  RAM  locations,  shown  in 
Figure  13.  The  microprogram  consists  of  two  parts.  High-speed 
input  processing  provides  the  carrier  generation,  mixing,  alias- 
ing filter  and  lowpass  filter  functions.  Figure  13a  shows  the  RAM 
allocation  during  input  processing.  The  input  segment  includes 
an  efficient  iteration  loop,  using  the  PAL  device  loop  counter,  to 
process  the  50-kHz  functions.  The  carrier  frequency  shift  and 
lowpass  filter  functions  are  processed  at  the  fs/32  reduced  sam- 
ple rate  for  maximum  throughput  efficiency. 

The  values  of  sin  A,  cos  A.  and  the  Bessell  filter  coefficients 
depend  on  the  analyzer  bandwidth  strap  selection.  These  values 
are  stored  in  a  "table"  area  in  Fig.  1 3,  and  can  be  easily  changed. 
The  fixed  aliasing  filter  coefficients  are  stored  as  constants  in  the 
microprogram  itself. 


Once  the  input  processing  is  complete,  coefficients  located  in 
the  table  area  can  be  changed.  This  area  is  re-used  by  the  output 
program  segment  to  hold  the  scale  factors  for  the  linear-to-log 
conversion  routine.  The  detector  functions  are  processed,  and 
the  logarithmic  conversion  is  started  with  the  RAM  allocation  of 
Figure  13(b).  The  table  area  is  then  reloaded  with  the  interpola- 
tion coefficients  (Figure  1 3(c))  to  complete  the  logarithmic  con- 
version. Shaded  areas  in  Figure  1 3  provide  temporary  data  and 
flag  storage  for  the  routines. 

The  microprogram  samples  the  strap  settings  and  loads  the  table 
area  with  the  appropriate  coefficients  for  input  processing.  The 
detector  filter  coefficient  (bi )  is  also  determined  and  loaded. 
The  input  processing  is  then  repeated.  This  sequence  repeats 
indefinitely.  The  coefficient  loading  technique  makes  efficient 
use  of  RAM  capacity  while  eliminating  elaborate  jump 
sequences.  All  coefficient  table  updates  are  processed  at  the 
minimum  sample  rate  for  best  efficiency. 

The  PAL  device  controllers  simplify  the  microprogram.  A  PAL 
device  provides  hardware  iteration  loops.  The  AU  controller 
eliminates  wasteful  "NO-OP"  instructions  otherwise  required  to 
allow  completion  of  AU  operations.  The  input  control  PAL 
device  simplifies  handshaking  with  the  input  logic.  With  the 
benefit  of  the  PAL  device  controllers,  the  analyzer  microprogram 
easily  fits  into  the  256-instruction  capacity  of  the  PROMs, 
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TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 


AN-100  DSP  Counter 

DSPCOUNT 

1 

Marc  Baker 
Monolithic  Memories 
August  20, 


1987 


CHIP  DSPCOUNT  PAL20X8 


CLK  NC  OP3  OP4  OP5  OP6  OP7  CO  CI  /CX  NC  GND 
/OE  NC  A7     A6     A5     A4     A3     A2  Al  AO     /LOAD  VCC 


EQUATIONS 

LOAD  =     /CO*  CI*  CX 

/AO    :=  /LOAD*/A0 

+     /CO*  CI*  CX 
:+:  /C0*/C1 

/Al    :=  /LOAD*/Al 

+  /CO*  CI*  CX 

:+:  /C0*/C1*  AO 

+  C0*/C1*  CX*/A0 

/A2    :=  /LOAD*/A2 

+  /CO*  CI*  CX 

:+:  /C0*/C1*           Al*  AO 

+  C0*/C1*  CX*  A1*/A0 

/A 3    :=  /LOAD*/A3 

+  /CO*  CI*  CX*/OP3 

:+:  /C0*/C1*           A2*  Al*  AO 

+  C0*/C1*  CX*  A2*  A1*/A0 

/A4    :=  /LOAD*/A4 

+  /CO*  CI*  CX*/OP4 

:+:  /C0*/C1*           A3*  A2*  Al*  AO 

+  C0*/C1*  CX*  A3*  A2*  A1*/A0 


; FED  BACK  TO  EQUATIONS 

; HOLD/INCREMENT  BY  2 
; LOAD  0 

; INCREMENT  BY  1 

;  HOLD 
; LOAD  0 

; INCREMENT  BY  1 
INCREMENT  BY  2 

;  HOLD 
; LOAD  0 

.•INCREMENT  BY  1 
.•INCREMENT  BY  2 

;  HOLD 
;  LOAD 

; INCREMENT  BY  1 
INCREMENT  BY  2 

.•HOLD 
;  LOAD 

.•INCREMENT  BY  1 
; INCREMENT  BY  2 


/A5 


/A6 


/A7 


:=  /LOAD*/A5  ; HOLD 

+  /CO*  CI*  CX*/OP5  ; LOAD 

:+:  /C0*/C1*           A4*  A3*  A2*  Al*  AO    ; INCREMENT  BY  1 

+  C0*/C1*  CX*  A4*  A3*  A2*  A1*/A0    ; INCREMENT  BY  2 


: =  /LOAD*/A6 

+  /CO*  CI*  CX*/OP6 

:+:  /C0*/C1*           A5*  A4*  A3*  A2*  Al*  AO 

+  C0*/C1*  CX*  A5*  A4*  A3*  A2*  A1*/A0 


HOLD 
LOAD 
INC  BY  1 
INC  BY  2 


:=  /LOAD*/A7  ; HOLD 

+  /CO*  CI*  CX*/OP7  ; LOAD 

:+:  /C0*/C1*           A6*  A5*  A4*  A3*  A2*  Al*  AO    ; INC  BY  1 

+  C0*/C1*  CX*  A6*  A5*  A4*  A3*  A2*  A1*/A0   ; INC  BY  2 
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SIMULATION 

TRACE  ON  CLK  A7  A6  A5  A4  A3  A2  Al  AO 


;Trace  CLK  and  outputs 


SETF  OE  /CO  CI  CX  /OP7  /OP6  /OP5  /OP4  /OP3 

CLOCKF  CLK  ;Load  all  Os 

CHECK  /A7  /A6  /A5  /A4  /A3  /A2  /Al  /AO 


SETF  CO  /CI 
CLOCKF  CLK 

CHECK  /A7  /A6  /A5  /A4  /A3  /A2  Al  /AO 
CLOCKF  CLK 

CHECK  /A7  /A6  /A5  /A4  /A3  A2  /Al  /AO 
CLOCKF  CLK 

CHECK  /A7  /A6  /A5  /A4  /A3     A2     Al  /AO 

SETF  /CO 
CLOCKF  CLK 

CHECK  /A7  /A6  /A5  /A4  /A3  A2  Al  AO 
CLOCKF  CLK 

CHECK  /A7  /A6  /A5  /A4     A3  /A2  /Al  /AO 

SETF  CO  CI 
CLOCKF  CLK 
CLOCKF  CLK 

CHECK  /A7  /A6  /A5  /A4     A3  /A2  /Al  /AO 


(•Increment  by  2 
;to  2 

;Increment  by  2 
;to  4 

,-Increment  by  2 
;to  6 


,-Increment  by  1 
;to  7 

; Increment  by  1 
;to  8 


;Hold 
;Hold 
;to  8 


TRACE  OFF 
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TITLE  AN-100  DSP  Condition 

PATTERN  DSPCOND 

REVISION  1 

AUTHOR  Marc  Baker 

COMPANY  Monolithic  Memories 

DATE  August   20,  1987 

CHIP  DSPCOND  PAL16C1 

CX3     CX2  CXI  CXO  SS     OVF  AUB  INR  /LLC  GND 

/CON  ACO  AC1  AC 2  /CX  COMPCX  AC 3  NC     NC  VCC 

EQUATIONS 

CX  =  /CX3*/CX2*/CX1*/CX0*  CON 

+  /CX3*/CX2*/CX1*  CXO*  SS*  CON 

+  /CX3*/CX2*  CX1*/CX0*  OVF*  CON 

+  /CX3*/CX2*  CXI*  CXO*  AUB*  CON 

+  /CX3 *  CX2*/CX1*/CX0*/INR*  CON 

+  /CX3 *  CX2*/CX1*  CXO*  LLC*  CON 

+  /CX3 *  CX2*  CX1*/CX0*/LLC*  CON 

+  /CX3*  CX2*  CXI*  CX0*/AC3*/AC2*/AC1*/AC0*  CON 

+     CX3*/CX2*/CX1*/CX0*/AC0*  CON 

+     CX3*/CX2*/CX1*  CXO*  ACO*  CON 

+     CX3*/CX2*  CX1*/CX0*/AC1*  CON 

+     CX3*/CX2*  CXI*  CXO*  AC1*  CON 

+     CX3*  CX2*/CX1*/CX0*/AC2*  CON 

+     CX3*  CX2*/CX1*  CXO*  AC2*  CON 

+     CX3*  CX2*  CX1*/CX0*/AC3*  CON 

+     CX3*  CX2*  CXI*  CXO*  AC3*  CON 

SIMULATION 

SETF  /CON  ;SET  /CX  HIGH 

CHECK  /CX 

SETF  CON  /CX3  /CX2  /CXI  /CXO  ;CX3-CX0=0  -  SET  /CX  LOW 

CHECK  CX 

SETF  CXO   SS  ;CX3-CX0=1  -  SET  /CX  LOW 

CHECK  CX 

SETF  CXI  /CXO  OVF  ;CX3-CX0=2  -  SET  /CX  LOW 

CHECK  CX 

SETF  CXO  AUB  ;CX3-CX0=3  -  SET  /CX  LOW 

CHECK  CX 

SETF  CX2  /CXI  /CXO  /INR  ;CX3-CX0=4  -  SET  /CX  LOW 

CHECK  CX 

SETF  CXO  LLC  ;CX3-CX0=5  -  SET  /CX  LOW 

CHECK  CX 
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SETF  CXI  /CXO  /LLC 
CHECK  CX 

SETF  CXO  /AC 3  /AC2  /AC1  /ACO 
CHECK  CX 

SETF  CX3  /CX2  /CXI  /CXO 
CHECK  CX 

SETF  CXO  ACO 
CHECK  CX 

SETF  CXI  /CXO 
CHECK  CX 

SETF  CXO  AC1 
CHECK  CX 

SETF  CX2  /CXI  /CXO 
CHECK  CX 

SETF  CXO  AC 2 
CHECK  CX 

SETF  CXI  /CXO 
CHECK  CX 

SETF  CXO  AC 3 
CHECK  CX 


;CX3-CX0=6  -  SET  /CX  LOW 
;CX3-CX0=7  -  SET  /CX  LOW 
;CX3-CX0=8  -  SET  /CX  LOW 
;CX3-CX0=9  -  SET  /CX  LOW 
;CX3-CX0=10  -  SET  /CX  LOW 
;CX3-CX0=11  -  SET  /CX  LOW 
;CX3-CX0=12  -  SET  /CX  LOW 
;CX3-CX0=13  -  SET  /CX  LOW 
;CX3-CX0=14  -  SET  /CX  LOW 
;CX3-CX0=15  -  SET  /CX  LOW 
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TITLE  AN-100  DSP  Address  Control 

PATTERN  ADDCONT 

REVISION  1 

AUTHOR  Marc  Baker 

COMPANY  Monolithic  Memories 

DATE  August  21,  1987 


CHIP  ADDCONT  PAL22V10 


CLK  OP3  OP2  OP1  /ADE  D15  D14  D13  D12  NC  NC  GND 
NC  NC  Dll  D10  ACO  AC1  AC 2  AC 3  D9  D8  NC  VCC 
GLOBAL 


EQUATIONS 


ACO  :  = 
+ 
+ 
+ 
+ 
+ 
+ 
+ 
+ 
+ 


/OP3*/OP2*  OP1*/ACO*  ADE 
/OP3*  OP2*/OP1*/ACO*  ADE 
/OP3*  OP2*  OP1*  D12*  ADE 
OP3*/OP2*/OPl*  Dll*  ADE 
OP3*/OP2*  OP1*  D8  *  ADE 
OP3*  OP2*/OPl*  D14*  ADE 
OP3*  OP2*/OPl*/D14*/D13 


INC 
DEC 
D12 
Dll 
D8 

MSB  EQUATIONS 


D12*  ADE 

OP3*  0P2*/0P1*/D14*/D13*/D12*/D11*  D10*  ADE 
OP3*  OP2*/OP1*/D14*/D13*/D12*/D11*/D10*/D9*D8*  ADE 
OP3*  OP2*  OP1*  ACO  ; HOLD 


ACO* /ADE 


;  HOLD 


AC1    :=  /OP3*/OP2*  OP1*  AC1*/AC0*  ADE  ; INC 

+  /OP3*/OP2*  0P1*/AC1*  ACO*  ADE  ; INC 

+  /OP3*  OP2*/OPl*  AC1*  ACO*  ADE  ;DEC 

+  /OP3*  OP2*/OP1*/AC1*/ACO*  ADE  ;DEC 

+  /OP3*  OP2*  OP1*  D13*  ADE  ;D13 

+     OP3*/OP2*/OPl*  D12*  ADE  ;D12 

+     OP3*/OP2*  OP1*  D9  *  ADE  ;D9 

+  OP3*  OP2*/OPl*  D14*  ADE  ;MSB  EQUATIONS 
+     OP3*  OP2*/OPl*/D14*  D13*  ADE 

+     OP3*  OP2*/OPl*/D14*/D13*/D12*/Dll*  D10*  ADE 
+     OP3*  OP2*/OP1*/D14*/D13*/D12*/D11*/D10*  D9*  ADE 

+     OP3*  OP2*  OP1*  AC1  ; HOLD 

+     AC 1* /ADE  ; HOLD 
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AC 2    :=  /OP3*/OP2*  0P1*/AC2*  AC1*  ACO*  ADE  ; INC 

+  /OP3*/OP2*  OP1*  AC2*/AC1*  ADE  ; INC 

+  /OP3*/OP2*  OP1*  AC2*  /ACO*  ADE  ; INC 

+  /OP3*  OP2*/OP1*/AC2*/AC1*/ACO*  ADE  ; DEC 

+  /OP3*  OP2*/OPl*  AC 2*  AC1*  ADE  ; DEC 

+  /OP3*  OP2*/OPl*  AC 2*  ACO*  ADE  ; DEC 

+  /OP3*  OP2*  OP1*  D14*  ADE  ;D14 

+     OP3*/OP2*/OPl*  D13*  ADE  ;D13 

+     OP3*/OP2*  OP1*  D10*  ADE  ;D10 

+     OP3*  OP2*/OPl*  D14*  ADE  ;MSB  EQUATIONS 

+     OP3*  OP2*/OPl*  D13*  ADE 

+     OP3*  OP2*/OPl*  D12*  ADE 

+     OP3*  OP2*/OPl*  Dll*  ADE 

+     OP3*  OP2*  OP1*  AC 2  ; HOLD 

+     AC 2* /ADE  ; HOLD 


AC  3  :  = 
+ 
+ 
+ 
+ 
+ 
+ 
+ 
+ 
+ 
+ 
+ 
+ 


/OP  3 
/OP  3 
/OP  3 
/OP  3 
/OP  3 
/OP  3 
/OP  3 
/OP  3 
/OP  3 
OP  3 
OP  3 
OP  3 
AC  3 


*/OP2 
*/OP2 
*/OP2 
*/OP2 

*  OP2 

*  OP2 

*  OP2 

*  OP2 

*  OP2 
*/OP2 
*/OP2 

*  OP2 
*/ADE 


*  OPl*/AC3*  AC 2*  AC1*  ACO*  ADE 

*  OP1*  AC3*/AC2*  ADE 

*  OP1*  AC 3*  /AC1*  ADE 

*  OP1*  AC 3*  /ACO*  ADE 
*/OPl*/AC3*/AC2*/ACl*/AC0*  ADE 
*/OPl*/AC3*  AC 2*  ADE 
*/OPl*/AC3*  AC1*  ADE 
*/OPl*/AC3*                       ACO*  ADE 

*  OP1*  D15*  ADE 
*/OPl*  D14*  ADE 

*  OP1*  Dll*  ADE 

*  OP1*  AC 3 


SIMULATION 


INC 
:  INC 
:  INC 
;  INC 
:DEC 
:DEC 
;DEC 
;DEC 
;D15 
;D14 
;D11 
;  HOLD 
;  HOLD 


TRACE  ON  CLK  AC 3  AC 2  AC1  ACO 


SETF  ADE  /OP3  /OP2  /OP1 
CLOCKF  CLK 

CHECK  /AC 3  /AC 2  /AC1  /ACO 

SETF  OP1 
CLOCKF  CLK 

CHECK  /AC 3  /AC2  /AC1  ACO 
CLOCKF  CLK 

CHECK  /AC 3  /AC 2  AC1  /ACO 

CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 

CHECK  AC 3  /AC 2  /AC1  /ACO 


; CLEAR  TO  0 

;  INCREMENT  TO  1 
; INCREMENT  TO  2 


; INCREMENT  TO  8 
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SETF  OP 2  /OP1 
CLOCKF  CLK 

CHECK  /AC3  AC 2  AC1  ACO 

CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 

CHECK  /AC 3  /AC 2  AC1  ACO 

SETF  OP1  /D15  D14  /D13  1 
CLOCKF  CLK 

CHECK  /AC 3  AC 2  /AC1  ACO 

SETF  OP3  /OP2  /OP1 
CLOCKF  CLK 

CHECK  AC 3  /AC 2  AC1  /ACO 

SETF  OP1  D10  /D9  D8 
CLOCKF  CLK 

CHECK  /AC 3  AC 2  /AC1  ACO 

SETF  OP2  /OP1 
CLOCKF  CLK 

CHECK  /AC 3  AC 2  AC1  ACO 

SETF  /D14 
CLOCKF  CLK 

CHECK  /AC 3  AC 2  /AC1  ACO 


; DECREMENT  TO  7 
; DECREMENT  TO  3 

2  /Dll 

;SET  TO  D15-D12 
;SET  TO  D14-D11 
;SET  TO  D11-D8 
; CHECK  MSB 
; CHECK  MSB 


SETF  OP1 

CLOCKF  CLK 

CHECK  /AC 3  AC 2  /AC1 

SETF  /OP3  /OP2  /OP1 

CLOCKF  CLK 

CHECK  /AC 3  AC 2  /AC1 


;  HOLD 

ACO 

/ADE  ; HOLD 

ACO 


TRACE  OFF 
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Bus  interface  plays  an  important  role  in  computer  architecture 
design.  As  systems  become  more  complicated,  the  bus  require- 
ment becomes  more  critical  in  terms  of  speed,  power,  accessibil- 
ity and  feasibility.  There  are  many  different  kinds  of  buses  in  the 
electronics  industry.  Each  of  the  buses  is  specially  designed  to 
fit  into  a  special  architecture  for  various  purposes.  There  are 
many  stringent  rules  and  regulations  to  follow  depending  on  the 
application  to  design  a  bus  architecture  or  bus  interface.  The 
application  depends  on  timing  (synchronous  or  asynchronous), 
type  (military  or  commercial),  architecture  (open  or  closed),  data 
structure  (serial  or  parallel),  computer  type  (micro,  mini  or  others) 
or  a  mix  of  the  above. 

There  have  been  nearly  one  hundred  buses  since  the  invention 
of  the  computer.  A  great  deal  of  them  have  faded  away  because 
the  application  no  longer  exists,  or  have  been  replaced  by  new 

technology. 

The  following  are  some  current  major  buses: 

•  DECUnibus 

•  Future  Bus 

•  IEEE  488  Bus 

•  Multibus  II 

•  Nu  Bus 

•  PC  Bus 

•  VMEBus 

•  QBus 

Disregarding  the  complexity  and  different  varieties  of  buses, 
there  are  four  basic  sections  (Figure  1)  in  the  bus  structure: 

•  Address  Section 

•  Data  Section 

•  Control  Section 

•  Utility  Section 


BUS  INTERFACE 


ADDRESS  SECTION 
DATA  SECTION 
CONTROL  SECTION 
UTILITY  SECTION 


Figure  1 .  Four  Sections  of  the  Bus  Interface 

Address  Section 

The  Address  Section  consists  of  the  address  lines  and  control 
portion.  The  address  lines  are  used  by  the  processor  to  indicate 
to  memory  and  other  peripherals  the  location  with  which  it  wants 
to  communicate.  The  address  width  varies  from  bus  to  bus.  For 


example,  VME,  Nu  and  Multibus  II  are  32  bits  wide.  DEC  Unibus, 
PC  Bus  and  Q  Bus  are  1 8, 20  and  22  bits  wide,  respectively.  The 
width  of  the  address  line  has  gradually  expanded  with  each  new 
bus  announcement.  As  the  width  expands,  the  address  decoding 
becomes  more  complex.  One  solution  to  this  complex  address 
decoding  problem  is  to  use  PAL  devices  such  as  the  PAL16L8/ 
20L8/20L10.  The  advantage  of  using  these  PAL  devices  is  to 
utilize  their  wide  input  range  (up  to  1 6/20)  to  address  up  to  64/1 M 
addresses  by  a  single  PAL  device. 

Data  Section 

The  Data  Section  consists  of  the  data  lines  and  control  portion. 
The  data  lines  carry  instruction  and  data  between  the  processor 
and  the  peripherals,  including  the  memory.  The  control  portion 
provides  support  to  the  data  lines.  In  most  cases,  data  can  be 
addressed  as  byte,  word  or  double-word.  For  instance,  PC  Bus, 
DEC  Unibus,  and  Nu  Bus  can  be  addressed  as  8, 16  and  32-bits 
wide,  respectively.  VME  and  Multibus  II  can  both  be  addressed 
as  8, 1 6, 24  or  32-bits  wide.  The  external  bus  data  communication 
requires  handshaking.  As  long  as  the  protocol  is  well  defined,  it 
can  be  implemented  easily  by  the  PAL16FS8/20R8/20X10.  If  a 
change  is  needed  after  the  circuit  is  designed,  we  can  simply 
modify  the  PAL  device  file.  In  other  words,  by  using  PAL  devices 
for  data  communication,  flexibility  can  be  achieved  with  minimum 
hardware  change. 

One  of  the  major  problems  when  dealing  with  data  transfer  is 
synchronization.  Synchronization  is  when  events  occur  concur- 
rently with  a  regular  or  predictable  timing  relationship,  typically 
based  on  a  system  clock.  The  data  transfer  control  functions  for 
this  type  of  synchronous  bus  can  be  implemented  by  PALdevices 
such  as  the  20R8.  In  asynchronous  buses,  the  data  lines  com- 
municate through  handshaking  instead  of  the  data  clock.  This 
can  be  tedious  when  using  discrete  logic.  The  asynchronous 
devices  PAL1 6RA8/20RA1 0  can  be  very  useful  for  implementing 
such  designs. 

Another  problem  is  when  both  ends  of  the  data  li  nes  have  different 
data  rates.  Inthiscase  we  need  FIFO  memories.  FIFO  memories 
match  the  data  rates  and  serve  as  temporary  buffers  between 
source  and  destination. 

Control  Section 

The  Control  Section  coordinates  the  operation  of  all  bus  interface 
circuitry.  This  is  the  heart  of  the  bus  interface  design.  The  major 
architecture  differences  can  be  observed  here.  Generally,  any 
device  that  is  capable  of  controlling  bus  operation  is  called  a  "bus 
master".  Devices  that  operate  on  the  bus  but  cannot  control  it,  i.e. 
the  memory  system,  I/O  ports,  etc.  are  called  "slaves".  Most  of 
the  buses  are  designed  to  allow  more  than  one  bus  master  or  a 
variety  of  intelligent  devices,  i.e.  a  second  CPU,  disk  controller  or 
network  controller,  on  the  bus. 


2-325 


Bus  Interface 


Functionally,  the  Control  Section  can  be  split  into  three 
subsections: 

•  Interrupt  Subsection 

•  DMA  Subsection 

•  Arbiter  Subsection 

Interrupt  Subsection 

Interrupts  are  important  to  all  bus  systems.  These  signals  are 
designed  to  allow  infrequent  or  important  peripherals  to  get  the 
attention  otthe  bus  master.  A  maskable  interrupt  can  be  ignored 
by  the  bus  master,  while  a  non-maskable  interrupt  can  not. 
Vectored  interrupt  inputs  provide  priority  to  the  interrupt  controller 
in  case  more  than  one  interrupt  request  should  occur  simultane- 
ously. The  number  and  type  of  interrupt  inputs  vary  widely  among 
buses. 

To  determine  which  device  caused  the  interrupt  in  order  to 
execute  the  correct  interrupt-handling  routine,  two  methods  can 
be  used: 

1 .  Software  Method — In  this  method,  a  polling  program  checks, 
one  by  one,  each  of  the  devices  connected  tothe  interrupt  line, 
to  determine  which  one  is  interrupting.  When  an  interrupting 
device  is  found,  the  routine  then  branches  to  the  appropriate 
interrupt  handler.  This  scheme  is  considerably  slowduetothe 
software  overhead  involved. 


Hardware  Method — This  method  is  more  efficient  than  the 
software  scheme.  The  interrupted  peripheral  will  provide  an 
interrupt  vector  which  forms  the  branching  address  to  the 
interrupt-handling  routine  at  the  time  it  occurs.  This  is  called 
automated  interrupt-vectoring  (Figure  2).  After  the  Interrupt 
Request  (IR)  and  the  vector  are  issued  by  a  peripheral,  the 
Interrupt  Handler  arbitrates,  grants  and  serves  that  interrupt. 
The  VME  and  DEC  Unibus  use  this  interrupt  technique.  The 
required  branching  address  is  loaded  in  the  address  register. 
Ordinary  registered  PALdevices  such  as  the  20R8/20X 10  to- 
gether with  the  20L8/20L1 0  are  suitable  for  this  technique.  A 
design  example  of  the  interrupt  controller  for  the  DEC  Unibus 
is  shown  on  page  2-328.  This  example  demonstrates  a  near 
4  to  1  chip  count  saving  by  using  a  PAL  device  design.  Obvi- 
ously, the  real  estate  and  debugging  time  shrink  and  the 
reliability  increases. 


In  the  event  that  several  interrupts  occur  simultaneously,  a  priority 
scheme  is  used  to  decide  which  I/O  device  should  be  granted 
control  of  the  bus.  The  most  popular  one  is  the  Fixed  Priority 
Scheme.  This  assigns  a  fixed  priority  number  to  each  device. 
This  number  will  be  there  until  the  next  configuration  change.  The 
other  scheme  is  more  complex  and  is  rarely  used,  called  the 
Programmable  Priority  Scheme.  In  this  scheme,  the  priority  can 
be  changed  dynamically  by  the  software  program.  This  scheme 
may  be  useful  in  applications  such  as  strategic  military  or  dynamic 
financial  marketing  where  the  strategy  changes  with  various 
battles  or  investment  environments;  but,  the  software  design 
may  be  very  complex.  However,  both  schemes  can  be  accom- 
plished by  using  sequencer  devices  such  as  the  PMS14R21. 
This  powerful  device  can  serve  as  a  local  distributed  processor 
within  the  Bus  Interface  Unit. 

Direct  Memory  Access  (DMA)  Subsection 

The  Direct  Memory  Access  concept  exists  because  the  ordinary 
memory  transferring  speed  often  isn't  fast  enough.  The  transfer 
speed  of  a  microprocessor  is  limited  by  the  software  overhead 
involved  in  transferring  successive  words:  several  instructions 
must  be  executed  in  order  to  transfer  a  word  or  a  chain  of  words. 
In  such  cases  where  the  ordinary  memory  transferring  speeds 
may  not  be  sufficient,  as  in  a  floppy  disk  or  a  frame  buffer,  DMA 
is  required.  The  DMA  Controller  implements  the  transfer  algo- 
rithm in  hardware  and  automates  the  word  or  block  transfers 
directly  between  the  memory  and  the  I/O  device. 

Two  techniques  are  used: 

1 .  The  processor  is  halted  or  suspended  by  the  DMA  Controller 
(DMAC).  The  DMA  dominates  the  bus  until  the  end  of  the 
transfer. 

2.  The  DMAC  steals  a  memory  cycle  from  the  processor  and 
inserts  a  data  transfer  between  the  memory  and  the  I/O 
device.  This  is  called  cycle-stealing  DMA. 

These  methods  can  also  be  used  in  a  combined  fashion. 


INTERRUPT 
HANDLER 

BUS 
REQUESTER 

INTERRUPTER  1 

INTERRUPTER  2 

IR 
IA 

ARBITRATION  BUS 


Figure  2.  Bus  Interrupt  Example 
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DMAC  design  can  involve  complex  handshake  sequences  be- 
tween the  processor  and  an  I/O  device.  It  contributes  significantly 
to  the  system's  cost  and  complexity  and  thus  it  is  not  normally 
used  in  small  systems.  However,  with  the  availability  of  PAL 
devices,  DMA  applications  are  becoming  more  popular.  The 
handshake  sequence  and  complex  discrete  logic  design  can  be 
replaced  by  asynchronous  PAL  devices  such  as  the  PAL16RA8/ 
20RA10  and  programmable  sequencers  such  as  the 
PMS14R21/A.  This  can  significantly  decrease  the  cost,  real 
estate  and  debugging  time. 

Arbiter  Subsection 

Bus  Arbiter  designs  are  becoming  more  critical  and  complex  as 
microprocessor  costs  decrease.  This  is  because  it  is  becoming 
more  cost-effective  to  design  systems  with  multiple  processors 
sharing  global  resources.  In  the  bus  interface,  the  arbiter  acts  as 
a  judge  to  decide  who  should  own  the  proprietorship  of  the  bus 
and  for  how  long.  The  arbiter  decides  which  Requester  should  be 
granted  control  of  the  Bus  when  several  requests  happen  simul- 
taneously. 

The  function  of  the  arbiter  consists  of: 

1 .  Granting  the  use  of  the  bus  to  one  of  the  bus  masters. 

2.  Scheduling  requests  from  multiple  bus  masters  for  optimum 
bus  use. 


3.  Prevent 


ng  simultaneous  use  of  the  bus  by  two  bus  masters. 


In  other  words,  a  successful  arbiter  design  can  prevent  bus 
hangup  and  increase  bus  bandwidth. 

Four  major  types  of  arbitration  are: 

1.  Prioritized  Arbiter — This  priority  scheme  provide  a  pre-as- 
signed  number  to  each  bus  master  to  arbitrate  the  execution 
order. 

2.  Round-robin  Arbiter — This  scheme  assigns  the  bus  master 
on  a  rotating  priority  basis. 

3.  Single-Level  Arbiter — This  scheme  provides  only  single  level 
arbitration.  However,  it  relies  on  a  bus  grant  daisy-chain  to 
arbitrate  the  request. 

4.  Fairness  Arbiter — This  scheme  prohibits  the  winning  module 
from  being  serviced  again  until  all  pending  requests  have 
been  serviced. 

An  example  of  using  the  PAL22V10  to  perform  the  Prioritized 
Arbiter  is  described  on  the  application  note  on  page  2-347.  The 
arbitration  process  and  PAL  device  equations  are  also  provided. 


An  example  of  an  arbiter  used  for  resolving  a  bus  contention 
between  two  bus  masters  fighting  for  control  is  shown  in 
Figure  3. 

When  both  master  1  (M,)  and  master  2  (M2)  issue  a  Bus  Request 
(BR),  the  Arbiter  will  grant  the  Bus  Grant  (BG)  to  the  one  with 
higher  priority.  Assuming  it  is  M2,  M2  will  then  drive  Bus  Busy  (B 
Busy)  active  and  release  the  BR  signal.  After  M2  finishes  its  op- 
eration, the  Arbiter  detects  that  the  BR  signal  is  still  low  and 
transfers  the  bus  control  to  M,. 


ARBITER 

REQUESTER  M1 

REQUESTER  M2 

■  BR 
BG 

B  BUSY 
B  CLR 


Figure  3.  Arbitration  Example:  Bus  Contention 

Utility  Section 

The  Utility  section  provides  support  to  each  of  the  previous 
sections.  It  links  the  entire  bus  interface  between  modules.  The 
Utility  Section  consists  of  the  System  Clock  Distribution,  Power 
Monitor  and  System  Diagnostics.  The  Power  Monitor  detects 
power  failures  and  signals  the  system  to  start  an  orderly  shut- 
down. When  power  is  then  reapplied  to  the  system,  itensuresthat 
each  module  is  initialized  accordingly.  For  example,  the  power 
failure  and  power  up  sequences  for  the  VME  bus  are  4  ms  and  200 
ms,  respectively.  This  can  help  the  processor  to  react  to  these 
external  interrupts  according  to  the  system  specification.  A 
system  without  diagnostics  is  an  incomplete  system.  System 
Diagnostics  can  be  implemented  by  a  Programmable  Sequencer 
such  as  the  PMS14R21/A  (page  3-123). 

Future  System  Expansion 

With  the  rapid  rate  of  improvement  of  technology,  the  useful  life 
of  a  new  system  is  increasingly  getting  shorter.  Therefore,  the 
future  system  expansion  consideration  can  be  a  major  design 
feature  to  system  designers  as  well  as  a  selection  guideline  to  a 
buyer.  The  key  to  system  expansion  in  a  general  computer 
system  is  the  bus  through  which  the  CPU  communicates  with  the 
present  and  future  system  components.  Design  flexibility,  high 
throughput,  multiprocessor  support,  and  low  power  consumption 
soon  become  critical  design  considerations  for  bus  interface 
techniques.  PLDs  can  be  the  solution  for  bus  interface,  not  only 
because  of  their  flexibility  in  terms  of  possible  hardware  modifica- 
tion, but  also  because  of  their  real  estate  saving,  which  in  turn 
lowers  the  manufacturing  cost  and  power  consumption. 


m 
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Functional  Description 

One  of  the  more  established  computer  families  is  the  Digital 
Equipment  Corporation's  PDP-1 1  series.  This  family  of  compu- 
ters uses  the  DEC  unibus  to  communicate  between  cards.  A 
specific  protocol  is  required  to  interface  a  card  to  the  unibus. 
This  protocol  is  described  in  the  available  DEC  literature. 

Since  the  unibus  is  an  asynchronous  bus,  much  of  the  interface 
circuitry  consists  of  combinational  logic  to  generate  specific 
signals  and  flip-flops  which  are  set  and  reset  as  flags.  This  tends 
to  use  a  lot  of  SSI  and  MSI  logic  packages.  Using  Monolithic 
Memories'  PAL  devices,  much  of  this  logic  can  be  condensed 
into  a  few  packages.  Figure  2  is  the  schematic  diagram  for  an 
interrupt  controller  to  be  used  on  the  unibus. 

Many  cards  communicate  over  the  bus  by  taking  control  of  the 
unibus  with  an  interrupt  request,  and  then  do  whatever  they 
require  before  releasing  control.  As  can  be  seen,  this  interrupt 
controller  takes  six  special  interface  ICs,  (380  and  8881  bus 


drivers  and  receivers)  eight  MSI,  SSI  ICs,  (7400, 7402  and  7474s) 
along  with  some  transistors  and  discrete  parts.  This  parts  count 
can  be  considerably  reduced  by  using  a  PAL20RA10  and  a 
PAL20L10. 

Figure  1  shows  how  the  circuit  with  the  PAL  devices  would  look. 
The  two  PAL  devices  allow  almost  all  of  the  7400, 7402  and  7474 
packages  to  be  removed.  (Almost  a  four-to-one  saving  in  chip 
count.)  In  addition,  the  preload  pin  (PRLD)  on  the  20RA10 
allows  the  flip-flops  to  be  easily  set  to  a  known  state  on  power 
up,  or  when  reinitializing.  So  the  PAL  devices  reduce  the  logic 
package  count  from  eight  chips  to  three. 

In  the  schematic  shown,  there  are  three  VLSI  devices,  three 
MSIs  and  two  SSIs.  Using  a  PAL20RA10,  it  is  possible  to  replace 
three  MSIs  and  one  SSI  device,  thereby  reducing  the  chip  count 
by  a  factor  of  two.  The  ICs  inside  the  enclosed  loop  were 
replaced. 
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TITLE  INTERRUPT  LOGIC 

PATTERN  INTRP01 

REVISION  01 

AUTHOR        DAN  KINSELLA 

COMPANY     MONOLITHIC  MEMORIES,  INC. 

DATE  11/06/87 

CHIP   INT_LOGIC  PAL20L10 

INTRAH   INTRAHEN  MCLEARAH   INTRBH   INTRBHEN  MCLEARBH  BGINBH 
BGINAH  BUSSSYNL  BUSBBSYL  STARTINTRAL  GND 

STARTINTRBL  FF3RESET  ENINTRB  INTRDONEBH  INTRDONEAH  EN8881 
SSYN  NBGINBH  FF1RESET  EN INTRA  NBGINAH  VCC 


EQUATIONS 

/NBGINAH 

/FF1RESET 

/ENINTRA 

/NBGINBH 

/SSYN 

/EN8881 

/INTRDONEAH 

/ENINTRB 

/FF3RESET 

/INTRDONEBH 


=  BGINAH 

=  MCLEARAH+ EN INTRA 
=   INTRAHEN* INTRAH 
=  BGINBH 

=  BUSSSYNL* BUSBBSYL 
=   STARTINTRAL* STARTINTRBL 
=   BUSSSYNL+ STARTINTRAL 
=   INTRBH* INTRBHEN 
=  MCLEARBH + EN I NTRB 
BUS SSYNL+ STARTINTRBL 


FF1  CLK  CONTROL 

BLOCK  A 
SET  FF1  CONTROL 

BLOCK  A 
ENABLE  INTERRUPT  A 

FF3  CLOCK  CONTROL 

BLOCK  B 
SYNCHRONIZE  FF2  & 

FF4 

INTERRUPT  BUS 

SIGNAL  INTERRUPT 
DONE 

ENABLE  INTERRUPT  B 

SET  FF3  CONTROL 

BLOCK  B 
SIGNAL  INTERRUPT 

DONE 


... 


ESCRIPTION 


; COMBINATORIAL  LOGIC  FOR  PAL20RA10   INTERRUPT  CONTROLLER 
1ST  PART  OF  THE  TWO  PAL  SOLUTION:      PAL20L10  &  PAL20RA10) 

SIMULATION  NOT  INCLUDED 
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INSIDE  PAL20RA10 


(MUST  BE  GROUNDED) 


55^ 


Figure  2. 
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PAL  Design  Specification 


Simulation  Results 


DEC  PDP-11  unibus  interrupt  controller 
Control .pds 


Santa  Clara,  CA 


Title 
Pattern 
Revision 

Author  Dan  Kinsella 

Company  Monolithic  Memories  Inc. 

Date  3/1/85 

[CHIP  INTR_CONTROL  PAL20RA10 

PL  AINTR  NC  ABGIN  FF1RESET  SSYN  BINTR  NC  FF3RESET  BBGIN 
NC  GND 

NC  OUT 4  OUT3  OUT2  0UT1  FF3  NFF4  FF4  NFF2   FF2  FF1  VCC 


/FF1 

FF1.SETF 
FF1.CLKF 

FF2 

FF2 . SETF 
FF2.CLKF 

/NFF2 

NFF2.SETF 

NFF2.CLKF 

/FF3 

FF3.SETF 
FF3 . CLKF 

FF4 

FF4 . SETF 
FF4.CLKF 

/NFF4 
NFF4 . SETF 
NFF4 . CLKF 

/0UT1 

/OUT  2 

/OUT  3 

/OUT  4 


SIMULATION 


:=  /FF1*FF2 
=  /FF1RESET 
=  /ABGIN 

:=  FF1 
=  /AINTR 

=  ABGIN*FF2*/SSVN 

:-  FF1 
=  /AINTR 

=  ABGIN*NFF2*/SSYN 

/FF3*FF4 
=  /FF3RESET 
=  /BBGIN 

FF4 

-  /B1NTR 

=  BBGIN* FF4*/SSYN 

:»  FF3 

=  /BINTR 

=  BBGIN*NFF4  */SSYN 

=  FF1+FF2 

-  FF4+FF3 
=  AINTR 

=  BINTR 


.•Master  control 
;  block  A 


rBus  Busy  Signal 


;Bus  sack  signal 


.•Master  control 
rblock  B 


;Bus  busy  signal 


;Bus  sack  signal 


;Bus  request  signal 
; block  A 

;Bus  request  signal 
;  block  B 

;Intr.  signal  for 
;bus  req.  block  A 
r Infer,  signal  for 
;bus  req.   block  B 


TRACEON  FF1RESET  FF 3 RESET  AINTR  BINTR  SSYN  ABGIN  BBGIN 
FF1  FF3  NFF2   NFF4  0UT1  0UT2  OUT3  0UT4 


SETF  /FF1RESET  /FF3RESET  AINTR  BINTR         ;Reset  all  regs 


SETF  FF1RESET  FF 3 RESET  /AINTR  /BINTR 
ABGIN  BBGIN 


SETF  /SSYN 


;Clock  FF1  and  FF3 
;  regs 

;Clock  NFF  and  NFF3 
;regs 


Page  :  1 

FF1RESET 

FF3RESET 

AINTR 

BINTR 

SSYN 

ABGIN 

BBGIN 

FF1 

FF3 

NFF2 

NFF4 

0UT1 

OUT  2 

OUT  3 

0UT4 


gg  g 

LHHH 
LHHH 
HLLL 
HLLL 
XXXL 
XHHH 
XHHH 
LLLL 
LLLL 
XLLL 
XLLL 
XHHH 
XHHH 
LHHH 
LHHH 
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A  Multibus  Arbiter  Design 
for  10  MHz  Processors 


This  application  note  describes  the  implementa- 
tion of  a  bus  arbiter  using  a  PAL16R4  to  interface 
the  10  MHz  801 86  to  the  MULTIBUS  multi-master 
environment.  The  PAL  equations  for  bus  ex- 
change were  developed  based  on  functional  and 
timing  requirements  specified  in  the  Intel 
MULTIBUS  Specification.  The  interface  shown 
addresses  the  bus  master  case  in  regards  to  bus 
exchange  only.  No  attempt  has  been  made  to 
include  the  requirements  for  bus  slave  operation, 
interrupt  handling  or  byte  operations.  This  note 
assumes  a  knowledge  of  MULTIBUS  operations 
and  signal  names. 

Overview 

The  system  environment  is  assumed  to  be  a 
collection  of  loosely  coupled  independent 
processors  which  communicate  through  a  com- 
mon memory  on  the  system  bus.  Each  processor 
in  this  system  possesses  its  own  local  resources. 

A  block  diagram  showing  the  interface  of  one  such 
processor,  an  80186,  to  the  system  bus, 
MULTIBUS,  is  shown  in  Figure  1 .  A  block  of  the 
80186's  address  space  is  allocated  to  system 
memory.  Access  into  this  portion  of  the  address 
map  generates  a  request  for  the  system  bus  via  a 
programmable  memory  chip-select  pin.  Once  a 
request  is  generated  by  the  CPU,  the  PAL16R4 
arbiter  and  the  8288  bus  controller  combine  to 
perform  a  bus  exchange  and  transfer  cycle. 

In  this  design  the  arbiter  maintains  control  of  the 
system  bus  and  therefore  must_be  forced  off  either 
by  a  higher  priority  master  (BPRN  goes  HIGH)  or  a 
Common  Bus  Request  (CBRQ). 

Functional  Requirements 

On  the  MULTIBUS,  the  bus  exchange  process 
begins  when  the  Bus  Request  (BREQ)  goes  LOW 
(parallel  priority  resolution  scheme),  or  when 
BPRO  goes  HIGH  to  succeeding  masters  (serial 


priority  resolution  scheme).  If  the  requesting 
master  is  granted  priority  (BPRN  enabled)  and  the 
bus  is  not  under  control  of  another  master  (BUSY 
disabled),  then  the  master  may  take  control  of  the 
bus  by  forcing  BUSY  LOW. 

An  optional  request  mechanism,  Common  Bus 
Request,  used  in  conjunction  with  either  parallel  or 
serial  priority  has  been  implemented  in  this  design. 

It  reduces  bus  acquisition  overhead  by  allowing 
bus  retention  across  transfer  cycles,  unless  a 
request  is  pending  (CBRQ  is  LOW).  For  example, 
in  parallel  priority  the  requesting  master  asserts 
both  BREQ  and  CBRQ  to  force  the  current  master 
off  the  bus.  Once  acquisition  is  complete  CBRQ  is 
released  for  use  by  another  master. 

Timing  Considerations 

Bus  exchange  on  MULTIBUS  is  a  synchronous 
process.  BREQ,  BPRO,  BUSY  and  CBRQ  are  all 
synchronized  with  the  trailing  edge  (HIGH-to- 
LOW  transition)  of  BCLK.  BCLK  is  a  MULTIBUS 
signal  having  a  duty  cycle  of  approximately  50% 
and  a  maximum  frequency  of  1 0  MHz.  There  is  no 
requirement  for  synchronization  between  BCLK 
and  any  other  clock  in  a  MULTIBUS  system. 
Synchronizer  circuits  must  be  used  on  signals 
that  cross  clock  boundaries,  because  it  is  possi- 
ble (as  in  this  design)  for  the  processor  clock  to  be 
asynchronous  to  BCLK. 

Bus  priority  on  MULTIBUS  resolution  takes  place 
in  one  BCLK  cycle.  If  the  serial  priority  scheme  is 
implemented,  this  requirement  will  place  an  upper 
limit  on  the  number  of  masters  allowed  for  a  given 
BCLK  rate.  The  parallel  priority  scheme  allows 
the  relationship  between  BCLK  and  the  number  of 
masters  to  be  independent.  Whichever  is  chosen, 
the  MULTIBUS  specified  minimum  setup  time  of 
22  ns  (resolution  to  clock),  plus  the  desire  to  keep 
the  BPRN  to  BPRO  piopagation  delay  to  a  mini- 
mum, points  to  the  use  of  an  A"  speed  PAL 
device  (Figure  2). 
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The  MULTIBUS  bus  exchange  timing  requires  a 
BUSY  setup  time  before  clock  of  25  ns  (see  Fig- 
ure 2).  Release  by  one  master  and  acquisi- 
tion by  another  takes  place  on  two  successive 
HIGH  to  LOW  transitions  of  BCLK. 

PAL  Design  Description 

The  PAL  design  specification  with  supporting 
timing  diagrams  are  shown  in  Figures  3  and  4.  The 


following  remarks  are  in  addition  to  the  comments 
found  on  the  PAL  design  specification. 


The  term  "BRQP'AEN 
required 


found  in  the  equation  for 
to  keep  the  arbiter  from 


"BREQ"  is 

recapturing  the  bus  when  it  is  the  releasing  master 
This  can  occur  when  a  CBRQ  forced  the 
surrender,  and  the  processor  cycle  which  follows 
the  just  completed  cycle  accesses  the  system 
memory. 
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Figure  1.  Single  Multimaster  Bus  Interface 
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Using  BREQ  to  disable  BPRO  results  in  a  40  ns 
delay  worst  case  from  clock  (tCO  +  tPD).  Taking 
into  account  the  22  ns  setup  (see  Fig.  2)  and 
clock  skew  (3  ns  max)  leaves  35  ns  of  cycle  (10 
MHz  BCLK),  if  the  serial  priority  resolution 
scheme  is  being  employed.  Lower  priority  mas- 
ters using  this  design  would  exhibit  a  25  ns  worst 
case  delay  BPRN  to  BPRO.  Use  an  external  gate 
to  implement  the  equation  "BPRO  = 
BPRN*BREQ"  if  the  serial  priority  is  required  when 
using  more  than  two  masters. 

Note  the  inversion  of  BLCK  for  use  by  the  PAL 
device.  PAL  device  registers  load  data  on  the 
LOW  to  HIGH  clock  transition  of  pin  1 . 

MULTIBUS  address  setup  before  Command  (50 
ns  min.)  is  determined  by  the  8288  bus  controller. 
The  8288  specification  guarantees  115  ns 
minimum  between  AEN  going  LOW  and 
Commands  going  active. 


MULTIBUS  address  hold  after  Command  (50  ns 
min.)  is  satisfied  by  using  the  ALE  signal  from  the 
8288  instead  of  the  80186.  The  8288  Command 
lines  go  inactive  on  the  HIGH  to  LOW  transition  of 
CLKOUT  at  the  beginning  of  T4  (30  ns  max  delay). 
The  8288  ALE  signal  goes  active  on  the  HIGH  to 
LOW  transition  of  CLKOUT  at  the  beginning  of  T1 
(1/2  clock  cycle  later  than  ALE  from  the  80186). 
The  minimum  hold  time  is  therefore  100  ns  -  35  ns 
=  65ns. 


U79A3-2; 


Figure  2.  Bus  Exchange  AC  Timing 
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TITLE  A  MULTIBUS  ARBITER 

PATTERN  AMP3-318 

REVISION  005 

AUTHOR        JOE  ENGINEER 

COMPANY     ADVANCED  MICRO  DEVICES 

DATE  11/15/87 


CHIP  MULTIB  PAL16R4 


BCLK  /RD  /WR  /SREQ  /RESET  /BPRN  NC  NC  NC  GND 

/E  /CBRQ  /BUSY  /SYNC  /HREQ  /AEN  /OEN  /BREQ  /BPRO  VCC 


EQUATIONS 

BPRO  =  /HREQ*BPRN 

BREQ  =  HREQ+AEN 

OEN    :=  /RESET* SREQ* AEN 

AEN    :=   /RESET* AEN* HREQ* RD 
+   /RESET* AEN *HREQ*WR 
+  /RESET* HREQ* BPRN* /BUSY 
+  /RESET* AEN* BPRN* /CBRQ 

HREQ    :=   /RESET* SYNC 


; HOLD  BUS  UNTIL  CURRENT 

; CYCLE  IS  COMPLETE 

; INITIAL  BUS  ACQUISITION 

; HOLD  BUS   IF  NO  OTHER  REQUEST 


SYNC    :=  /RESET* SREQ *RD 
+  /RESET* SREQ *WR 

BUSY  =  AEN 
BUSY . TRST  =  AEN 

CBRQ  =  /AEN* HREQ 
CBRQ. TRST  =  /AEN*HREQ 

;THIS  MODIFICATION  ALLOWS  THE  ARBITER  TO  RESIDE  ANYWHERE  IN 
; THE  ARBITRATION  DAISY  CHAIN 


Figure  3.  PAL16R4  Arbiter  Design  Specification 
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Multibus  to  Am9516 


Interface 


This  interface  shows  the  Am9516  connected  to  the  MULTIBUS 
(Figure  1).  This  is  accomplished  by  two  PAL  device  designs. 
The  equations  for  the  PAL  devices  are  shown  in  Figures  2  and  3. 
The  first  designated  Am9516MBC  does  the  MULTIBUS  arbitra- 
tion as  defined  by  the  MULTIBUS  specification.  Common  bus 
request  (CBRQ)  was  not  implemented  in  this  design.  Addition- 
ally this  design  holds  the  bus  as  long  as  BREQ  is  active.  If  the 
user  wishes  to  release  the  bus  after  each  transaction  the 
Am9516  should  be  programmed  for  CPU  interleave. 

The  second  PAL  device  in  this  interface  designated  9516MBC, 
converts  the  Am9516  signals  into  MULTIBUS  control  signals.  It 
also  generates  RD  and  WR  for  the  8530  so  that  flyby  transfers 


can  be  done.  When  not  doing  flyby,  this  part  of  the  PAL  device 
should  be  changed.  There  are  several  considerations  when 
using  the  SCC  with  DMA  not  addressed  here.  These  are  cov- 
ered in  a  separate  application.  This  example  will  work  for 
moderate  serial  data  rates.  To  operate  the  SCC  at  maximum 
speed,  a  local  RAM  should  be  used  as  bus  arbitration  overhead 
could  cause  problems.  The  main  purpose  here  was  to  illustrate 
the  versatility  of  PAL  devices  and  how  easy  it  is  to  interface 
apparently  incompatible  devices  to  the  MULTIBUS. 

The  two  PAL  devices  shown  are  similar  in  function  to  the  8289 
and  8288  shown  with  the  8086  CPU.  A  similar  design  could  be 
done  for  processors  such  as  the  68000  or  other  bus  masters 
such  as  the  8052  CRT  controller  which  has  its  own  DMA. 
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Figure  1 .  Am9516  to  8086  in  a  MULTIBUS0  Environment 
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TITLE         MULTIBUS  CONTROLLER  FOR  AM9516 

PATTERN  AMP3-189 

REVISION  01 

AUTHOR        JOE  BRCICH 

COMPANY     ADVANCED  MICRO  DEVICES 

DATE  11/15/87 

CHIP  MULTI1  PAL16L8 

BACK  MIO  NC  NC  /DACK  NC  NC  NC  /CEN  GND 
NC     /RD  /IORC  /DS  /MWTC  /MRDC  /IOWC  /RW  /WR  VCC 

EQUATIONS 

IORC  =  /MIO*DS*/RW*CEN 

IORC.TRST  =  BACK 

IOWC  =  /MIO*DS*RW*CEN 

IOWC.TRST  =  BACK 

MRDC  =  MIO*DS*/RW*CEN 

MRDC.TRST  =  BACK 

MWTC  =  MIO*DS*RW*CEN 

MWTC.TRST  =  BACK 

RD   =   DACK *RW* BACK 
+  IORC*/BACK 

WR  =  DACK */RW* BACK 
+  IOWC*/BACK 

DS  =  IORC 
+  IOWC 

DS.TRST  =  /BACK 

RW  =  IOWC 

RW.TRST  =  /BACK 

.•DESCRIPTION 

;This  PAL  device  converts  multibus  signals  into  9516  compatible 
;signals  and  vice-versa.   It  also  supports  the  8530  in  flyby  mode 

Figure  2.  Source  Listing  for  MULTIBUS  to  Am9516  Interface 
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Multibus  to  Am951 6  Interface 


TITLE         MULTIBUS  ARBITER  FOR  AM9516 

PATTERN  AMP3-190 

REVISION  01 

AUTHOR       JOE  BRCICH 

COMPANY     ADVANCED  MICRO  DEVICES 

DATE  11/15/87 


CHIP  MULTI2  PAL16R4 


/BCLK  /XACK  BRQ  /BSY  /BPRN  /DS  NC  /IORC  /CS  GND 
/OE       /RBEN  /TBEN  BACK  /CEN     /BREQ  /BUSY  /BPRO  /WAIT  VCC 

EQUATIONS 

TBEN  =  IORC*CS 

TBEN.TRST  =  /BACK 

RBEN  =  /IORC*CS 

RBEN.TRST   =  /BACK 

WAIT  =/XACK*BACK 

BREQ   :=  BRQ 

BPRO  =  /BRQ*BPRN 

/BACK    :=  /BUSY 

BUSY    :=   BREQ* BPRN* /BSY* /BUSY 
+   BREQ* BUSY* BPRN 
+  DS*BUSY 

CEN    :=  BACK 

.•DESCRIPTION 

;/CEN  delays  the  commands  to  meet  the  multibus  requirements  that 
,-address  and  data  be  valid  at  least  50  ns  prior  to  control  active. 
;If  we  do  not  allow  preemption;   remove  BPRN  from  the  second 
;expression  in  the  BUSY  equation  and  eliminate  the  third  expression 

Figure  3.  Source  Listing  for  MULTIBUS  to  Am9516  Interface 
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Z-Bus  and  8088/8086  Interface 


This  application  note  describes  how  replacing  two  8086  support 
chips  with  a  Z8000  support  chip  and  a  PAL16R8A  allows  the 
8086  CPU  to  interface  directly  to  the  Z-BUS.  Since  the  timing  of 
the  signals  used  is  the  same  for  the  8088  CPU,  this  circuit  will 
work  equally  well  in  those  applications. 

Interfacing  the  8086  CPU  to  the  Z-BUS  allows  8086  users  to 
take  advantage  of  the  very  powerful  Z8000  peripheral  and 
memory  support  circuits  that  are  available.  The  Z8000  periph- 
eral circuits  in  particular  offer  the  user  higher  throughput  rates, 
simpler  control  software  and  less  system  overhead  require- 
ments than  any  previous  generation  peripheral  family  for 
any  CPU. 

Design  Requirements 

The  8086  CPU  can  operate  in  two  different  modes.  In  minimum 
mode,  it  generates  all  the  bus  control  and  timing  signals  for  the 
8086  (8085,  8088)  buses  directly  on-chip.  In  maximum  mode, 
the  CPU  puts  out  status  information  early  in  each  bus  cycle  and 
relies  on  an  external  bus  controller  chip,  the  8288,  to  generate 
timing  and  control  signals. This  implementation  uses  the  CPU  in 
maximum  mode  and  replaces  the  8288  with  a  PAL1 6R8A  that 
generates  the  Z-BUS  timing  and  control  signals  from  the  status 


signals  provided  by  the  CPU.  It  also  makes  use  of  the  AmZ81 27 
clock  generator  to  allow  precise  timing  resolution  by  providing 
an  oscillator  signal  at  3  times  the  CPU  clock  frequency.  The 
AmZ8127  provides  all  the  clock  generation  functions  of  an 
8284A  as  well  as  several  additional  functions.  Either  clock  chip 
will  work  in  this  system. 

The  bus  controller  provides  the  following  functions: 


—Generates  AS,  DS,  INTACK  and  R/W  with  proper  timing  rela- 
tive to  address  and  data. 
— Provides  simultaneous  assertion  of  AS  and  DS  during  reset. 
—Automatic  insertion  of  1  wait  state  for  all  I/O  cycles. 

—Synthesizes  a  single  Z-BUS  interrupt  acknowledge  cycle  from 
the  8086  IACK  cycles. 

Figure  1  shows  the  circuit  interconnection  diagram.  The  system 
uses  a  high-speed  PAL16R8A  to  generate  AS,  DS,  R/W  and 
INTACK  of  the  Z-BUS,  RDY  for  wait  state  generation,  and  three 
internal  state  variables.  The  registers  are  clocked  with  the  15- 
MHz  OSC  signal  from  the  81 27.  The  five  input  signals  to  the  PAL 
device  are  5-MHz  CPU  Clock  Signal  (CLK),  System  Reset 
(RESET),  and  the  three  CPU  Status  States  (S0,  S,,  S2). 
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Figure  1.  Circuit  Interconnection  Diagram 
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The  CPU  indicates  the  start  of  a  bus  cycle  by  bringing  at  least  one 
of  the  status  lines  low  from  the  idle  high  state  (see  Figure  2).  This 
starts  an  internal  timing  sequence  within  the  PAL  device  which 
corresponds  closely  to  the  various  T  states  of  a  bus  cycle.  AS  is 
asserted  during  the  time  CLK  is  LOW  during  T,.  DS  is  asserted 
at  the  start  of  T3.  If  it  is  an  I/O  cycle,  then  RDY  would  be  disabled 
for  one  CLK  period  straddling  T2  and  T3  causing  the  8127  to 
request  1  wait  state  after  T3.  In  either  case,  DS  remains  as- 
serted until  after  the  first  1/3  of  T4,  which  is  identified  by  the 
status  lines  returning  to  the  idle_state  during  the  previous  cycle. 
R/W  is  generated  by  sampling  S0  and  S,  during  AS. 


It  is  in  the  realm  of  interrupts  where  the  Z8000  peripherals  shine 
over  other  peripherals.  Each  peripheral  can  identify  many  differ- 
ent exception  conditions  during  its  operation.  The  occurrence  of 
one  or  more  of  these  conditions  causes  activation  of  a  single 
interrupt  request  line.  The  peripheral  wants  the  CPU  to  respond 
with  a  single  interrupt  acknowledge  cycle,  during  which  the  pe- 
ripheral resolves  priority  and  provides  the  CPU  with  enough 
status  and  vector  information  to  allow  it  to  respond  to  the  excep- 
tion without  any  further  interrogation  of  the  peripheral.  This  al- 
lows interrupt  driven  systems  to  achieve  very  high  data 
throughput  rates. 
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Figure  2.  Memory  and  I/O  Timing  Diagram 
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The  8086  CPU  responds  to  an  interrupt  request  with  a  sequence 
of  two  interrupt  acknowledge  cycles,  and  only  in  the  second  is 
any  data  read  off  the  bus.  As  stated  before,  the  Z-BUS  peripher- 
als require  only  one  acknowledge  cycle.  The  timing  of  this  has  to 
be  such  that  there  is  enough  delay  between  AS  going  HIGH  and 
DS  going  LOW  to  allow  any  prioritizing  daisy  chains  to  settle, 
and  DS  has  to  be  wide  enough  to  allow  the  peripheral  time  to 
p  ace  vector  or  status  information  on  the  bus.  Figure  3  shows 
how  these  two  requirements  are  accomplished  by  turning  the 
two  acknowledge  cycles  into  one.  The  first  cycle  allows  only  AS 
and  the  second  asserts  only  DS  and  does  so  for  the  complete 
cycle.  This  appears  to  the  peripheral  as  one  very  long  bus  cycle 
which  is  identified  as  an  interrupt  acknowledge  cycle  by  the 
assertion  of  INTACK. 


Design  Approach 

To  implement  this  design  in  a  PAL16R8A  requires  recognizing 
the  Z-BUS  timing  characteristics  in  Figures  2  and  3.  The  major 
characteristic  to  consider  is  counting  the  phases  of  a  bus  cycle. 
Internal  state  variables  P0  and  P,  are  the  result  (see  Figure  4). 
An  additional  internal  state  variable  (l2)  is  necessary  to  count  the 
second  bus  cycle  of  an  interrupt  acknowledge  sequence.  As 
shown  in  Figure  5,  l2  in  conjunction  with  INTACK  allows  AS  to  be 
asserted  only  in  the  first  interrupt  acknowledge  cycle  and  DS 
only  in  the  second.  The  RESET  input  is  used  to  initialize  the 
internal  variables  and  assert  AS  and  DS.  Note  ajso  that  S0  and 
S,  are  included  in  the  DS  equation  to  prevent  DS  from  being 
asserted  during  a  halt  cycle. 
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Figure  3.  Interrupt  Acknowledge  Timing  Diagram 


^MonomMcM Momorles  £1 


Z-Bus  and  8088/8086  Interface 


The  fact  that  PAL  devices  are  user  programmable  allows  a  great 
deal  of  flexibility  for  the  designer.  Minor  timing  changes  are 
easily  implemented  by  simply  adding  or  changing  a  term  in  the 
logic  equations  and  reprogramming  the  device.  In  this  system, 
we  have  timing  resolution  to  67  ns.  This  same  configuration  can 
be  used  with  a  24-MHz  crystal  for  8-MHz  CPU  chips.  The  logic 
equations  would  change  because  the  OSC  period  would  be 
42  ns.  The  only  hardware  change  would  be  the  crystal. 

An  additional  PAL  device  could  also  perform  chip  select  decod- 
ing based  on  both  address  and  status  signals. 


Conclusion 

We  have  seen  how  a  properly  programmed  PAL  device  can  be 
used  to  replace  a  specialized  bus  controller  chip  and  allow  an 
8086  CPU  to  interface  directly  to  Z-BUS  peripheral(s)  and/or 
memory  systems.  This  brings  all  the  advantages  of  the  superior 
Z8000  peripheral  family  in  terms  of  both  throughput  and  ease  of 
use  to  8086  users  with  no  increase  in  chip  count  while  still 
allowing  a  wide  range  of  design  flexibilty.  The  PALASM  software 
equations  for  the  PAL16R8A  8086  to  Z-BUS  interface  chip  are 
shown  in  Figure  6. 


Pi 

Po 

PHASE 

CPU  T  STATES 

0 

0 

IDLE 

T4,Ti 

0 

1 

AS  TIME 

Ti 

1 

0 

AS  TO  DS  DELAY 

T2 

1 

1 

DS  TIME 

T3,  Tw 

03862A-106 

Figure  4. 


Is 

INTACK 

AS 

DS 

NO 

NO 

YES 

YES 

NO 

YES 

YES 

NO 

YES 

YES 

NO 

YES 

03862A-107 

Figure  5. 


£1  Monoli thiol 


2-345 


Z-Bus  and  8088/8086  Interface 


TITLE         8086  TO  Z-BUS  INTERFACE 

PATTERN  AMP3-195 

REVISION  01 

AUTHOR       NICK  ZWICK 

COMPANY     ADVANCED  MICRO  DEVICES 

DATE  11/15/87 


CHIP  ZBUS1   PALI 6R  8 

CK     RESET  CLK  /SO  /SI  /S2     NC  NC       NC  GND 

/E  /PO       /PI     /RW  /DS  /AS  /INTACK  RDY  /1 2  VCC 

EQUATIONS 


;    INTERNAL  STATE  VARIABLES 


PO 

/RESET* 

S0*/P0*/P1*/CLK 

+ 

/RESET* 

S1*/P0*/P1*/CLK 

+ 

/RESET* 

S2*/P0*/P1*/CLK 

+ 

/RESET* 

PO*/CLK 

+ 

/RESET* 

PI*  CLK*  SO 

+ 

/RESET* 

PI*  CLK*  SI 

+ 

/RESET* 

PI*  CLK*  S2 

PI 

/RESET* 

P0*/P1*  CLK 

+ 

/RESET* 

P1*/CLK 

+ 

/RESET* 

PI*  CLK*  SO 

+ 

/RESET* 

PI*   CLK*  SI 

+ 

/RESET* 

PI*  CLK*  S2 

12 

/RESET* 

INTACK*/I2*  CLK*  P0*.P1 

+ 

/RESET* 

I2*/P1 

+ 

/RESET* 

I2*/P0 

+ 

/RESET* 

I2*/CLK*  PO*  PI 

;    Z-BUS  OUTPUT  SIGNALS 


AS    : =  RESET 

+  /P0*/P1*  CLK*/I2 
+  AS*/P0*/I2*/DS 

DS    :=  RESET 

+   /INTACK*/P0*   PI*  CLK*S0 
+  /INTACK*/PO*  PI*  CLK*S1 
+     12*   SO*   SI*  S2 
+     DS*   PO*  PI 

RW   :=     AS*  S0*/S1 
+  RW*/AS 

INTACK :=  /RESET*AS*S0*S1*S2 

+  /RESET*INTACK*/I2*P1 
+  /RESET* I  2 

/RDY    :=  /RESET*   S0*/S1*   S2*/P0*  PI 
+  /RESET*/SO*  SI*  S2*/P0*  PI 

Figure  6.  Source  Listing  for  the  Z-Bus  to  8088/8086  Interface 
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VME  Bus  Control  Simplified  with  PLDs 


A  High-Performance  Bus 

The  VMEbus  is  a  high-performance  asynchronous  bus  capable 
of  supporting  multiple  16-  and  32-bit  processors.  Its  asynchro- 
nous nature  permits  easy  implementation  of  hardware  control- 
lers that  reduce  the  time  required  for  bus  control  tasks  such  as 
bus  arbitration.  By  offloading  these  tasks  to  hardware,  bus  ar- 
bitration time  is  minimized  which  consequently  maximizes  the 
overall  bus  transfer  rate. 

This  article  describes  PLD  implementations  of  two  such  control- 
lers in  a  typical  VMEbus  system:  the  bus  arbiter  and  the  inter- 
rupter that  comply  with  the  VMEbus  protocol.  These  state 
machines  used  to  require  multi-chip  designs  consisting  of  se- 
quencers, microprogrammed  control  stores,  and  other  MSI/LSI 
chips.  By  using  the  PAL22V1 0  programmable  array  logic  device, 
you  can  reduce  substantially  the  number  of  chips  needed  for 
these  state-machine  implementations. 

Functional  Modules 

A  typical  VMEbus-based  computer  system  contains  a  bus  ar- 
biter and  an  interrupt  handler  board.  A  processor/master  in- 
itiates a  data  transfer  to  a  slave  by  requesting  control  of  the  data 
bus  from  the  bus  arbiter.  Once  bus  control  has  been  granted  to  a 
master,  control  signals  to  and  from  the  master  and  slave  are 
exchanged  which  follow  predefined  protocols.  These  protocols 
guarantee  an  orderly  transfer  of  data  between  the  communicat- 
ing modules.  Interrupt  servicing  capability  is  provided  by  the 
interrupt  handler  board  (see  Figure  1). 


Steps  in  Designing  the 
Bus  Controllers 

VMEbus  protocols  outline  the  steps  to  perform  any  bus-related 
operation,  such  as  transferring  data  over  the  bus  between  two 
modules.  These  protocols  are  described  using  flow  diagrams 
with  interactions  between  the  communicating  modules  shown 
through  various  interface  signals.  For  example,  the  priority  bus 
arbiter  flow  diagram  (Figure  2)  shows  how  bus  requests  be- 
tween two  modules  using  the  same  request  line  are  resolved. 

The  flow  diagrams  are  analyzed  to  pick  out  the  functions  that 
can  be  incorporated  into  PLDs,  and  then  state  machines  are 
created  for  these  functions.  These  state  machines  can  be 
described  logically,  or  with  flowchart  symbols  such  as  rectan- 
gular blocks  (to  symbolize  events  or  control  signals)  and  deci- 
sion diamonds  (to  determine  control  program  flow).  Logic 
equations  are  then  written  for  these  state  machines. 

This  methodical  process  is  used  in  designing  the  two  PLD- 
based  VMEbus  controllers:  the  bus  arbiter  and  interrupt  handler 
controller. 

A  PLD  Bus  Arbiter 

Before  any  module  can  perform  a  data  transfer,  it  must  request 
control  of  the  data  transfer  bus  from  the  bus  arbiter.  This  design 
uses  a  priority  option  bus  arbiter  implemented  on  a  single 
PAL22V10  that  monitors  the  four  bus  request  lines  BR0-3(L) 
with  BR3(L)  assigned  the  highest  priority. 
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Figure  1.  Intermodule  Communication  Through  Am29PL141  (FPCs)  and  PLDs  in  a  VMEbus  System 
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Figure  2.  Bus  Arbitration  Flow  Diagram 
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Based  on  the  flow  diagram  (Figure-2),  the  arbiter  grants  the  bus 
to  the  requesting  module  using  the  highest  active  request  line,  in 
this  case  BR1(L).  A  bus  grant  signal  is  daisy-chained  to  all  the 
devices  using  this  request  line  to  resolve  simultaneous  bus  re- 
quests from  two  or  more  modules.  This  dictates  that  the  arbiter 
be  in  the  first  slot  of  the  VMEbus  system,  and  that  the  module 
physically  closest  to  the  arbiter  through  the  daisy-chain  will  get 
the  bus  grant. 

All  bus  requests  are  processed  in  parallel  by  the  AND-OR  array 
of  the  PLD.  This  allows  priority  arbitration  to  occur  in  a  single 
clock  cycle,  which  minimizes  arbitration  time  and  maximizes  the 
data  transfer  rate  on  the  VMEbus. 

When  the  bus  is  free,  the  module  using  the  highest  active  bus 
request  line  is  granted  the  bus.  This  is  described  with  Boolean 
logic  notation  where  registered  outputs  are  defined  with  ':=': 


IF  (/BBSY*(BR0+BR1  +BR2 
+BR3))  THEN  BEGIN 
IF  (BR3)  THEN  BG3IN  :=  1; 

IF  (/BR3*BR2)  THEN 

BG2IN  :=  1; 
IF  (/BR3*/BR2*BR1 )  THEN 

BG1IN  :=  1; 


IF  (/BR3*/BR2*/BR1*BR0) 

THEN  BGOIN  :=  1; 
END  ; 


if  bus  not  busy  and  a 
request  line  is  active 

if  BR3  is  active,  grant 
bus  to  device  on  BR3 

activate  bus  grant  daisy 
chain  2 

if  BR1  is  active  and  BR3 
and  BR2  are  not,  then 
grant  bus  to  device 
using  request  line  1 

BRO-3  and  BGOIN  to 
BG3IN  are  active  low 


The  PAL22V10  can  be  programmed  with  normal  or  inverted 
outputs  and  the  logic  software  can  support  active  high  or  low 
input  polarities. 

BG1IN(L)  is  asserted  until  the  requesting  module 
BBSY(L).  This  is  defined  in  logic  equation  form  as: 


IF  (/BBSY*BG1  IN)  THEN 
BG1IN  :=  1; 


continue  asserting 
BG1  IN  until  BBSY 
becomes  active 


Priority  Arbitration  Options 

It  may  be  necessary  for  the  arbiter  to  force  the  current  bus 
master  to  relinquish  bus  control  for  certain  conditions,  such  as  if 
a  higher  priority  operation  must  be  attended  to.  This  is  done  by 
asserting  the  bus  clear  signal  BCLR(L)  based  on  the  priority  of 
the  current  bus  master  and  the  bus  clear  conditions. 

The  PLD  arbiter  keeps  track  of  the  current  bus  master's  priority 
by  recording  which  bus  request  line  was  used  to  gain  control  of 
the  bus.  For  example,  two  output  registers  called  BUS-MAS- 
TER will  be  set  to  the  binary  value  "2"  if  the  current  bus  master 
used  BR2(L)  to  gain  control  of  the  bus. 

In  this  design,  BCLR(L)  wil  be  activated  under  two  conditions: 

1 .  If  the  BUS_MASTER  is  2, 1  or  0  and  the  active  bus  request 
line  is  3  or  2 

2.  If  BUS_MASTER  is  0,  and  any  bus  request  line  is  active. 

When  BUS_MASTER  is  3,  then  the  arbiter  will  not  honor  any 
bus  requests  until  the  bus  busy  line  BBSY(L)  is  low.  The  above 
conditions  are  expressed  logically  as: 


IF  (BBSY*(BR0+BR1  +BR2  +  BR3))  THEN 
BEGIN  IF  (BR3*(/MASTER[1:0]  =  3)  )  )  THEN 

BCLR  :=  1;  "assert  BCLR  if  MASTER  <  >  3" 
IF  (/BR3'BR2*(/MASTER[1 :0]  =  3)  )  )  THEN 

BCLR  :=  1; 
IF  (MASTER[1 :0]  =  0)  THEN 

BCLR  :=  1; 
END; 

Once  the  BCLR(L)  line  is  active,  then  it  should  be  held  until  the 
current  bus  master  releases  BBSY(L).  This  is  logically  ex- 
pressed as  BCLR  =  BCLR'BBSY,  or  in  a  high-level  syntax, 

IF  (BCLR'BBSY)  THEN 
BCLR  :=  1; 

The  BCLR(L)  signal  is  defined  such  that  uninterruptible  devices 
use  BR3(L).  Devices  that  can  be  temporarily  suspended  to  ac- 
commodate interrupts  and  higher-priority  operations  are  as- 
signed to  bus  request  line  0.  The  BCLR  conditions  will  vary  with 
your  application,  but  any  modifications  will  only  require  redefini- 
tions of  the  high-level  logic  expressions. 

The  PAL22V10  is  ideal  as  the  bus  arbiter  because  of  the  input/ 
output  signal  requirements  and  the  large  number  of  product 
terms  needed  to  logically  define  the  bus  grant  and  bus  clear 
signals. 

Processing  Interrupts 

When  real-time  or  urgent  response  is  needed  by  a  processor,  it 
generates  an  interrupt  request  signal  and  waits  for  the  interrupt 
acknowledge  signal  IACK(L)  and  IACKIN(L)  daisy-chain  signal. 
A  3-bit  value  is  then  read  from  the  VMEbus  when  the  data 
strobes  are  active;  this  value  indicates  which  interrupt  request 
line  was  acknowledged.  These  three-bits  are  decoded  to  deter- 
mine if  it  matches  the  processor's  request  level. 

If  the  interrupt  is  acknowledged,  then  the  interrupting  processor 
puts  its  status  or  ID  byte  on  the  data  bus  for  the  interrupt  hand- 
ler. This  data  is  used  by  the  interrupt  handler  as  an  interrupt 
vector.  This  processor  can  then  wait  in  a  loop  until  the  IACK(L) 
signal  from  the  interrupt  handler  is  driven  HIGH  to  signify  inter- 
rupt service  completion. 

Interrupt  Handling:  The  Interrupt 
Handler  Preprocessor  (IHP) 

To  handle  external  I/O  or  special  system  events  (e.g.,  timeout, 
overflow),  interrupts  are  supported  on  the  VMEbus  through  an 
interrupt  handler  (IH)  module. 

The  logic  complexity  of  the  IH  is  reduced  by  offloading  some  of  the 
initial  interrupt  recognition  tasks  to  a  PLD.  A  PAL22V10  can  be 
programmed  as  an  IHP  to  preprocess  interrupt  requests,  obtain 
the  bus  and  handle  data  transfers  (such  as  interrupt  acknowledge 
signals).  Control  is  passed  to  the  interrupt  handler  only  when  the 
IHP  latches  the  interrupt  vector. 

Interrupt  request  processing,  bus  acquisition,  and  the  interrupt 
vector  transfer  phase  are  all  specified  using  logic  equations 
which  are  written  using  a  high-level  Boolean  notation.  The 
PAL22V10  monitors  seven  interrupt  request  lines  and  five  other 
control  inputs,  and  generates  ten  registered  outputs  to  send 
control  signals  to  the  interrupt  handler  and  the  VMEbus  drivers. 
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Interrupt  Logic 

Al  interrupt  request  lines  are  monitored  according  to  the  follow- 
ing logic  equation: 

F  (IR1  +  IR2  +  IR3  +  IR4  +  IR5  +  IR6  +  IR7)  THEN 
BR3  :=  1  ;  this  interrupt  handler  uses 

the  BR3  request  line 

If  any  of  the  interrupt  lines  are  active,  then  BR3(L)  is  asserted. 
This  initiates  the  bus  acquisition  phase. 

The  next  step  is  to  wait  for  the  bus  grant  in  signal.  Only  when 
BGIN3(L)  is  active  will  BBSY(L)  be  active.  This  is  expressed 
logically  as: 


F  (BR3VBG3IN)  THEN  — 

BR3  :=  1 ;  —   

IF  (BR3*BG3IN  +  BBSY7SERVICE_D0NE) 

THEN  - 

BBSY  :=  1; 


[A] 
[B] 


Equation  [A]  continually  asserts  BR3(L)  as  long  as  BG3IN(L)  is 
not  active,  while  [B]  asserts  BBSY(L)  only  when  request  line 
BR3(L)  and  BG3IN(L)  are  active,  or  if  service  has  not  been 
completed  once  the  IHP  asserts  BBSY(L). 


i  the  IHP  is  the  bus  master,  it  puts  the  3-bit  interrupt-line 
owledge  value  on  the  bus  and  applies  the  data  strobes. 
When  the  DTACK(L)  signal  is  received  from  the  interrupter,  the 
IHP  then  strobes  the  status  byte  from  the  bus  into  a  register  on 
the  interrupt  handler  card.  The  IHP  informs  the  interrupt  handler 
that  a  status/ ID  byte  is  ready  and  to  commence  interrupt 
servicing. 

The  IHP  takes  only  a  single  clock  cycle  to  resolve  interrupt 
priorities  because  all  interrupt/input  signal  lines  are  processed 
in  parallel  by  the  PLD.  This  is  expressed  in  a  logic  description 
language  as  follows: 


IF  (BBSY)  THEN 

BEGIN 
(1)  IF(IR7)  THEN 

INTR[2:0]  :=  7; 

IF  (/IR7*IR6)  THEN 
INTR[2:0]  :=  6; 

IF  (/IR7*(IR6*IR5)  THEN 
INTR[2:0]  :=  5; 


IR7  was  active 

3-bit  value  acknowledging 

interrupt  line  7 

IR6  active,  IR7  inactive 

acknowledge  interrupt  line  6 

IR5  highest  priority 

interrupt  line  active 


(2)  IF  (/IR7*/IR6'/IR5*/IR4*/IR3*/IR2*IR1)  THEN 

INTR[2:0]  :=  1  ;  acknowledge  interrupt  line  1 

END: 

As  noted  above,  if  both  the  IR7(L)  and  BBSY(L)  signal  are  ac- 
tive in  (1),  then  regardless  of  the  value  of  the  other  interrupt 
lines,  the  three-bit  value  generated  will  be  "111"  or  7.  In  (2),  if 
IR1  (L)  and  BBSY(L)  are  active  and  the  other  six  control  signals 
are  inactive,  then  the  three-bit  output  value  will  be  "001"  or  1 .  In 
(2),  IR1  (L)  is  the  highest  priority  line  active. 

The  remaining  preprocessing  involves  completing  the  interrupt 
vector  byte  transfer.  This  is  logically  expressed  as: 

IF  (BBSY)  THEN 
BEGIN 

IACK  :=  1 ;  begin  interrupt  acknowledge 

(X)    IF  (DTACK)  THEN  daisy  chain;  if  device  sends 

LATCH_STATUS  :=  1;   data  transfer  acknowledge 
END;  (DTACK)  signal,  then  latch 

IF  (IACK)  THEN  the  status 

AS  :=  1 ;       assert  the  address  strobe  signal  to  inform 
the  interrupter  that  the  interrupt 
acknowledge  level  is  ready 

Once  the  8-bit  status  of  ID  byte  is  transferred  successfully  via 
the  DTACK(L)  signal  (X),  the  IHP  PAL  device  informs  the  inter- 
rupt handler  module  to  begin  the  interrupt  service  routine: 

IF  (BBSY*LATCH_STATUS  +BBSY*START_SERVICE* 
/SERVICE.DONE)  THEN 
START_SERVICE  :=  1; 

In  this  design,  the  START_SERVICE  signal  is  constantly  as- 
serted until  the  interrupt  handler  generates  a  SERVICE_DONE 
signal,  at  which  point  START_SERVICE  is  brought  LOW.  The 
above  equations  will  generate  the  timing  diagram  in  Figure  3. 
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IR6(L) 
GETBUS(H) 
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Figure  3.  Timing  Diagram 


Controller  Design  Simplified 
with  Development  Tools 

The  task  of  programming  FPCs  and  PLDs  as  VMEbus  control- 
lers is  simplified  with  the  development  tools.  The  designer  need 


only  analyze  the  bus  protocols,  convert  these  into  state  ma- 
chines and  then  write  assembly  language  programs  or  high- 
level  logic  equations  to  describe  these  state  machines.  The 
assembler  and  logic  software  will  then  process  these  programs 
and  equations  to  fit  into  the  FPC  or  PLD. 


MMen,or,esa 
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DEVICE  ARBITER  (AMPAL22V10)    "  VMEbus  Priority  Arbiter  Option 
For  this  application,  if  the  bus  is  busy,  then  bus  requests  will  be 
deferred  until  the  BBSY(L)  (bus  busy  line  active  LOU)  goes  HIGH, 
unless  the  bus  request  is  of  a  higher  priority.  In  this  case,  the 
bus  clear  signal  BCLR(L)  is  asserted  and  the  current  bus  master 
relinquishes  control.  If  the  bus  is  free  and  2  or  more  masters 
request  service,  then  the  bus  is  granted  to  the  master  with  the 
higher  priority  request  line.  " 


PIN  CLK 
/BBSY 
/BRO 
/BR1 
/BR2 
/BR3 


=  1 
=  2 
=  3 
=  4 
=  5 
■  6 


/BCLR 

/BGOIN 

/BG1IN 

/BG2IN 

/BG3IN 

MASTER[1:0] 


23 
22 
21 
20 
19 

18,17; 


BEGIN 

"if  a  bus  request  occured  and  bus  =  free,  then  activate  the  bus  grant  in 
line  corresponding  to  the  highest  priority  bus  request  line  " 

IF  (/BBSY*(BR0+BR1+BR2+BR3))  THEN 
BEGIN 

IF  (BR3)  THEN 

BG3IN  :=  1  J 
IF  (/BR3*BR2)  THEN 

BG21N  :=  1  ; 
IF  (/BR3*/BR2*BR1)  THEN 

BG1IN  :=  1  ; 
IF  (/BR3*/BR2*/BR1*BR0)  THEN 

BGOIN  :=  1  ; 
END  ; 

"  while  the  bus  grant  in  line  is  active  and  the  bus  is  still  busy, 
then  latch  the  bus  grant  signals  until  BBSY( I )  becomes  active" 


F  <8G3INVBBSY>  THEN 


"  when  requester  responds  with  BBSY  active,  then  " 

"  MASTER  is  set  to  the  line  currently  controlling" 

»  the  bus;  this  is  used  internally  for  future  11 

"  bus  priority  resolution  " 


BG3IN  :=  1  ; 
F  (BG2INVBBSY)  THEN 

BG2IN  :=  1  ; 
F  (BG1 INVBBSY)  THEN 

BG1IN  :  =  1  ; 
F  (BGO  INVBBSY)  THEN 

BGOIN  :=  1  ; 

F  (BG31N*BBSY>  THEN 

MASTER[1:0]  :  =  3  ; 
F  (BG2IN*BBSY)  THEN 

MASTERt1:0]  :  =  2  ; 
F  (BG1 IN*BBSY)  THEN 

MASTER  [1:01  :=  1  ; 
F  (BGOIN*BBSY)  THEN 

MASTER [1:0]  :=  0  ; 


F  (BBSY)  THEN      "  when  bus  busy,  then  remember  current  bus  master  line  " 
MASTER [1:01   :=  MASTER [1:0]  ; 

if  a  higher  priority  bus  request  occurs  and  the  bus  is  busy,  then  begin 
bus  resolution  (by  using  bus  clear  BCLR)  under  the  following  conditions  : 

•  if  BR3  and  present  bus  master  is  not  using  bus  line  3;  i.e.,  once  a  device 
uses  BR3,  then  no  one  can  force  it  to  relinquish  the  bus  or 

-  if  BR2  and  present  bus  master  is  not  3;  i.e.,  BR2  can  assert  bus  clear 
except  when  the  presetn  bus  master  obtained  the  bus  using  bus  line  3  or 

•  if  BR1  and  the  bus  master  obtained  the  bus  using  bus  request  line  0,  then 
assert  BCLR 

==>  if  BRO  is  activated  when  the  bus  is  busy,  then  do  not  assert  BCLR;  BRO 
devices  cannot  force  anyone  off  the  bus.  This  is  one  possible  priority 
implementation. 

ii 

IF  (BBSY*(BR0+BR1+BR2+BR3))  THEN    "if  a  bus  request  occured  and  bus  =  busy" 
BEGIN 

IF  (BR3*(/MASTER[1]*/MASTER[0]))  THEN  "if  BR3  and  present  master  is  not  " 
BEGIN 

BCLR  :=  1  ;  "  using  line  3,  then  assert  BCLR  11 

IF  (BCLR'BBSY)  THEN 
BCLR  :=  1; 

END; 


Figure  4.  PLPL  Specifications  for  the  Example  of  Figure  1 


bulk  :=  i  ;  —  ■   -  - M       - ■  ■  - 

IF  (BCLR'BBSY)  THEN 

bclr  :•  1; 

END; 

IF  (/BR3*/BR2*BR1*/MASTER[1]*/MASTERr,0])  THEN 
BEGIN 

BCLR  :=  1  ; 
IF  (BCLR'BBSY)  THEN 
BCLR  :=  1; 

END; 
END  ; 
END. 

"Test  vectors  used  for  simulation  and  testing" 
TESTVECTORS 

IN    CL<  ,  /BBSY  ,  /BRO  ,  /BR1   ,  /BR2  ,  /BR3  ; 

OUT  /BCLR  ,  /BGOIN  ,  /BG1IN  ,  /BG2IN  ,  /BG3IN  ,  MASTER [1:0]  j 

BEGIN 


I  1  1  1  1 

/  /  /  /  /  " 

c  b  b  b  b  b 

b  b  b  b  b  m  m" 

I  s  r  r  r  r 

c  g  g  g  g  s  s" 

k  y  0  1  2  3 

r  0  1  2  3  1  0" 

C  1  0  1  1  1 

X  X  X  X  X  X  X  ; 

C  0  1  1  1  1 

X  X  X  X  X  X  X  ; 

C  1  1  1  1  1 

X  X  X  X  X  X  X  ; 

C  1  1  0  0  1 

X  X  X  X  X  X  X  ; 

C  0  1  1  1  1 

X  X  X  X  X  X  X  ; 

C  0  1  1  1  0 

X  X  X  X  X  X  X  ; 

C  0  1  1  1  0 

X  X  X  X  X  X  X  ; 

C  0  1  1  1  0 

X  X  X  X  X  X  X  ; 

C  1  1  1  1  0 

X  X  X  X  X  X  X  ; 

C  1  1  1  1  0 

X  X  X  X  X  X  X  ; 

C  1  1  1  1  0 

X  X  X  X  X  X  X  ; 

C  0  1  1  1  1 

X  X  X  X  X  X  X  , 

C  0  0  1  1  1 

X  X  X  X  X  X  X  , 

C  0  0  1  1  1 

X  X  X  X  X  X  X  , 

C  0  1  0  1  1 

X  X  X  X  X  X  X  , 

C  0  1  1  0  1 

X  X  X  X  X  X  X 

C  0  1  1  1  0 

X  X  X  X  X  X  X 

END. 


Figure  4.  PLPL  Specifications  for  the  Example  of  Figure  1  (Cont'd.) 


DEVICE  1NTR_HMD_CTRLR  (AMPAL22V10)  "  VMEbus  interrupt  handler  preprocessor 

This  PAL  serves  as  an  interface  between  the  interrupt  servicing  logic  in  the 
interrupt  handler  and  the  bus  arbiter:  when  an  interrupt  is  detected,  the 
PAL  requests  the  bus  from  the  arbiter.  When  the  bus  is  granted,  the  bus 
busy  BBSY(L)  is  asserted.  The  3  bit  code  corresponding  to  the  highest 
priority  interrupt  request  is  put  on  the  bus,  and  both  IACK(L)  and  ASCL) 
are  also  asserted.  The  PAL  waits  for  DTACK(L)  LOW  and  then  latches  the 
status/ID  byte.  The  STARTSERVICE  signal  informs  the  interrupt  handler 
to  begin  the  interrupt  service  sequence  with  the  status/ID  byte  latched  in. 
Note  :  this  interrupt  handler  interface  PAL  handles  all  interrupts  using 
bus  request  line  BR3;  this  is  application-dependent  " 


"interrupt  request  lines  1  to  highest  priority  7" 


PIN 

CLK 

=  1 

/IR1 

=  2 

/1R2 

=  3 

/IR3 

=  4 

/IR4 

*  5 

/IR5 

=  6 

/IR6 

=  7 

/IR7 

■  8 

/DTACK 

=  9 

/BG3IN  =  10 
SERVICE  DONE  ■  13 


LATCHSTATUS 
/BBSY    =  22 
/IACK    =  21 
/AS       =  20 
/BR3     ■  19 
GETBUS 
INTR[2:0] 
START  SERVICE 


23 


"data  transfer  ack;  used  by  interrupter  to  indicate 
status/ID  data  byte  ready  on  bus  11 
"bus  grant  in  signal  from  arbiter" 
"signal  from  interrupt  handler  indicating 
service  complete" 
"latches  data  from  bus  for  interrupt  service  routine" 
"bus  busy" 

"interrupt  acknowledge" 
"address  strobe" 

"bus  request  signal  to  bus  arbiter" 
18       "used  internally  by  PAL  " 
15:17  "3-bit  level  code  put  on  bus  " 
14  ;    "sent  from  PAL  to  intr  handler  " 


"if  any  interrupt  request  line  is  active,  then  try  getting  the  bus.  All  the 
interrupts  handled  by  this  PAL  will  use  the  bus  request  line  BR3.  A 
request  can  occur  only  when  no  other  interrupt  is  being  serviced  (i.e., 
bus  should  not  be  busy  and  bus  grant  in  should  not  be  active)  " 


IF  (/S8SY*/BG3!N*(IR1  ♦  IR2  ♦  IR3  ♦  IR4  ♦  IR5  »  IR6  ♦  IR7))  THEN 
BEGIN 

GET_BUS  :=  1  ; 

BR3  :  =  1  ;  "subject  to  application" 

END  ; 

"if  interrupt  handler  wants  the  bus,  the  bus  grant  line  is  still  inactive, 
and  the  interrupt  request  is  still  active,  then  continue  requesting  the  bus  » 
IF  (GET_BUS*BR3*/BG3IN«(IR1  ♦  1R2  *  IR3  ♦  IR4  *  IR5  ♦  IR6  ♦  IR7>>  THEN 

BEGIN 

BR3  :=  1  ; 

GET_BUS  :=  GET_BUS  ; 
END; 

"if  bus  grant  in  line  is  active,  then  assert  the  bus  busy  signal" 
IF  (GETBUS-BG3IN  ♦  BBSY'/SERVICEDONE )  THEN 

BBSY  :=  1  ; 

"if  interrupt  handler  received  bus,  then  acknowledge  the  interrupt" 
IF  (BBSY)  THEN 
BEGIN 

IF  </START_SERVICE)  THEN 
IACK  :=  1  ; 

IF  (DTACK)  THEN  "when  the  interrupter  responds  with  DTACK(L),  then  11 

LATCHSTATUS  :=  1  j     "latch  the  status/ID  byte  into  the  interrupt  handler" 


"IR7  line  is  highest  priority" 


IF-(IR7)  THEN 

1NTRC2:0]  :=  7  j 
IF  (/IR7*IR6)  THEN 

INTR[2:0]  :=  6  J 
IF  (/IR7»/IR6*1R5>  THEN 

INTR[2:0]  :=  5  j 
IF  (/IR7*/IR6*/IR5*IR4)  THEN 

INTR[2:0]  :=  4  ; 
IF  (/IR7«/IR6*/IR5*/IR4*IR3)  THEN 

INTR [2:0]  :=  3  j 
IF  (/IR7*/IR6VIR5*/IR4«/IR3*IR2)  THEN 

INTR [2:01  :=  2  j 
IF  (/IR7*/IR6*/IR5*/IR4*/IR3*/IR2*IR1)  THEN 

INTR[2:0]  :=  1  j  "IR1  line  is  lowest  priority" 
END; 


Figure  5.  PLPL  Specifications  for  the  Example  of  Figure  1 


VME  Bus  Control  Simplified  with  PLDs 


IF  UACKVAS  ♦  IACK*/OTACK>  THEN 
INTR[2:0]   :=  INTR[2:0]  ; 


"hold  the  interrupt  acknowledge  level  until" 
"strobed  by  address  strobe  signal  or  until 
the  interrupter  acknowledges  the  3-bit 
level  with  DTACK  " 


"if  the  IACK  line  is  active  and  interrupt  servicing  has  not  yet  begun 
activate  the  address  strobe  line  AS  " 
IF  <IACK*/START_SERVICE)  THEN 
AS  :=  1  ; 

"send  the  start  service  signal  until  service  is  done" 
IF  < BBSY*LATCH_STATUS  +  BBSY*START_SERVICE*/SERVI CEDONE)  THEN 
START  SERVICE  :=  1  ; 


END. 


TESTVECTORS 
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Figure  5.  PLPL  Specifications  for  the  Example  of  Figure  1  (Cont'd.) 
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Introduction  to  Communication 
Systems 

Modern  communications  has  developed  from  two  major  sources: 
telephones  and  computers.  Within  telephony,  what  was  once 
straightforward  engineering  in  the  analog  telephone  system  has 
grown  over  1 00  years  into  the  sophisticated  discipline  of  telecom. 
Within  computers,  communication  needs  have  expanded  rapidly 
over  the  last  20  years  from  simple  asynchronous  terminals 
clustered  around  a  host  to  also  include  networking  of  PCs,  work- 
stations and  minicomputers  across  rooms  and  continents.  Both 
areas  can  be  expected  to  merge  into  complex  information  net- 
works which  handle  all  aspects  of  communications  over  the  next 
1 0-20  years.  A  graphic  portrayal  of  a  hypothetical  future  system 
is  shown  in  Figure  1. 

Historically,  the  system  architecture,  engineering  and  compo- 
nents of  these  two  converging  areas  have  been  dissimilar.  As 


telephony  systems  become  digital-based  and  computer  systems 
use  more  of  the  telephony  system  for  interaction,  this  dissimilarity 
is  eroding.  In  particular,  digital  design  techniques  developed  in 
computer  engineering  are  used  increasingly  in  telecom.  The 
original  application  was  in  the  trunk  links  between  exchanges 
where  digital  time  division  multiplexing  allowed  a  much  higher 
density  of  circuits  over  a  given  pair  of  wires.  This  spread  into  the 
switch,  line  card  and  other  areas  of  the  exchange  until  only  the 
subscriber  line  to  the  exchange  remains  largely  analog.  Develop- 
ment of  the  Integrated  Services  Digital  Network  (ISDN)  interna- 
tional standard  will  make  this  digital  as  well,  enabling  data  and 
video  as  well  as  voice  to  be  handled  in  a  uniform  and  flexible 
manner.  Future  telecom  engineering  will  therefore  have  much  in 
common  with  the  digital  design  techniques  used  in  computers 
and  the  distinction  between  them  will  become  minimal. 
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Figure  1.    Communications  in  the  Nineties 
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Along  with  the  strong  trend  in  equipment  design  towards  digital 
systems,  there  is  a  parallel  move  towards  a  more  global  market 
for  equipment.  The  development  costs  for  major  pieces  of  tele- 
com systems,  such  as  central  office  switches,  require  a  large 
market  in  order  to  amortize  the  costs.  This  implies  a  requirement 
for  standards  beyond  national  boundaries,  permitting  sales  into 
foreign  markets  with  minimum  engineering  changes.  This  in- 
creased market  size  on  top  of  increased  complexity  of  services  is 
realizable  only  through  widespread  standardization.  It  also 
makes  such  a  universal  system  more  complex  and  therefore 
more  elusive. 

The  computer  area  has  evolved  around  a  number  of  proprietary 
computer  families  whose  architectures  and  operating  systems 
we  e  incompatible  to  a  large  degree.  The  need  to  interconnect 
mu  tiple  computer  systems  with  high-speed  links  led  to  a  polari- 
zation around  proprietary  standards  as  local  area  network  (LAN) 
interconnect  standards  were  evolved  by  each  major  manufac- 
turer. Recent  moves  towards  decentralized  computing,  based  on 
personal  workstations  of  varying  power,  has  crystallized  users 
around  a  few  LAN  standards,  but  the  dream  of  universal  LAN 
interconnectivity  among  equipment  is  still  far  off. 

Communications  therefore  requires  widespread  standards  for 
any  meaningful  inter-vendor  or  international  operation.  Teleph- 
ony is  the  more  advanced  in  this  area.  Standards  efforts  by  the 
Bell  system  in  the  US  and  the  local  PTT  (the  governmental  Post, 
Telephone  and  Telegraphy  authority)  in  other  industrial  countries 
has  ensured  that  any  equipment  which  connects  to  the  public 
telephone  network  operates  fully  with  any  other  piece  of  equip- 
ment. 

The  main  international  body  which  regulates  that  is  the  CCITT 
(Consultative  Committee  for  Telephony  and  Telegraphy)  through 
a  series  of  "recommendations",  which  are  effectively  standards. 
In  the  USA,  before  deregulation,  AT&T  published  technical  stan- 
dards or  "Bell  Pubs"  which  specified  the  operating  characteristics 
for  equipment  in  a  manner  similar  to  and  sometimes  based  on  the 
CCITT  recommendations.  Since  deregulation,  AT&T  continues 
to  issue  such  documents  as  it  pertains  to  their  equipment  and  the 
RBOCs  (Regional  Bell  Operating  Companies)  now  issue  their 
own  either  directly  or  through  their  joint  research  organization, 
BellCoRe  (Bell  Communications  Research). 


Despite  strenuous  efforts  among  governing  bodies,  problems  of 
compatibility  have  existed.  Deregulation,  the  increase  in  the 
number  of  suppliers,  international  traffic  and  equipment  sales, 
and  most  of  all,  the  massive  increase  in  equipment  complexity 
have  led  to  a  growing  demand  for  flexibility  within  telecom 
systems  to  handle  compatibility  issues.  Fully  compatible  systems 
on  a  global  scale  remain  an  unrealistic  goal  before  the  billenium. 
Most  systems  are  variants  on  a  few  widespread  standards.  The 
small  deviations  among  manufacturers  or  countries  can  fre- 
quently be  handled  by  a  minimal  amount  of  flexibility  in  the 
system  design,  such  as  that  provided  by  PLDs. 

The  situation  on  the  computing  side  remains  even  more  chaotic. 
Other  than  IBM,  no  vendor  has  managed  to  establish  a  wide- 
spread standard  based  on  their  own  proprietary  network.  All 
others  have  turned  to  variants  of  the  newer  LAN  standards,  such 
as  Ethernet,  but  even  here,  compatibility  at  the  hardware  levels 
does  not  necessarily  mean  compatibility  at  the  upper  or  software 
levels. 


Features  of  Programmable  Logic 
in  Communications 

Programmable  logic  has  already  demonstrated  its  usefulness  in 
computer  systems  in  a  variety  of  roles,  including  address  decod- 
ing, state  machines  and  "garbage  collection".  All  these  applica- 
tions exist  in  communications,  but  others  particular  to  this  market 
also  exist. 

The  problems  of  interconnecting  dissimilar  systems  will  remain  in 
communications  for  a  number  of  years.  Equipment  manufactur- 
ers are  therefore  faced  with  a  dilemma  of  how  to  combine  the 
economies  of  scale  associated  with  volume  production  with  the 
need  to  interface  with  multiple  vendor  equipment,  some  of  which 
deviates  from  the  standards.  PLDs  offer  a  unique  method  of 
customizing  logic  to  conform  to  deviations  while  maintaining 
identical  hardware  layout  and  parts. 

Power  and  space  requirements  are  considerations  in  most 
computer  systems,  but  normally  are  relegated  to  being  secon- 
dary issues  when  compared  to  performance.  The  opposite  is  true 
oftelephony,  where  these  factors  assume  paramount  importance 
over  speed.  In  order  to  satisfy  requirements  from  both  areas, 
components  must  therefore  be  available  in  both  high-speed  and 
low-power  versions. 

In  telephony,  certain  high-volume  functions  such  as  line  card 
interfaces  justify  VLSI  devices.  But  a  large  number  of  other  lower- 
volume  functions  differ  among  equipment  vendors  and  applica- 
tions. Such  design  problems  cannot  be  easily  solved  with  dedi- 
cated VLSI  components.  With  the  introduction  of  CMOS  PLDs,  an 
opportunity  now  exists  to  combine  higher  integration  with  low 
power  and  the  ability  to  modify  designs  without  hardware  compo- 
nent or  board  alteration. 

In  computers,  the  proliferation  of  decentralized  computing  power, 
particularly  in  the  form  of  PCs  and  workstations,  has  brought  the 
power  issue  into  focus  as  engineers  attempt  to  minimize  power 
consumption  and  the  associated  heat  generation.  While  minis 
and  mainframes  continue  to  strive  for  increased  speed,  the  bulk 
of  applications  now  have  a  need  for  both  lower-power  compo- 
nents and  a  flexibility  which  permits  each  unit  to  communicate 
with  as  many  of  the  office  automation  interconnection  schemes 
as  possible.  Again,  the  advent  of  CMOS  PLDs  offers  a  solution  to 
the  system  design  problems  associated  with  this. 

Basic  Telephony 

Basic  telephony  is  loosely  defined  as  the  classic  telephone 
system.  It  includes  such  major  components  as  the  voice  tele- 
phone with  twisted  pair  interface  to  the  exchange,  as  well  as  the 
Private  Branch  Exchange  (PBX),  the  Central  Office  (CO)  and  the 
trunk  lines  which  interconnect  them.  The  voice  telephone  has  little 
scope  for  PLDs  due  to  high  volume  and  extremely  low  cost 
requirements.  From  an  architecture  point  of  view,  both  the  PBX 
and  the  CO  can  be  considered  similar  and  divided  into  several 
main  components: 

•  The  line  cards  which  interface  to  subscriber  lines 

•  The  central  switching  facility  which  routes  calls  through  the 
exchange 

•  The  trunk  interface  which  connects  to  other  exchanges 
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Line  cards  are  the  most  cost-  and  power-sensitive  components  of 
the  basic  telephony  system.  They  are  usually  fabricated  in 
volume  and  incorporate  VLSI  devices  available  from  several 
semiconductor  houses.  The  scope  for  PLDs  is  limited,  due  to 
these  constraints  and  the  limited  customization  needs  of  the 
simpler  functions  of  the  voice  telephone. 

The  central  switch  in  modern  PBX  and  CO  exchanges  is  a  digital 
memory-based  system  controlled  by  a  minicomputer.  As  such,  it 
contains  many  familiar  opportunities  for  PLD  applications  from 
the  computer  industry,  but  a  limited  number  that  are  distinctly 
communication-oriented. 

The  trunk  interface  is  an  area  that  received  little  attention  until 
recently  as  the  interface  was  buried  in  the  heart  of  the  telephony 
system.  Its  moderate  unit  volume  attracted  little  interest  in  dedi- 
cated silicon  development.  Deregulation  of  the  Bell  system  inthe 
US  has  resulted  in  a  proliferation  of  equipment  complying  with  the 
widespread  T1  standard  developed  by  AT&T  for  Multiplexers  and 
other  Channel  Service  Units  (CSUs)  which  use  trunk  links. 

From  its  beginnings  as  a  means  of  allowing  several  voice  circuits 
to  share  the  same  cable,  T1  has  developed  in  application  to  the 
stage  where  it  is  being  used  in  many  cases  where  up  to  24  x  64 
Kb/sec  channels  are  required  in  a  point-to-point  configuration 
both  within  and  between  isolated  customer  sites.  This  application 
allows  data  and  video,  as  well  as  the  original  voice  connections, 
to  be  multiplexed  over  existing  standardized  links  provided  by 
both  AT&T  and  the  RBOCs.  It  is  one  of  the  fastest  growing  areas 


of  modern  telecom  and  is  spawning  related  equipment  such  as  T1 
switching  nodes  and  T1  testers. 

The  basic  interface  to  the  T1  connection  can  be  engineered  in  a 
number  of  ways  but  basically  consists  of  a  transmit  and  receive 
section,  as  shown  in  Figure  3. 

The  transmit  section  of  a  T1  mux  must  buffer  and  rate-adapt  the 
bit  stream  of  incoming  channels  which  are  to  be  multiplexed, 
perform  a  24-to-1  multiplex  function  and  then  encode  the  result- 
ing bits  with  appropriate  framing  onto  the  line  using  a  bipolar  line 
driver.  The  receive  section  of  a  mux  contains  a  corresponding  line 
receiver  and  decoder.  The  recovered  NRZI  data  is  then  fed  into 
a  deframer  which  detects  frame  synchronization  and  allows 
channel  demultiplexing  and  buffering. 

Several  of  these  functions  can  be  efficiently  implemented  using 
programmable  logic.  The  encoder  and  decoder  for  both  the  North 
American  T1  B8ZS  and  the  European  HDB3  coding  schemes  can 
be  implemented  in  such  devices  (pages  2-362  and  2-384,  respec- 
tively). The  deframing  problem  for  the  most  common  current  T1 
framing  scheme  (D4  Superframes)  can  also  be  solved  with  a  few 
PAL  devices  (page  2-404).  In  the  newer  T1  Extended  Super- 
Frame  (ESF)  standard,  an  error  detection  scheme  is  superim- 
posed on  the  framing  bit  pattern  using  a  CRC-6  scheme.  The 
CRC  generatorfor  this  fits  nicely  within  a  single  PALdevice  (page 
2-431). 

Two  applications  are  also  shown  for  the  LCA  (Logic  Cell  Array) 
device.  Page  2-435  shows  a  32:1  mux  for  a  T1  link,  and  page 
2-444  shows  a  format  converter  for  a  PBX. 
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Data  Communications 

Unlike  telephony,  data  communications  is  a  relatively  new  disci- 
pline and  equipment  designed  to  interconnect  data  processing 
equipment  evolved  in  parallel  with  mainframe  computers.  Such 
equipmenttendsto  be  high-speed  and  less  concerned  with  power 
requirements  than  telephony  equipment.  Its  main  function  is  to 
transmit  binary  data  in  a  reliable  manner  across  a  room  or  across 
a  continent  between  computers  compatible  enough  to  exchange 
data.  A  wide  variety  of  methods  exist  to  do  this  at  speeds  ranging 
from  300  bits  per  second  to  100,000,000  bits  per  second.  As 
distance  increases,  the  speed  options  available  become  limited. 
Generally,  the  options  can  be  considered  in  a  number  of  catego- 
ries, in  order  of  increasing  speed,  cost  and  sophistication: 

•  Asynchronous  terminals  and  connections  using  existing 
analog  phone  lines 

•  Synchronous  terminals  and  dedicated  data  links 

•  High-speed  point-to-point  links,  including  microwave  and 
satellite 

•  Multipoint,  medium-speed  local  area  networks 

•  Multipoint  backbone,  high-speed  local  area  networks 

The  slowest  and  most  widely  available  category  includes  most 
"dumb"  terminals,  which  are  normally  linked  to  a  host  system 
either  directly  via  an  RS-232C/V.24  serial  link  or  over  a  standard 
telephone  line  via  a  voiceband  modem.  Maximum  data  rates 
available  are  9600  bits/sec  and  the  typical  modem  rates  are 
currently  300  and  1 200  baud.  Devices  used  in  building  eithertype 
of  link  are  typically  LSI  low-cost  NMOS  devices  with  most  new 
devices  being  VLSI  CMOS.  Where  PALdevices  offer  the  greatest 
advantage  is  in  higher-performance  applications  for  which  VLSI 
solutions  do  not  yet  exist. 

There  are,  however,  a  number  of  additional  control  functions 
required  around  standard  UART/USARTs  which  can  be  optimally 
implemented  in  a  single  PAL  device,  such  as  providing  address 
latching  and  an  asynchronous  divider  around  a  Motorola  6850 
Asynchronous  Communications  Interface  Adapter  (page  2-453). 

In  the  case  of  modems,  however,  several  highly  sophisticated 
techniques  beyond  that  found  in  the  standard  modem  "box"  can 
be  used  to  improve  throughput  on  the  link.  One  technique  used 
to  increase  the  bit  rate  across  the  link  both  in  the  slow,  voiceband 
modems  and  in  the  higher-speed  satellite  modems  is  Quadrature 
Amplitude  Modulation  (QAM).  Another  technique  uses  a  convo- 
lutional  encoding  scheme  to  increase  the  reliability  of  data  trans- 
mitted across  a  noisy  link.  This  cuts  down  the  requirement  for 
retransmission  of  lost  data  and  effectively  increases  the  through- 
put of  the  link. 

Both  schemes  can  be  implemented  efficiently  in  PAL  devices  as 
described  in  the  application  notes  on  pages  2-456  and  2-463. 


Synchronous  terminal  links  usually  comply  with  one  of  a  number 
of  standards  for  which  low-cost  LSI  devices  exist.  These  provide 
for  data  rates  above  that  of  async  links  and  generally  can  perform 
up  to  1  Mbit/sec  and  beyond,  but  only  over  a  limited  distance,  such 
as  within  a  building.  Data  rates  above  2400  baud  over  long 
distances  are  currently  expensive  and  limited  in  popularity. 

One  scheme  which  is  expected  to  overcome  this  and  gain  wide 
application  over  the  coming  years  is  the  provision  of  64Kbit/sec 
data  across  the  new  digital  telephone  system  which  will  be 
available  on  the  international  Integrated  Services  Digital  Network 
(ISDN)  "S"  and  "U"  interfaces  to  the  phone  line.  This  is  a  standard 
to  which  virtually  every  telephony  authority  and  a  large  number  of 
major  computer  manufacturers  have  given  their  support.  Among 
the  difficulties  which  will  be  encountered  is  the  problem  of 
adapting  existing  terminal  devices  to  communicate  with  the  wave 
of  VLSI  CMOS  devices  which  are  being  released  for  use  on  the 
ISDN  "S"  and  "U"  interface  link. 

The  "S"  interface  is  being  addressed  by  a  number  of  VLSI 
solutions.  The  recently  standardized  "U"  interface  has  no  such 
devices  and  has  an  additional  encoding/decoding  requirement 
similar  to  trunk  lines,  due  to  the  longer  distances  involved  to  the 
central  office.  A 4B3T  coding  scheme  is  used,  which  can  be  im- 
plemented using  PLDs,  as  shown  in  the  application  note  on  page 
2-475. 

Further  applications  in  data  communications  can  be  found  in 
LANs.  While  several  VLSI  chip  sets  exist  for  standards  such  as 
Ethernet,  a  number  of  proprietary  systems  are  in  use  and  their 
numbers  are  increasing  steadily.  Some  of  the  more  basic  func- 
tions required  in  the  LAN  interface,  such  as  the  Manchester 
encoding  function  of  Ethernet,  can  be  provided  in  a  single  PAL 
device  (page  2-499). 

Conclusion 

The  opportunities  for  the  use  of  PLDs  in  communications  is 
widespread  and  is  becoming  increasingly  important  forthefollow- 
ing  reasons: 

•  Power  and  space  limitations. 

•  Increasing  adherence  to  multiple  standards,  yet  the  need 
for... 

•  Standardized  hardware  to  permit  low-cost 
volume  production. 

•  Need  to  reconfigure  equipment  in  the  field 
at  the  minimum  overhead. 

Where  speed  is  an  issue,  the  broad  range  of  fast  bipolar  PAL 
devices  provides  a  range  of  solutions.  But  in  the  more  recent  trend 
towards  low-power  circuitry,  the  need  for  which  is  most  apparent 
in  equipment  such  as  in  fanless  or  portable  personal  computers, 
The  family  of  low-power  CMOS  devices  provides  the  same  func- 
tionality at  a  fraction  of  the  power  budget. 
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Bipolar  with  eight  zero  substitution  (B8ZS)  coding  provides 
ear  channel  capability,  which  is  required  in  an  Integrated 
Service  Digital  Network  (ISDN)  or  an  open  network.  A  (B8ZS) 
encoder  and  decoder  can  be  implemented  in  CMOS  PAL 
devices.  The  encoder  and  decoder  convert  between  non- 
return-to-zero  (NRZ)  data  and  B8ZS  while  providing  a  flexible 
interface  to  framing  chips. 


Principles  of  Coding 

Communication  lines  require  a  data  stream  with  imbedded 
clocking  information.  T1  digital  communication  lines  transmit 
Alternate  Mark  Inversion  (AMI)  coded  voice  and  data.  For  AMI, 
logic  "1's"  are  encoded  as  alternating  positive  and  negative 
pulses,  while  logic  "O's"  are  encoded  as  zero  voltage.  This 
coding  scheme  ensures  that  consecutive  pulses  alternate  in 
polarity.  The  pulse  edges  provide  the  clocking.  A  pair  of 
consecutive  pulses  with  the  same  polarity  is  called  a  bipolar 
violation. 

Since  a  string  of  zeros  in  AMI  coding  does  not  contain  any 
transitions,  a  clock  cannot  be  derived.  If  this  string  of  O's  is 
too  long,  the  system  will  lose  synchronization.  Therefore,  the 
AT&T  T1  standard  limits  the  maximum  number  of  consecutive 
O's  to  fifteen.  For  voice  transmission  this  restriction  is 
acceptable;  however,  for  data  it  is  not.  A  variation  of  AMI  could 
provide  clocking  information  while  not  restricting  user  data. 

B8ZS  coding  is  such  an  improvement  on  the  AMI  code;  logic 
"1's"  are  encoded  as  alternating  positive  and  negative  pulses, 
while  logic  "O's"  are  encoded  as  zero  voltage.  Additionally, 
B8ZS  substitutes  a  string  of  eight  O's  with  a  code  word  as 
shown  in  Table  1 .  The  polarity  of  the  preceding  pulse 
determines  which  B8ZS  code  word  is  substituted.  The  B8ZS 
code  words  contain  bipolar  violations  in  the  fourth  and  seventh 
bit  positions. 

S  nee  B8ZS  coding  allows  strings  of  O's  for  any  length,  it 
supplies  clear  channel  capability.  Clear  channel  capability  is 
required  for  an  open  network  and  ISDN.  With  B8ZS  coding,  the 
maximum  length  of  consecutive  zeros  is  seven  bits. 


PRECEDING  PULSE 

B8ZS  CODE  WORDS 

+ 

000+-0-+ 

000-+0+- 

Table  1.  B8ZS  Coding 


B8ZS  coding  is  used  in  the  following  applications: 

•  T1  multiplexers 

•  T1  repeaters 

•  T1  channel  banks 

•  T1  network  equipment 

•  Video  CODECS 

Functional  Description 

Implementing  B8ZS  coding  in  such  equipment  requires 
converting  between  NRZ  and  B8ZS  format.  In  this  design, 
CMOS  PAL  devices  encode  and  decode  data  while  providing 
flexibility.  Current  single  chip  solutions  do  not  interface  to  all 
framing  devices.  Using  the  PAL  devices,  B8ZS  line  coding  can 
be  added  to  any  framing  technique.  This  design  performs  two 
operations:  encoding  and  decoding.  The  encoding  operation 
converts  NRZ  data  into  positive  and  negative  pulses  with  B8ZS 
code  words.  Decoding  converts  the  positive  and  negative 
pulses  into  NRZ  data  and  substitutes  eight  O's  for  the  B8ZS 
code  words  shown  in  Table  1 . 


B8ZS  Encoder 

The  B8ZS  encoder  converts  NRZ  data  to  B8ZS  data.  Each 
string  of  eight  consecutive  O's  is  replaced  with  a  B8ZS  code 
word. 

Two  PAL  devices  can  be  configurated  to  encode  the  data 
stream.  Figure  1  shows  the  PAL  devices,  designated,  PAL_A 
and  PAL_B.  Signals  on  Figure  1  are  explained  in  Table  2. 
PA  LA,  a  PALC16R8Z,  implements  the  3-bit  counter,  which 
searches  for  strings  of  eight  O's.  An  8-bit  string  of  O's  is 
detected  by  C2-C0  and  NRZJN.  PAL_A  also  incorporates  a 
5-stage  shift  register,  which  delays  the  NRZJN  data  stream  to 
ensure  proper  alignment  with  the  counter's  output.  The 
delayed  data,  NRZ_DELAY,  is  encoded  by  PAL  B. 


PIN 

DESCRIPTION 

RST 

Active  low  master  reset 

NRZ_CK 

External  NRZ  clock 

NRZ_IN 

NRZ  input  data  stream 

C(2-0) 

Counter  output 

NRZ_DELAY 

Delay  NRZ  data  stream 

CS 

Current  state 

PPO 

Positive  pulse  output 

NPO 

Negative  pulse  output 

Table  2.  B8ZS  Encoder  Pin  Description 
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NR2_CK  - 


PAL  A 
PALC16R8Z 

C2 
C1 
CO 

R3 
R2 
R1 

no 


PALB 
PALC1 6R6Z 


Q2 
01 

QO 


xcs 

-PPO 


Figure  1.  B8ZS  Encoder 


PAL_B,  a  PALC1 6R6Z,  encodes  the  data  using  two  state 
machines;  one  generates  the  B8ZS  sequence,  and  the  other 
generates  the  positive  and  negative  pulse  outputs.  Figure  2a 
shows  the  B8ZS  Sequence  State  Machine,  which  generates 
the  B8ZS  sequence.  If  an  8-bit  string  of  O's  is  detected,  the 
B8ZS  Sequence  State  Machine  forces  the  B8ZS  code  word  to 


C2tC1+C0*NRZ_IN 


be  encoded.  Otherwise,  B8ZS  Encoder  State  Machine 
performs  normal  AMI-type  encoding.  The  B8ZS  Encoder  State 
Machine  encodes  NRZ_DELAY  into  positive  and  negative 
pulses  based  on  the  B8ZS  Sequence  states,  Q2-Q0  (see 
Figure  2b).  The  positive  and  negative  pulses  (PPO  and  NPO) 
occur  six  clock  cycles  after  the  input  NRZJN. 


ZERO  J 


Figure  2a.  B8ZS  Sequence  State  Machine 

Q2'QT'Q0*NRZ_DELAY  ♦  02-01*00 

S2"CTi*ei5»NRZ_DELAY  +  Q1'Q0 


01 'QO 


QTQO 


Q2*O1*Q0*NRZ_OELAY  +  02*01  *Q0 

Figure  2b.  B8ZS  Encoding  State  Machine 


£1 
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B8ZS  Decoder 

Decoding  is  performed  by  a  PALC16R6Z  and  a  shift  register 
(see  Figure  3).  The  PAL  device  searches  for  incoming  B8ZS 
code  words.  When  a  B8ZS  code  word  is  detected,  the  PAL 
device  substitutes  an  8-bit  string  of  O's.  The  output,  NRZ_OUT, 
is  delayed  eight  clock  cycles  from  the  positive  and  negative 
pulse  inputs  (PPI  and  NPI).  The  device  flags  the  B8ZS  code 
words  and  any  bipolar  violations.  If  two  bipolar  violations  occur 
in  a  non-B8ZS  8-bit  data  string,  the  device  will  only  flag  a  single 
bipolar  violation.  The  B8ZS  Decoding  State  Machine  shown  in 
Figure  4  searches  for  the  two  B8ZS  code  words.  The  state 
machine  has  two  groups  of  states;  SN(0-7)  searches  for 
000-+0+-  and  SP(0-7)  searches  for  000+-0-+.  The  decoder's 
state  machine  shows  reduced  equations  from  the  B8ZS  and 
BPV  (Bipolar  Violation)  outputs.  Note:  PPI  and  NPI  are  never 
both  HIGH. 


PIN 

DESCRIPTION 

RST 

Active  low  master  reset 

T1_CK 

External  T1  clock 

PPI 

Positive  pulse  input 

NPI 

Negative  pulse  input 

DATA 

NRZ  data  stream 

NRZ_OUT 

NRZ  output  data  stream 

B8ZS 

B8ZS  flag 

BPV 

BPV  (Bipolar  Violation)  flag 

Table  3.  B8ZS  Decoder  Pin  Description 


RST- 
PPI  - 
NPI  - 


HC166 
A  C  D  E  F  G  H 


CLR 
QH 
CK1 


Figure  3,  B8ZS  Decoder 
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TITLE  PAL_A 

PATTERN  B8ZS  ENCODER  PAL  A 

REVISION  1.02 

AUTHOR  THERESA  SHAFER 

COMPANY  MM I 

DATE  6/6/87 

CHIP  PAL  A  PAL16R8 


;  PINS 
IV  2 
NRZ_CK  /RST 
;ll  12 
/OE  CO 


3  4 
NRZ_IN  NC 
13  14 
CI  C2 


5 
NC 
15 
NRZ 


6 

NC 
16 

DELAY  R3 


7 

NC 
17 
R2 


8 

NC 
18 
Rl 


9 

NC 
19 
RO 


GND 
20 

vcc 


/C2 


INPUTS : 


OUTPUTS : 


EQUATIONS 


NRZ_CK 
/OE 
/RST 
NRZ  IN 


NRZ_DELAY 
C2  -  CO 


EXTERNAL  CLOCK 

ACTIVE  LOW  OUTPUT  ENABLE  SIGNAL 
ACTIVE  LOW  MASTER  RESET  SIGNAL 

SERIAL  NRZ  DATA  STREAM  WHICH  IS  ENCODED 
AND  TRANSMITTED 

DELAY  NRZ  DATA  WHICH  IS  INPUT  FOR  PAL  B 
COUNTER  OUTPUTS   (111)  INDICATES 
AN  8-BIT  STREAM  OF  ALL  ZEROS 


3 -BIT  COUNTER  WITH  SYNCHRONOUS  RESET  AND  ENABLE 


:=  C2  *  CI  * 
+  /C2  *  /CI 
+  /C2  *  /CO 
+  RST 
+  NRZ  IN 


CO 


MASTER  RESET 

CLEAR  COUNTER  WHEN  NRZ  IN 


=  1 


/CI  :=  /CI  *  /CO 
+  CI  *  CO 

+  RST  ;  MASTER  RESET 

+  NRZ_IN  ;  CLEAR  COUNTER  WHEN  NRZ_IN  =  1 

:=  CO 

+  RST  ;  MASTER  RESET 

+  NRZ_IN  ;  CLEAR  COUNTER  WHEN  NRZ_IN  =  1 

5 -STAGE  PIPELINE  DELAY 
/NRZ_DELAY  :=  /R3  +  RST 
:=  /R2  +  RST 
:=  /Rl  +  RST 
:=  /RO  +  RST 
:=  /NRZ  IN  +  RST 


B8ZS  Coding 


SIMULATION 


TRACE_ON  NRZ_CK  /OE  /RST  NRZ_IN 

C2  CI  CO  NRZ  DELAY  R3  R2  Rl  RO 


SETF  OE 
RST 

CLOCKF  NRZ  CK 


;  ENABLE  OUTPUT 
;  RESET 


;  INITIALIZE 


SETF  NRZ_IN 

/RST 
CLOCKF  NRZ  CK 


PERFORM  ZERO  SUBSTITUTION 


SETF  /NRZ_IN 
FOR  J:=  0  TO  8  DO 
BEGIN 

CLOCKF  NRZ_CK 
END 

SETF  NRZ_IN  ;  ALTERNATE  POSITIVE  AND  NEGITIVE  PULSES 

CLOCKF  NRZ_CK 

SETF  NRZ_IN 

CLOCKF  NRZ_CK 

SETF  /NRZ_IN 

CLOCKF  NRZ_CK 

SETF  NRZ_IN 

CLOCKF  NRZ_CK 

SETF  /NRZ_IN  ;  PERFORM  ZERO  SUBSTITUTION 

FOR  J:=  0  TO  6  DO 
BEGIN 

CLOCKF  NRZ_CK 
END 

SETF  NRZ_IN 
CLOCKF  NRZCK 

TRACE  OFF 
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PAL ASM  SIMULATION.    V2.22  -  MARKET  RELEASE    (-1  1-19-86) 
(C)    -  COPYRIGHT  MONOLITHIC  MEMORIES   INC.  1986 
PAL. ASM  SIMULATION  SELECTIVE  TRACE  LISTING 


Title 
Pattern 
Revi  51  on 


PAL  A 

882S  ENCODER  PAL 
1 . 02 


Author  : 
Company  i 
Date  : 


THERESA  SHAFER 
MM  I 

6/6/87 


PAL  A 
Paqe  i 


NRZ_CK 
/OE 
/RST 
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C2 

CI 

CO 
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PAL  A 

Page  :  2 


eg     eg     eg     c     c     c     c     c     c     eg  c 


IN 


NRZ  .CK 
/OE 
/RST 

NRZ. 

C2 

CI 

CO 

NRZ_  DELAY 

R3 

R2 

Rl 
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TITLE  PAL_B 

PATTERN  B8ZS  ENCODER  PAL  B 

REVISION  1.04 

AUTHOR  THERESA  SHAFER 

COMPANY  MMI 

DATE  6/6/87 

CHIP  PAL  B  PAL16R6 


;  PINS 

;i 

NRZ  CK 
;ll 
/OE 


2 

3 

4 

5 

6 

7  8 

9 

/RST 

NRZ  IN 

C2 

CI 

CO 

NRZ  DELAY  NC 

NC 

12 

13 

14 

15 

16 

17  18 

19 

NC 

CS 

NPO 

PPO 

QO 

Ql  Q2 

NC 

GND 
20 
VCC 


INPUTS:       NRZ_CK  EXTERNAL  CLOCK 

/OE  ACTIVE  LOW  OUTPUT  ENABLE  SIGNAL 

/RST  ACTIVE  LOW  MASTER  RESET  SIGNAL 

NRZ_DELAY     SERIAL  NRZ  DATA  STREAM  WHICH  WAS  DELAYED 
BY  PAL  A 
C2  -  CO  COUNTER  OUTPUTS 

NRZ_IN  NRZ  DATA  STREAM  INDICATES  AN 

8-BIT  STREAM  OF  ALL  ZERO  WHEN 
COUNTER  OUTPUTS  ARE  111  AND  NRZ  IN  =  0 


OUTPUTS : 


EQUATIONS 


CS  CURRENT  STATE 

PPO  POSITIVE  B8ZS  CODE 

NPO  NEGATIVE  B8ZS  CODE 

Q2  -  QO  B8ZS  SEQUENCE  STATE  MACHINE  STATES 


B8ZS  ENCODING  STATE  MACHINE  EQUATIONS 
STATE  VARIATES  =   [CS, PPO, NPO] 
WITH  THE  FOLLOWING  STATE  ASSIGNMENT 
P_ZERO       =  [0,0,0] 
NEGATIVE  =  [1,0,1] 
N_ZERO       =  [1,0,0] 
POSITIVE  =  [0,1,0] 


/Ql 


/CS  :=  /CS  *  /NPO  *  /Ql  *  QO 
+  CS  *  /PPO  *  Ql  *  /QO 
+  /CS  *  /NPO  *  /NRZ_DELAY  * 
+  /CS  *  /NPO  *  Q2  *  /Ql 
+  CS  *  NRZ_DELAY  *  /PPO  *  /Q2 
+  RST 


/PPO  :=  CS  *  /PPO  *  QO  *  /Ql 
+  /CS  *  /NPO  *  /QO 
+  CS  *  /NRZ_DELAY  *  /PPO  *  /Ql 
+  CS  *  /PPO  *  Q2  *  /Ql 
+  RST 


/QO 
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/NPO  :=  /CS  *  /NPO  *  /Ql  *  QO 

+  /CS  *  /NPO  *  /NRZ_DELAY  *  /Ql 
+  /CS  *  /NPO  *  Q2  *  /Ql 
+  CS  *  /PPO  *  /QO 
+  RST 


B8ZS  SEQUENCE  STATE  MACHINE  EQUATIONS 
STATE  VARIATES  [Q2,Q1,Q0] 
WITH  THE  FOLLOWING  STATE  ASSIGNMENTS 
A  =  [0,0,0] 


B 
C 
D 
E 
F 


[0,0,1] 
[0,1,0] 
[1,0,0] 
[1,0,1] 
[1,1,0] 


/Q2   :=  /Q2  *  /Ql 
+  Q2  *  Ql 
+  RST 

/Ql  :=  /QO 

+  Q2  *  Ql 
+  RST 


/QO  :  = 


/Q2 
/Q2 
/Q2 
/Q2 
QO 
Ql 
+  RST 


*  /Ql  *  /QO  *  /C2 

*  /Ql  *  /QO  *  /CI 

*  /Ql  *  /QO  *  /CO 

*  /Ql  *  /QO  *  NRZ_IN 


SIMULATION 

TRACE_ON  NRZ_CK  /OE  /RST  NRZ_ DELAY  NRZ_IN  C2  CI  CO 
CS  PPO  NPO  Q2  Ql  QO 

SETF  OE  ;  ENABLE  OUTPUT 

RST  ;  RESET 

CLOCKF  NRZCK 

;  INITIALIZE 

SETF  NRZ_DELAY 
/RST 

NRZ_IN  /C2  /CI  /CO 
CLOCKF  NRZCK 

;  PERFORM  NORMAL  ENCODING 

SETF  /NRZ_DELAY 

/NRZ_IN  /C2  /CI  /CO 
CLOCKF  NRZCK 
SETF  NRZ_DELAY 
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/C2  /CI  CO 
CLOCKF  NRZ_CK 
SETF  NRZ_DELAY 

/C2  CI  /CO 
CLOCKF  NRZ_CK 
SETF  NRZ_DELAY 

/C2  CI  CO 
CLOCKF  NRZ_CK 
SETF  /NRZ_DELAY 

C2  /CI  /CO 
CLOCKF  NRZ_CK 
SETF  /NRZ_DELAY 

C2  /CI  CO 
CLOCKF  NRZ_CK 
SETF  /NRZ_DELAY 

C2  CI  /CO 
CLOCKF  NRZ_CK 
SETF  /NRZ_DELAY 

C2  CI  /CO 
CLOCKF  NRZ_CK 
SETF  C2  CI  CO 
CLOCKF  NRZ_CK 
SETF  /C2  /CI  /CO 
CLOCKF  NRZ_CK 
FOR  I:=0  TO  5  DO 

BEGIN 

CLOCKF  NRZ_CK 
END 


SETF  /C2  /CI  /CO 

NRZ_DELAY 
CLOCKF  NRZ_CK 
SETF  /NRZ_DELAY 
CLOCKF  NRZ_CK 
SETF  /NRZ_DELAY 
CLOCKF  NRZ_CK 
SETF  /NRZ_DELAY 
CLOCKF  NRZCK 
SETF  C2  CI  CO 

/NRZ_DELAY 
CLOCKF  NRZ_CK 
SETF  /C2  /CI  /CO 

/NRZ_DELAY 
CLOCKF  NRZ_CK 
FOR  I:=0  TO  5  DO 

BEGIN 

CLOCKF  NRZ_CK 
END 


;  PERFORM  B8ZS  SUBSTITUTION 
;   FORCE  VIOLATION 


;  FORCE  ONE 

;  FORCE  ZERO 

;  FORCE  VIOLATION 

;  FORCE  ONE 


;  PERFORM  NORMAL  ENCODING 


SETF  /NRZ_DELAY 

NRZ_IN  C2  CI  /CO 
CLOCKF  NRZCK 
SETF  C2  CI  CO 
CLOCKF  NRZCK 
SETF  /C2  /CI  /CO 
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CLOCKF  NRZ_CK 
FOR  I:=0  TO  5  DO 
BEGIN 

CLOCKF  NRZ_CK  ;  PERFORM  B8ZS  SUBSTITUTION 

END 

TRACE_OFF 
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PAL ASM  SIMULATION,    V2.22  -  MARKET  RELEASE  (11-19-86) 
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PAL ASM  SIMULATION  SELECTIVE  TRACE  LI  ST  INS 


Title         :  PAL_B 

Pattern     :    BBZS  ENCODER  PAL  B 

Revision   :  1.04 


Author        :    THERESA  SHAFER 
Company     :    MM I 
Date  :  6/6/87 
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TITLE  B8ZS_DEC0DER 

PATTERN  B8ZS  DECODER  PAL 

REVISION  1.05 

AUTHOR  THERESA  SHAFER 

COMPANY  MMI 

DATE  6/6/87 

CHIP  B8ZS_DECODER  PAL16R6 
;PINS 


;l 

2 

3 

4 

5 

6 

7 

8 

9 

Tl  CK 

/RST 

PPI 

NPI 

NC 

NC 

NC 

NC 

NC 

GND 

;ii 

12 

13 

14 

15 

16 

17 

18 

19 

20 

/OE 

NC 

QO 

Ql 

Q2 

Q3 

/BPV 

/B8ZS 

DATA 

VCC 

INPUTS:       T1_CK         EXTERNAL  CLOCK 

/OE  ACTIVE  LOW  OUTPUT  ENABLE  SIGNAL 

/RST  ACTIVE  LOW  MASTER  RESET  SIGNAL 

PPI  SERIAL  POSITIVE  PUSLE  DATA  STREAM 

TO  BE  DECODED 
NPI  SERIAL  NEGATIVE  PUSLE  DATA  STREAM 

TO  BE  DECODED 


OUTPUTS:       DATA  DATA    WHICH  IS  LOADED  INTO     THE  SHIFT 

REGISTER 

Q3  -  QO  STATE  MACHINE  OUTPUTS 

/BPV  LOW  ACTIVE  SIGNAL  WHICH  INDICATES  THAT 

BIPOLAR  VIOLATION  HAS  OCCURRED 
/B8ZS  LOW  ACTIVE  SIGNAL  WHICH  INDICATES  THAT 

ZERO  SUBSTITUTION  HAS  OCCURRED 


EQUATIONS 

;  DATA  OUTPUT 

/DATA  =  /PPI  *  /NPI; 

;  /BPV  AND  /B8ZS  FLAGS 


BPV  :=  PPI  *  /Q2  *  Ql 

+  PPI  *  /Q3  *  Q2  *  /QO 
+  NPI  *  Q2  *  Ql 
+  NPI  *  /Q3  *  Ql  *  /QO 
+  /PPI  *  /Q2  *  /Ql  *  QO 
+  /PPI  *  Q3  *  /Ql  *  QO 
+  /NPI  *  /Q3  *  Q2  *  /Ql 
+  /NPI  *  Q2  *  /Ql  *  /QO; 

B8ZS  :=  /PPI  *  NPI  *  /Q3  *  Q2  *  /Ql  *  QO 
+  PPI  *  /NPI  *  Q3  *  Q2  *  /Ql  *  QO; 

;  STATE  MACHINE  WHICH  DETECTS  B8ZS  SEQUENCE 

;  STATE  VARIABLES  [Q3,Q2,Q1,Q0] 

;  WITH  THE  FOLLOWING  STATE  ASSIGNMENTS 


£1  Monolithic  KH3  Memories  £1 
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SPO  =  [0,0,1,1 
SP1  =  [1,0,1,1 
SP2  =  [1,0,1,0 

SP3  =  [0,0,0,0 
SP4  =  [0,1,0,0 
SP5  =  [0,0,1,0 
SP6  =  [1,1,0,0 
SP7  =  [1,1,0,1 
SNO  =  [1,1,1,1 
SN1  =  [0,1,1,1 
SN2  =  [1,1,1,0 
SN3  =  [1,0,0,0 
SN4  =  [1,0,0,1 
SN5  =  [0,1,1,0 
SN6  =  [0,0,0,1 
SN7  =  [0,1,0,1 


/Q3   :=  PPI  *  /RST 

+  /NPI  *  /Q3  *  /Q2  *  /Ql  *  /QO  *  /RST 

+  /NPI  *  Q3  *  /Q2  *  Ql  *  /QO  *  /RST 

+  /NPI  *  /Q3  *  Q2  *  Ql  *  /QO  *  /RST 

+  /NPI  *  Q3  *  /Ql  *  QO  *  /RST 

+  /NPI  *  Q3  *  Q2  *  QO  *  /RST 

+  NPI  *  /Q3  *  Q2  */Ql  *  /QO  *  /RST; 

/Q2  :=  PPI  *  Ql  *  /RST 
+  PPI  *  Q2  *  /RST 
+  /NPI  *  /Q3  *  Q2  *  /QO  *  /RST 
+  /NPI  *  Q3  *  Ql  *  /QO  *  /RST 
+  /PPI  *  /NPI  *  /Q3  *  /Ql  *  /RST 
+  /NPI  *  /Q2  *  Ql  *  QO  *  /RST 
+  Q3  *  /Q2  *  /Ql  *  /QO  *  /RST 
+  /Q3*  Q2  *  /Ql  *  /QO  *  /RST; 

/Ql  :=  PPI  *  /Q3  *  /Q2  *  /Ql  *  /RST 

+  /PPI  *  /NPI  *  /Q2  *  /QO  *  /RST 

+  /PPI  *  /NPI  *  Ql  *  /QO  *  /RST 

+  /PPI  *  NPI  *  Q3  *  /Ql  *  /QO  *  /RST; 

/QO   :=  PPI  *  /Q3  *  /Q2  *  /Ql  *  /QO  *  /RST 
+  PPI  *  Q3  *  /Q2  *  /Ql  *  QO  *  /RST 
+  /PPI  *  /NPI  *  /Q2  *  /QO  *  /RST 
+  /PPI  *  /NPI  *  Q3  *  /QO  *  /RST 
+  /PPI  *  /NPI  *  /Q3  *  /Q2  *  /Ql  *  /RST 
+  /PPI  *  /NPI  *  Q3  *  /Q2  *  Ql  *  /RST 
+  /PPI  *  /NPI  *  /Q3  *  Q2  *  Ql  *  QO  *  /RST 
+  /PPI  *  NPI  *  /Q3  *  Q2  *  /Ql  *  /QO  *  /RST; 
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SIMULATION 

TRACE_ON  T1_CK  /OE  /RST  PPI  NPI 
Q3  Q2  Ql  QO  /BPV  /B8ZS 


SETF  OE 


ENABLE  OUTPUT 


RST 

/PPI  /NPI 
CLOCKF  T1_CK 
SETF  RST 
CLOCKF  T1_CK 
SETF  /PPI  /NPI 

/RST 
CLOCKF  Tl  CK 


;  RESET 


;  NORMAL  DECODING 


SETF  PPI  /NPI 
CLOCKF  T1_CK 
SETF  /PPI  /NPI 
CLOCKF  T1_CK 
SETF  /PPI  NPI 
CLOCKF  T1_CK 
SETF  /PPI  /NPI 
CLOCKF  T1_CK 
SETF  PPI  /NPI 
CLOCKF  T1_CK 
SETF  /PPI  NPI 

CLOCKF  T1_CK 
SETF  PPI  /NPI 
CLOCKF  Tl  CK 


DETECT  B8ZS  SEQUENCE 
+  SEQUENCE 


SETF  /PPI  /NPI 
CLOCKF  T1_CK 
SETF  /PPI  /NPI 
CLOCKF  T1_CK 
SETF  /PPI  /NPI 
CLOCKF  T1_CK 
SETF  PPI  /NPI 
CLOCKF  T1_CK 
SETF  /PPI  NPI 
CLOCKF  T1_CK 
SETF  /PPI  /NPI 
CLOCKF  T1_CK 
SETF  /PPI  NPI 
CLOCKF  T1_CK 
SETF  PPI  /NPI 
CLOCKF  T1_CK 

SETF  /PPI  NPI 
CLOCKF  T1_CK 

SETF  /PPI  /NPI 
CLOCKF  Tl  CK 


;  SET-UP  FOR  NEXT  SEQUENCE 


;   -  SEQUENCE 
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SETF  /PPI  /NPI 
CLOCKF  T1_CK 
SETF  /PPI  /NPI 
CLOCKF  T1_CK 
SETF  /PPI  NPI 
CLOCKF  T1_CK 
SETF  PPI  /NPI 

CLOCKF  T1_CK 
SETF  /PPI  /NPI 
CLOCKF  T1_CK 
SETF  PPI  /NPI 
CLOCKF  T1_CK 
SETF  /PPI  NPI 
CLOCKF  T1_CK 

;  DETECT  BIPOLAR  VIOLATIONS 

SETF  /PPI  /NPI 
CLOCKF  T1_CK 
SETF  PPI  /NPI 
CLOCKF  T1_CK 
SETF  /PPI  /NPI 
CLOCKF  T1_CK 
SETF  PPI  /NPI 
CLOCKF  T1_CK 
SETF  /PPI  NPI 
CLOCKF  T1_CK 
SETF  /PPI  /NPI 
CLOCKF  T1_CK 
SETF  /PPI  /NPI 
CLOCKF  T1_CK 
SETF  /PPI  /NPI 
CLOCKF  T1_CK 
SETF  /PPI  NPI 
CLOCKF  T1_CK 
SETF  PPI  /NPI 
CLOCKF  T1_CK 
SETF  /PPI  /NPI 
CLOCKF  T1_CK 
SETF  /PPI  /NPI 
CLOCKF  T1_CK 
SETF  /PPI  /NPI 
CLOCKF  T1_CK 
SETF  /PPI  /NPI 
CLOCKF  T1_CK 
SETF  PPI  /NPI 
CLOCKF  T1_CK 

TRACE_OFF 
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PAL ASM  SIMULATION.    V2.22  -  MARKET  RELEASE    (1 1-19-86) 
<G3    -  COPYRIGHT  MONOLITHIC  MEMORIES   INC,  1986 
PAL ASM  SIMULATION  SELECTIVE  TRACE  LI ST  INS 


Title  :    B8ZS  DECODER 

Pattern  :  B8ZS  DECODER  PAL 
Ravi  si  on   s    1  .  05 


Author        :    THERESA  SHAFER 
Company     :   MM I 
Date  :  6/6/87 


BSZS  J3EC0DEP 
Page   :  1 
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B8ZS_  DECODER 
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Introduction 

A  d  gital  communication  trunk  system  consists  essentially  of  two 
channel  banks  and  several  repeaters  on  the  transmission  lines 
(Figure  1).  A  channel  bank  is  terminal  equipment  which  multi- 
plexes more  than  one  channel  onto  a  single  transmission  line 
using  Time  Division  Multiplexing  (TDM)  technique.  This  TDM 
technique  assigns  an  equal  individual  time  slot  for  each  channel 
allow  several  channels  to  share  a  transmission  line  sequen- 


Figure  1 .  Digital  Communication  Trunk  System 


High  Density  Bipolar  (HDB3)  code  is  a  modified  bipolar  code.  A 
bipolar  code  uses  alternating  polarities  for  successive  logic 
ones.  The  signal  edges  provide  the  timing  information.  An  error 
occurs  when  two  contiguous  logic  ones  have  the  same  polarity; 
this  is  defined  as  a  "bipolar  violation."  HDB3  code  permits  a  legal 
bipolar  violation  to  prevent  four  or  more  contiguous  zeros  in  the 
data  stream.  Three  is  the  maximum  number  of  contiguous  zeros 
allowed  before  a  violation  is  inserted  to  maintain  timing  informa- 
tion for  a  long  string  of  logic  zeros. 

HDB3  Code 

Code  conversion  rules  for  trunk  lines  are  defined  in  the  CCITT 
G703  recommendation,  which  includes  two  standards.  One 
standard  is  T-carrier  standard  for  North  America  and  Japan;  the 
other  standard  is  CEPT  standard  for  Europe  and  elsewhere. 
HDB3  code  is  recommended  by  CCITT  for  the  CEPT  standard, 
at  the  2,048,  8,448,  or  34,368  kbps  rates. 
Violation  patterns  used  for  four-zeros-substitution  in  HDB3  are 
shown  in  Figure  2.  Four  possible  HDB3  substitution  codes  are: 
+00+,  -00-,  000+,  or  000-.  The  "+"  indicates  the  positive  voltage, 


"-"  indicates  the  negative  voltage,  and  "0"  indicates  the  zero  volt- 
age. The  polarity  of  the  preceding  pulse  and  the  number  of  bipo- 
lar pulses  after  the  previous  bipolar  violation  select  the  substitu- 
tion code. 


POLARITY 
PRECEDING 
PULSE 

NUMBER  OF  BIPOLAR  PULSE 

ODD 

EVEN 

Positive 
M 

000+ 

-00- 

Negative 

(-) 

000- 

+00+ 

Figure  2.  HDB3  Substitution  Table 

HDB3  Coding  Example 

An  example  of  converting  from  NRZ  (Non-Return-Zero)  data  to 
HDB3  code  is  shown  in  Figure  3.  There  are  four  sets  of  four  suc- 
cessive zeros  in  the  data  stream,  in  the  first  group  the  previous 
signal  is  positive  and  the  number  of  logic  ones  is  odd,  so  the 
"000+"  pattern  is  generated.  In  the  second  group  the  previous 
signal  is  positive  and  the  number  of  logic  ones  is  even,  so  the  "- 
00-"  pattern  is  generated.  In  the  third  group  the  previous  signal  is 
negative  and  the  number  of  logic  ones  is  even,  so  the  "+00+" 
pattern  is  generated  .  A  "000-"  pattern  is  generated  in  the  fourth 
group  because  the  previous  signal  is  negative  and  the  number 
of  logic  ones  is  odd. 

HDB3  substitution  codes  force  the  fourth  bit  to  be  a  bipolar  viola- 
tion. The  violation  is  chosen  by  the  following  scheme,  based  on 
the  number  of  bits  since  the  previous  violation: 

(1)  When  the  number  is  even,  HDB3  substitution  code  forces  the 
first  zero  to  be  a  bipolar  code  (B)  and  the  fourth  zero  to  be  a  bi- 
polar violation  (V).  Therefore,  four  successive  zeros  is  converted 
to  a  bipolar  signal,  following  two  continuous  zeros,  and  a  bipolar 
violation  with  the  first  bipolar  signal  (B00V).  The  polarity  of  the 
bipolar  violation  (V)  is  the  same  as  the  polarity  of  the  first  bipolar 
signal  (B). 


NRZ  SIGNAL  STRING 


-001-0000-11-0000-11-0000-1-0000 


THE  PREVIOUS 
SIGNAL 


POLARITY   

 »•     0  0  + 

-  ♦ 

+  - 

NUMBER   

 *-  1 

2 

2 

1 

2-384 


HOB3  SUBSTITUTION  CODE 


HDB3 CODE 


-0  0  + 


000+  -00-  +00+  000- 

W  W 


GROUP  1  GROUP  2         GROUP  3  GROUP4 

Figure  3.  An  Example  of  HDB3  Coding 


HDB3  Line  Coding  Using  PAL  Devices 


(2)  When  the  number  is  odd,  HDB3  substitution  code  forces  the 
fourth  zero  to  be  a  bipolar  violation  (V).  Therefore,  HDB3  is  con- 
verted as  three  continuous  zeros  followed  by  bipolar  violation 
with  the  previous  bipolar  signal  (000V).  The  polarity  of  the  bipo- 
lar violation  is  the  same  as  the  polarity  of  the  previous  bipolar 
signal. 

HDB3  Encoder  Implementation 

Figure  4  shows  the  block  diagram  of  an  HDB3  encoder,  which 
includes  three  state  machines  and  one  4-bit  shift  register.  A  DE- 
TECT_4_ZEROS  state  machine  detects  four  successive  zeros. 
An  ODD_EVEN  state  machine  checks  the  number  of  logic  ones 
after  the  last  bipolar  violation.  An  HDB3  encoding  state  machine 


generates  the  HDB3  code  and  a  RESET  signal  which  clears  the 
ODDJEVEN  state  machine.  A  4-bit  shift  register  delays  the  in- 
coming NRZ  data  in  order  to  synchronize  with  the  4-ZEROS  sig- 
nal from  the  DETECT_4_ZEROS  state  machine. 

The  HDB3  encoding  state  machine  generates  HDB3  code,  de- 
pending on  the  DSIN,  4-ZEROS,  ODD_EVEN,  and  POS_NEG 
signals.  When  DSIN  is  false  and  4-ZEROS  is  false,  the  output  is 
a  zero.  When  DSIN  is  true  and  4-ZEROS  is  false,  the  output 
generates  a  signal  that  alternates  between  the  positive  and  neg- 
ative signals.  When  DSIN  is  true  and  4-ZEROS  is  true,  an  error 
flag  is  raised.  HDB3  substitution  code  is  generated  when  DSIN 
is  false  and  4-ZEROS  is  true.  Four  possible  HDB3  substitution 
codes  are  generated,  depending  on  POS_NEG  and 
ODD_EVEN  signals.  These  are  summarized  in  Figure  5. 


NRZIN 


DETECT  4  ZEROS 


4  -  ZEROS 


S3 

S2 

S1 

SO 

4- 

BIT 

DSIN 


ODD  _  EVEN 


ODD  EVEN 


GENERATE 
HDB3 
CODING 
STATE 
MACHINE 


(  +  ) 
— »»PHDB 


IHDB 


(-) 


SHIFT  REGISTER 

Figure  4.  Block  Diagram  of  an  HDB3  Encoder 


DSIN 

4-ZEROS 

POSJJEG 

ODD  EVEN 

CHARACTERISTICS 

0 

0 

0 

0 

0 

0 

0 

1 

Zero  output 

0 

0 

1 

1 

0 

0 

1 

0 

0 

1 

0 

0 

+00+ 

0 

1 

0 

1 

000- 

0 

1 

1 

1 

000+ 

0 

1 

1 

0 

-00- 

0 

0 

0 

0 

0 

1 

Alternating 

0 

1 

1 

PHDB  (+)  and  NHDB  (-) 

0 

1 

0 

1 

0 

0 

1 

0 

1 

Error 

1 

1 

1 

1 

1 

0 

Figure  5.  HDB3  Encoding  State  Table 
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The  three  Programmable  Array  Logic  (PAL)  devices  used  to  im- 
plement the  entire  HDB3  encoder  are  a  16RA8,  a  20RS8,  and  a 
16R4  (see  Figures). 

The  16RA8  includes  two  state  machines:  ODD_EVEN  and  DE- 
TECT_4_ZEROS.  The  ODD_EVEN's  output  signal  is  fed  into 
the  HDB3  decoding  state  machine.  If  the  ODD_EVEN  state  ma- 
chine is  triggered  by  the  same  clock  edge  as  the  decoding  state 
machine,  the  wrong  result  is  generated  because  the  encoding 
state  machine  always  gets  the  ODD_EVEN's  output  signal  one 
clock  early.  Therefore,  the  ODD_EVEN  state  machine  must  be 
triggered  by  the  falling  edge  of  the  clock  to  compensate  for  this 
timing  problem.  However,  the  DETECT_4_ZEROS  state  ma- 
chine is  triggered  by  the  rising  edge  of  the  clock.  Both  rising  and 
falling  edges  are  needed  for  these  two  state  machines.  The 
1 6RA8  device  offers  the  programmable  clock  feature  because 
one  of  the  product  terms  in  each  cell  is  the  clock  input.  The  clock 
flexibility  of  the  16RA8  allows  the  falling-edge  clock  (CLK)  for 
the  ODD_EVEN  state  machine  and  the  rising-  edge  clock  (CLK) 
for  the  DETECT_4_ZEROS  state  machine  in  a  single  20-pin 
PAL  device. 


The  HDB3  encoding  state  machine  diagram  is  shown  in  Figure 
7.  Four  registers  are  used  for  fifteen  states.  Two  of  the  four  reg- 
isters have  ten  product  terms,  one  register  has  eight  product 
terms,  and  the  final  register  has  seven  product  terms.  Both  the 
20RS4  and  20RS8  have  the  product-sharing  feature.  The 
20RS8  is  used  to  implement  this  HDB3  encoding  state  machine 
because  the  product  sharing  feature  is  sixteen  product  terms 
shared  by  two  registers  exclusively,  and  in  this  case  at  least  sev- 
enteen product  terms  are  required  per  register  pair.  The  design 
fits  in  the  20RS8  by  using  the  product  terms  from  two  registers 
for  one  output,  and  leaving  the  other  output  as  a  no-connect.  In 
addition  to  the  HDB3  encoding  state  machine,  a  reset  signal  and 
an  error  message  are  generated  in  this  20RS8.  The  reset  signal 
clears  the  ODD_EVEN  state  machine.  The  error  message  oc- 
curs when  DSIN  and  4_ZEROS  both  are  high  simultaneously. 

The  16R4  has  four  registered  outputs  and  four  combinatorial 
outputs.  A  4-bit  shift  register  is  implemented  using  four  regis- 
tered outputs.  The  positive  HDB3  signal  (PHDB)  and  negative 
HDB3  signal  (NHDB)  are  generated  using  two  combinatorial 
outputs. 


Figure  6.  HDB3  Encoder  Implemented  by  Three  PAL  Devices 
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/DSIN'  ZEROS 


S4  (0100) 
PHDB  =  1 
NHDB  =  0 
RESET  =  1 
ERROR  =  0 


SS  (1100) 
PHDB  =  0 
NHDB  =  0 
RESET  =  0 
ERROR  =  0 


S6(1101) 
PHDB  =  0 
NHDB  a  0 
RESET  =  0 
ERROR =0 


S7(1111) 
PHDB = 1 
NHDB  =  0 
RESET  =1 
ERROR  -  0 


/DSIN* 
ZEROS' 
/ODDEVEN 


/DSIN* 
ZEROS' 
/ODDEVEN 


/DSIN' 
/ZEROS' 


DSIN' 
/ZEROS 


c 


/DSIN' 
ZEROS' 
/ODDEVEN 


SO  (0000) 
PHDB  =  0 
NHDB  =  0 
RESET  =  0 
ERROR  =  0 


/DSIN* 
/ZEROS- 


SI  3  (0101) 
PHDB  =  0 
NHDB  =  0 
RESET  =  0 
ERROR  =  0 


/DSIN* 
ZEROS* 
ODDEVEN 


DSIN* 
ZEROS 


DSIN* 
/ZEROS 


/DSIN* 
/ZEROS' 


S3  (1110) 
PHOB  =  0 
NHDB = 1 
RESET  =  0 
ERROR  a  0 


DSIN*  /ZEROS 


/DSIN* 
/ZEROS 


S1  (1000) 
PHDB=1 
NHDB  =  0 
RESET  =  0 
ERROR  =  0 


/DSIN* 
/ZEROS' 


/DSIN' 
ZEROS 


Q 


/DSIN' 
ZEROS' 
/ODDEVEN 


DSIN* 
/ZEROS 


S2  (1010) 
PHDB  =0 
NHDB  =  0 
RESET  =  0 
ERROR  =  0 


/DSIN* 
ZEROS* 
ODDEVEN 


ALL  DSIN*  ZEROS 


S8(0111) 
PHDB=0 
NHDB  =  0 
RESET  =  0 
ERROR  =  0 


S14(1001) 
PHDB  =  0 
NHDB  =  0 
RESET  =  0 
ERROR  =  1 


SI  2 (0001) 
PHDB  =  0 
NHDB  =  1 
RESET =1 
ERROR  =  0 


DSIN* 
/ZEROS 


S11  (0011) 
PHDBrO 
NHDB  =  0 
RESET  =  0 
ERROR  =  0 


S1 0(0010) 
PHDB  =  0 
NHDB  =  0 
RESET  =  0 
ERROR =0 


/DSIN* 
ZEROS' 
/ODDEVEN 


S9  (0110) 
PHDB  =  1 
NHDB  =  0 
RESET  =  1 
ERROR  =  0 


/DSIN'  ZEROS 


Notes: 

1.  Inputs:  DSIN,  ZEROS,  and  ODDEVEN. 

2.  State  Outputs:  PHDB,  NHDB.  RESET  and  ERROR. 


Figure  7.  HDB3  Encoding  State  Machine 
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HDB3  Decoder  Implementation 

The  block  diagram  of  the  HDB3  decoder  is  shown  in  Figure  8.  A 
decoding  state  machine  detects  the  HDB3  code,  then  either  a 
Data  Serial  OUT  (DSOUT)  signal  or  a  RESET  signal  is  generat- 
ed. These  two  signals  are  fed  into  the  4-bit  shift  register.  Two 
error  flags  are  generated:  VERROR  (Violation  ERROR)  and  ER- 
ROR. 


nrzout - 


S3 

S2 

S1 

SO 

RESET 


4-  BIT 
SHIFT  REGISTER 


DSOUT 
ERROR-" 
VERROR « 


DETECT 
HDB3  CODING 
STATE 
MACHINE 


PHDB 


-  NHDB 


Figure  8.  Block  Diagram  of  an  HDB3  Decoder 


A  single  24-pin  PAL  device  (20RS4)  and  a  4-bit  shift  register 
(HCT194)  are  used  to  implement  the  HDB3  decoder  (see  Figure 
9).  The  decoding  state  machine  diagram  is  shown  in  Figure  10. 
Four  registers  are  used  for  thirteen  states.  Two  of  the  four  reg- 
isters have  eight  product  terms,  one  register  has  nine  product 
terms,  and  the  final  register  has  five  product  terms.  Because 
the  total  product  terms  of  two  registers  can  be  less  than  six- 
teen product  terms,  these  four  registers  can  be  implemented 
using  a  20RS4  device  rather  than  a  20RS8. 

When  an  HDB3  substitution  code  is  detected,  a  reset  signal  is 
generated  to  load  four  zeros  to  the  HCT194.  Otherwise, 
DSOUT  is  generated  to  the  shift  right  serial  input  (SR  SER)  of 
the  HCT194.  When  any  two  sequential  signals  of  the  same  po- 
larity are  detected  except  the  HDB3  substitution  code,  the 
bipolar  violation  signal  is  generated  (VERROR).  The  ERROR 
signal  is  generated  when  both  positive  HDB3  signal  and  nega- 
tive HDB3  signal  are  high  simultaneously. 


Figure  9.  HDB3  Decoder  Implemented  by  a  PAL  Device  and  a  Shift  Register 
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/PHDB* 
NHDB 


PHDB* 
/NHDB 


PHDB* 
/NHDB 


S11(1110) 
RESET =0 
VERROR  =  1 
1=0 


/PHDB* 
/NHDB 


PHDB' 
/NHDB 


/PHDB* 
/NHDB 


PHDB* 
/NHDB 


SO  (0000) 
RESET  =  0 
VERROR  =  0 
ERROR  = 
PHDB* 
NHDB 


ALL 
PHDB* 
NHDB 


/PHDB* 
NHDB 


/PHDB* 
NHDB 


PHDB* 
/NHDB 


S1(1000) 
RESET  =  0 
VERROR = 1 
ERROR =0 


/PHDB* 
/NHDB 


S2  (1001) 
RESET  =  0 
VERROR  =  1 
=  0 


/PHDB* 
/NHDB 


/PHDB* 
/NHDB 


PHDB* 
/NHDB 


S3  (1011) 
RESET  = 
PHDB-  NHDB 
VERROR =0 
ERROR  =  0 


/PHDB* 
/NHDB 


S4  (1010) 
RESET = 
PHDB'/NHDB 
VERROR =0 

ERROR  =  0 


PHDB* 
/NHDB 


S5(0010) 
RESET  =  0 
VERROR  >  1 
ERROR  =  0 


PHDB* 
/NHDB 


/PHDB* 
NHDB 


/PHDB* 
NHDB 


/PHDB* 
/NHDB 


/PHDB* 
NHDB 


S12  (1101) 
RESET  =  0 
VERROR  a  1 
ERROR  •  0 


/PHDB* 
/NHDB 


S6(0110) 
RESET  =  0 
VERROR  =  1 

=  0 


/PHDB* 
/NHDB 


PHDB* 
/NHDB 


S7  (0100) 
RESET  =  0 
VERROR  =  1 
ERROR  =  0 


PHDB* 
/NHDB 


/PHDB* 
/NHDB 


S8(0101) 
RESET = 
/PHDB"  NHDB 
VERROR  =  0 
ERROR =0 


PHDB* 
/NHDB 


S9 (0111) 
RESET = 
/PHDB*  NHDB 
VERROR  =  0 
ERROR  =  0 


PHDB* 
/NHDB 


Notes: 

1.  Inputs:  PHDB  and  NHDB. 

2.  State  Outputs:  RESET,  VERROR  and  ERROR. 


/PHDB* 
NHDB 


S10  (1111) 
RESET  =  0 
VERROR  =  1 
ERROR =0 


/PHDB* 
NHDB 


/PHDB* 
NHDB 


/PHDB* 
/NHDB 


/PHDB* 
NHDB 


/PHDB* 
NHDB 
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Title        HDB3  encoder  logic 
Pattern    HDB3EN1 . pds 
Revision  A 
Author      Cindy  Lee 

Company    Monothic  Memories  Inc.,  Santa  Clara,  Ca 
Date  3/4/87 

CHIP  HDB3EN1  PAL16RA8 

;PIN1  PIN2  PIN3  PIN4  PIN5  PIN6  PIN7  PIN8  PIN9  PIN10 
/PL       CLK     PHDB  NHDB  NRZIN  RESET     MR      NC       NC  GND 


;PIN11  PIN12  PIN13  PIN14  PIN15  PIN16 
/OE         ZQO       ZQ1       ZQ2       ZEROS  NC 

INPUT  SIGNALS 


PIN17  PIN18  PIN19  PIN20 
OQO       OQl     ODDEVEN  VCC 


/PL 

CLK 

PHDB 

NHDB 

NRZIN 

RESET 

MR 

/OE 


ASSERTIVE  LOW  PRELOAD 
EXTERNAL  CLOCK 
POSITIVE  HDB3  SIGNAL 
NEGATIVE  HDB3  SIGNAL 
NOR-RETURN-ZERO  INPUT  SIGNAL 

RESTE  SIGNAL  TO  RESET  THE  ODD_EVEN  STATE  MACHINEG 
MASTER  REST 

ASSERTIVE  LOW  OUTPUT  ENABLE 


OUTPUT  SIGNALS 
ZQO,   ZQ1,   ZQ2    :    DETECT_4_ZEROS  STATE  VARIABLES 
ZEROS   :   DETECT_4_ZEROS  STATE  OUTPUT 
OQO,   OQl       :   ODD_EVEN  STATE  VARIABLES 
ODDEVEN  :  ODD  EVEN  STATE  OUTPUT 


EQUATIONS 


;16RA8  PROVIDES  TWO  STATE  MACHINES 


(1)   THE  ODD_EVEN  STATE  MACHINE  CHECKS  THE  NUMBER  OF  BIPOLAR  PULSES 
AFTER  THE  PREVIOUS  BIPOLAR  VIOLATION  IS  ODD  OR  /EVEN 


OQl 
OQl 


ODDEVEN 
OQl 


.  SETF 
.  CLKF 


=  OQO*/OQ1 

:=  NHDB*OQO*/OQ1*/RESET 
+  PHDB*OQO*/OQ1*/RESET 
+  /NHDB*/PHDB*OQO*OQ1*/RESET 
=  MR 
=  /CLK 


ODD_EVEN  STATE  OUTPUT 
MSB  STATE  VARIABLE 


MASTER  RESET 

THE  FALLING  EDGE  OF 

THE  CLOCK 


OQC 


OQO. SETF 
OQO . CLKF 


:=  NHDB*/OQl*/RESET 
+  PHDB*/OQl*/RESET 
+  OQ0*/RESET 
=  MR 
=  /CLK 


;   LSB  STATE  VARIABLE 


£1, 
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(2)   THE  DETECT_4_ZER0S  STATE  MACHINE  DETECTS  FOUR  CONTINUOUS  ZEROS 


ZEROS 
ZQ2 

ZQ2.SETF 
ZQ2.CLKF 

ZQ1 

ZQ1.SETF 
ZQ1.CLKF 

ZQO 

ZQO.SETF 
ZQO.CLKF 


=  /ZQO  *  /ZQ1  *  ZQ2 


;  STATE  OUTPUT 


/NRZIN  *  /ZQO  *  ZQ1  *  /ZQ2     ;  MSB  STATE  VARIABLE 


=  MR 
=  CLK 

:=  /NRZIN  *  ZQO  *  /ZQ2 
=  MR 
=  CLK 

:=  /NRZIN  *  /ZQO  *  /ZQ1 
+  /NRZIN  *  /ZQ1  *  /ZQ2 
=  MR 
=  CLK 


;  MASTER  RESET 
;  EXTERNAL  CLOCK 


LSB  STATE  VARIABLE 


SIMULATION 

TRACE_ON  CLK  MR  NRZIN  PHDB  NHDB  RESET 

SETF  /CLK  MR  /NRZIN  /PHDB  /NHDB  /RESET 
SETF     CLK  /MR 

SETF  /CLK  /MR  /NRZIN  /PHDB  /NHDB  /RESET 
SETF  CLK 

SETF  /CLK  /NRZIN  /PHDB  /NHDB  /RESET 
SETF  CLK 

SETF  /CLK  /NRZIN  /PHDB  /NHDB  /RESET 
SETF  CLK 

SETF  /CLK  /NRZIN  /PHDB  /NHDB  /RESET 
SETF  CLK 

SETF  /CLK    NRZIN     PHDB  /NHDB  RESET 
SETF  CLK 

SETF  /CLK     NRZIN  /PHDB  /NHDB  /RESET 
SETF  CLK 

SETF  /CLK     NRZIN  /PHDB  /NHDB  /RESET 
SETF  CLK 

SETF  /CLK  /NRZIN     PHDB  /NHDB  RESET 
SETF  CLK 

SETF  /CLK     NRZIN  /PHDB     NHDB  /RESET 
SETF  CLK 


ODDEVEN  ZEROS 

OE  /PL     ; RESET  CONDITION 

;/DSIN,  /ZEROS,  /ODDEVEN 

; VECTOR  I,   STATE  0 
;/DSIN,  /ZEROS,  /ODDEVEN 

; VECTOR  2,   STATE  0 
;/DSIN,   /ZEROS,  /ODDEVEN 

; VECTOR  3,   STATE  0 
;/DSIN,   /ZEROS,  /ODDEVEN 

; VECTOR  4,   STATE  0 
;/DSIN,   ZEROS,  /ODDEVEN 

; VECTOR  5,   STATE  4 
;/DSIN,   /ZEROS,  /ODDEVEN 

; VECTOR  6,   STATE  5 
;/DSIN,  /ZEROS,  /ODDEVEN 

; VECTOR  7,   STATE  6 
;/DSIN,   /ZEROS,  /ODDEVEN 

; VECTOR  8,   STATE  7 
;DSIN,   /ZEROS,  /ODDEVEN 

; VECTOR  9,   STATE  3 
;DSIN,  /ZEROS,  ODDEVEN 


SETF  /CLK  /NRZIN     PHDB  /NHDB  /RESET 


; VECTOR  10,    STATE  1 


Monolithic        Memories  £1 
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SETF  CLK 


SETF 
SETF 


SETF 
SETF 

SETF 
SETF 

SETF 
SETF 


/CLK 
CLK 

/CLK 
CLK 

/CLK 
CLK 

/CLK 
CLK 


NRZIN  /PHDB     NHDB  /RESET 


/NRZIN  /PHDB  /NHDB  /RESET 


/NRZIN     PHDB  /NHDB  /RESET 


/NRZIN  /PHDB  /NHDB  /RESET 


SETF  /CLK 
SETF  CLK 

SETF  /CLK 
SETF  CLK 

SETF  /CLK 
SETF  CLK 

SETF  /CLK 
SETF  CLK 


SETF 
SETF 

SETF 
SETF 


/CLK 
CLK 

/CLK 
CLK 


SETF  /CLK 
SETF  CLK 


/NRZIN  /PHDB     NHDB  /RESET 


NRZIN  /PHDB  /NHDB  /RESET 


NRZIN  /PHDB  /NHDB  /RESET 


/NRZIN  /PHDB  /NHDB  /RESET 


/NRZIN  /PHDB     NHDB  RESET 


/NRZIN     PHDB  /NHDB  /RESET 


/NRZIN  /PHDB     NHDB  /RESET 


SETF 
SETF 

SETF 
SETF 


/CLK 
CLK 

/CLK 
CLK 


SETF  /CLK 
SETF  CLK 

SETF  /CLK 
SETF  CLK 

SETF  /CLK 
SETF  CLK 

SETF  /CLK 
SETF  CLK 


NRZIN     PHDB  /NHDB  RESET 


/NRZIN  /PHDB  /NHDB  /RESET 


NRZIN  /PHDB  /NHDB  /RESET 


SETF 
SETF 


/CLK 
CLK 


/NRZIN     PHDB  /NHDB  RESET 


NRZIN  /PHDB     NHDB  /RESET 


NRZIN  /PHDB  /NHDB  /RESET 


/NRZIN     PHDB  /NHDB  /RESET 


;DSIN,   /ZEROS,  /ODDEVEN 

; VECTOR  11,   STATE  3 
;/DSIN,  /ZEROS,  ODDEVEN 

; VECTOR  12,   STATE  0 
;DSIN,  /ZEROS,  ODDEVEN 

; VECTOR  13,   STATE  1 
;/DSIN,  /ZEROS,  /ODDEVEN 

; VECTOR  14,   STATE  2 
;DSIN,  /ZEROS,  /ODDEVEN 

; VECTOR  15,   STATE  3 
;/DSIN,   ZEROS,  ODDEVEN 

; VECTOR  16,   STATE  13 
;/DSIN,  /ZEROS,  ODDEVEN 

,-VECTOR  17,   STATE  10 
;/DSIN,   /ZEROS,  ODDEVEN 

,-VECTOR  18,   STATE  11 
;/DSIN,  /ZEROS,  ODDEVEN 

,-VECTOR  19,   STATE  12 
;DSIN,  /ZEROS,  /ODDEVEN 

,-VECTOR  20,   STATE  1 
,-DSIN,  /ZEROS,  ODDEVEN 

; VECTOR  21,   STATE  3 
;/DSIN,   ZEROS,  /ODDEVEN 

,-VECTOR  22,   STATE  4 
;/DSIN,  /ZEROS,  /ODDEVEN 

,-VECTOR  23,   STATE  5 
;/DSIN,  /ZEROS,  /ODDEVEN 

,-VECTOR  24,   STATE  6 
;/DSIN,  /ZEROS,  /ODDEVEN 

; VECTOR  25,    STATE  7 
;DSIN,  /ZEROS,  /ODDEVEN 

; VECTOR  26,   STATE  3 
;/DSIN,   /ZEROS,  ODDEVEN 

,-VECTOR  27,   STATE  0 
;DSIN,  /ZEROS,  ODDEVEN 

; VECTOR  28,    STATE  1 
;/DSIN,  /ZEROS,  /ODDEVEN 
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SETF  /CLK  /NRZIN  /PHDB  /NHDB  /RESET  ; VECTOR  29,   STATE  2 

SETF     CLK  ;DSIN,  /ZEROS,  /ODDEVEN 

SETF  /CLK  /NRZIN  /PHDB     NHDB  /RESET  ;VECTOR  30,   STATE  3 

SETF     CLK  ;DSIN,  /ZEROS,  ODDEVEN 

SETF  /CLK  /NRZIN     PHDB  /NHDB  /RESET  ;VECTOR  31,   STATE  1 

SETF     CLK  ;/DSIN,    ZEROS,  /ODDEVEN 

TRACE  OFF 
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Title       HDB3  encoder 
Pattern    HDB3EN2 .pds 
Revision  A 
Author      Cindy  Lee 

:ompany    Monolithic  Memories  Inc.,  Santa  Clara,  Ca 
it's  3/9/87 

IIP  HDB3EN2  PAL20RS8 

;PIN1  PIN2  PIN3       PIN4       PIN5  PIN6 
CLK       DSIN  ZEROS  ODDEVEN     NC  MR 

;PIN7  PIN8  PIN9  PIN10  PIN11  PIN12 
NC         NC       NC       NC        NC  GND 

;PIN13  PIN14  PIN15  PIN16  PIN17  PIN18 
/OE         ERROR    NC         Q3         NC  Q2 

fPIN19  PIN20  PIN21  PIN22  PIN23  PIN24 
NC  Ql         NC         QO         RESET  VCC 

INPUT  SIGNALS 

CLK  :   EXTERNAL  CLOCK 

DSIN  :   SERIAL  DATA  INPUT  SIGNAL 

ZEROS  :   FOUR  ZEROS  DETECTION  FLAG  INPUT 

ODDEVEN  :  ODD  OR  /EVEN  SIGNAL 

MR  :  MASTER  RESET 

/OE  :  ASSERTIVE  LOW  OUTPUT  ENABLE 

OUTPUT  SIGNALS 

ERROR       :   ERROR  FLAG 
Q3,   Q2,   Ql,   QO     :  HDB3  DECODING  STATE  MACHINE  STATE  VARIABLES 
RESET       :   RESET  SIGNAL 


EQUATIONS 

;20RS8  GENERATES  TWO  SIGNALS  AND  ONE  STATE  MACHINE 

;(1)   A  RESET  SIGNAL  CLEARS  THE  ODD_EVEN  STATE  MACHINE 

RESET       =  /QO  *     Q2  *  /Q3  ;  STATE  4  AND  STATE  9 

+     QO  *  /Ql  *  /Q2  *  /Q3  ;  STATE  12 

+     QO  *     Ql  *     Q2  *     Q3  ;  STATE  7 

;(2)   A  ERROR  SIGNAL  IS  RAISED  WHEN  DSIN  AND  4-ZEROS  BOTH  ARE 
HIGH  AT  THE  SAME  TIME 

ERROR       =     QO  *  /Ql  *  /Q2  *     Q3  ;  STATE  14 

(3)   HDB3  ENCODING  STATES  MACHINE 

:=  /qo  *  /Ql  *  Q2  *  /MR  ;  MSB  STATE  VARIABLE 
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+  /QO  *  /Q2  *  Q3  *  /ZEROS  *  /MR 

+  DSIN  *  /QO  *  ZEROS  *  /MR 

+  DSIN  *  Q2  *  Q3  *  /MR 

+  DSIN  *  /Ql  *  /Q2  *  /Q3  *  /MR 

+  DSIN  *  /Ql  *  ZEROS  *  /MR 

+  /Ql  *  Q2  *  Q3  *  /MR 

+  DSIN  *  /Q3  *  ZEROS  *  /MR 

+  QO  *  Ql  *  Q2   *  /Q3   *  /MR 

+  QO  *  Q2   *  Q3   *  /ZEROS  *  /MR 


Q2  :=  /DSIN  *  /QO  *  /Ql  *  Q2  *  /MR 

+  /DSIN  *  QO  *  Ql  *  Q2   *  /Q3   *  /MR 

+  /QO  *  /Ql  *  Q2  *  /ZEROS  *  /MR 

+  /Ql  *  Q2  *  Q3  *  /ZEROS  *  /MR 

+  QO  *  Ql  *  Q2  *  /Q3  *  /ZEROS  *  /MR 

+  DSIN  *  /QO  *  /Q2  *  Q3  *  /ZEROS  *  /MR 

+  DSIN  *  QO  *  Q2  *  Q3  *  /ZEROS  *  /MR 

+  /DSIN  *  /QO  *  Q3  *  ZEROS  *  /MR 

+  /DSIN  *  /Ql  *  /Q2  *  /Q3  *  ZEROS  *  /MR 

+  /DSIN  *  Q2  *  Q3  *  ZEROS  *  /MR 


Ql  :=  /DSIN  *  /QO  *  Ql  *  /Q3  *  /MR 

+  /DSIN  *  QO  *  /Ql  *  Q2  *  /MR 
+  /QO  *  Ql  *  /Q3  *  /ZEROS  *  /MR 
+  QO  *  /Ql  *  Q2  *  /ZEROS  *  /MR 
+  /QO  *  /Q2  *  Q3  *  /ZEROS  *  /MR 
+  QO  *  Q2   *  Q3   *  /ZEROS  *  /MR 
+  /DSIN  *  /QO  *  /Q2  *  Q3  *  /MR 
+  /DSIN  *  QO  *  Q2  *  Q3  *  /MR 


QO  :=  DSIN  *  /QO  *  ZEROS  *  /MR  ;   LSB  STATE  VARIABLE 

+  DSIN  *  /Ql  *  ZEROS  *  /MR 
+  /Ql  *  Q2  *  Q3  *  /MR 
+  Ql  *  /Q2  *  /Q3  *  /MR 
+  DSIN  *  Q2  *  ZEROS  *  /MR 
+  ODDEVEN  *  /QO  *  Q3  *  ZEROS  *  /MR 
+  ODDEVEN  *  /QO  *  /Q2  *  ZEROS  *  /MR 

SIMULATION 

TRACE_ON  MR  CLK  DSIN  ZEROS  ODDEVEN  Q3  Q2  Ql  QO  RESET  ERROR 

SETF  OE  MR  DSIN     ZEROS     ODDEVEN       ; RESET  CONDITION 
CLOCKF  CLK 


SETF  /MR 

SETF  DSIN 
CLOCKF  CLK 


ZEROS  /ODDEVEN 


SETF  /DSIN  /ZEROS  /ODDEVEN 
CLOCKF  CLK 

SETF  DSIN  /ZEROS  /ODDEVEN 
CLOCKF  CLK 


; VECTOR  1,  STATE  14,  1001 
; ERROR 

; VECTOR  2,    STATE  0,  0000 


; VECTOR  3,  STATE  1,  1000 
;  PHDB 
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SETF  /DSIN 
CLOCKF  CLK 

SETF  /DSIN 
CLOCKF  CLK 

SETF  /DSIN 
CLOCKF  CLK 

SETF  /DSIN 
CLOCKF  CLK 

SETF  DSIN 
CLOCKF  CLK 

SETF  DSIN 
CLOCKF  CLK 

SETF  /DSIN 
CLOCKF  CLK 

SETF  /DSIN 
CLOCKF  CLK 

SETF  /DSIN 
CLOCKF  CLK 


SETF 


CLOCKF 


/DSIN 
CLK 


SETF 
CLOCKF 

SETF 
CLOCKF 


DSIN 
CLK 

DSIN 
CLK 


SETF  /DSIN 
CLOCKF  CLK 

SETF  /DSIN 
CLOCKF  CLK 

SETF  /DSIN 
CLOCKF  CLK 

SETF  /DSIN 
CLOCKF  CLK 

SETF  DSIN 
CLOCKF  CLK 


SETF 


CLOCKF 


SETF 


CLOCKF 


/DSIN 
CLK 


/DSIN 
CLK 


ZEROS 
/ZEROS 
/ZEROS 
/ZEROS 
/ZEROS 
/ZEROS 

ZEROS 
/ZEROS 
/ZEROS 
/ZEROS 
/ZEROS 
/ZEROS 

ZEROS 
/ZEROS 
/ZEROS 
/ZEROS 
/ZEROS 

ZEROS 
/ZEROS 


ODDEVEN 
ODDEVEN 
ODDEVEN 
ODDEVEN 
/ODDEVEN 
ODDEVEN 
/ODDEVEN 
/ODDEVEN 
/ODDEVEN 
/ODDEVEN 
/ODDEVEN 
ODDEVEN 
/ODDEVEN 
/ODDEVEN 
/ODDEVEN 
/ODDEVEN 
/ODDEVEN 
ODDEVEN 
ODDEVEN 


,-VECTOR  4,   STATE  8,  0111 


; VECTOR  5,   STATE  5,  1100 


,-VECTOR  6,    STATE  6,  1101 


; VECTOR  7,   STATE  7,  1111 
; PHDB,  RESET 

; VECTOR  8,   STATE  3,  1110 
;  NHDB 

; VECTOR  9,   STATE  1,  1000 
;  PHDB 

,-VECTOR  10,    STATE  9,  0110 
;NHDB,  RESET 

,-VECTOR  11,    STATE  10,  0010 


; VECTOR  12,    STATE  11,  0011 


,-VECTOR  13,  STATE  12,  0001 
,-NHDB,  RESET 

,-VECTOR  14,   STATE  1,  1000 
;  PHDB 

,-VECTOR  15,   STATE  3,  1110 
;NHDB 

,-VECTOR  16,    STATE  4,  0100 
;PHDB,  RESET 

,-VECTOR  17,   STATE  5,  1100 


,-VECTOR  18,    STATE  6,  1101 


,-VECTOR  19,    STATE  7,  1111 
,-PHDB,  RESET 

,-VECTOR  20,    STATE  3,  1110 
;  NHDB 

,-VECTOR  21,   STATE  13,  0101 


,-VECTOR  22,    STATE  10,  0010 
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SETF  /DSIN  /ZEROS  ODDEVEN 
CLOCKF  CLK 

SETF  /DSIN  /ZEROS  ODDEVEN 
CLOCKF  CLK 

TRACE  OFF 


; VECTOR  23,    STATE  11,  0011 

; VECTOR  24,  STATE  12,  0001 
; NHDB ,  RESET 


PALASM  SIMULATION,  V2.22  -  MARKET  RELEASE  (11-19-86) 
(C)    -  COPYRIGHT  MONOLITHIC  MEMORIES  INC,  1986 
PALASM  SIMULATION  SELECTIVE  TRACE  LISTING 
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Title        HDB3  shift  register 
Pattern    HDB3EN3 . pds 
Revision  A 
Author      Cindy  Lee 
Company    Monothic  Memories  Inc. 
ate  2/24/87 


Santa  Clara,  Ca 


HP  HDB3EN3  PAL16R4 

?IN1  PIN2     PIN3  PIN4  PIN5  PIN6  PIN7  PIN8  PIN9  PIN10 
CLK      NRZIN  QINO  QIN1  QIN2  QIN3     MR      NC       NC  GND 

PIN11  PIN12  PIN13  PIN14  PIN15  PIN16  PIN17  PIN18  PIN19  PIN20 
/OE  NC         NC       DSIN       Q2         Ql         QO       NHDB     PHDB  VCC 

INPUT  SIGNALS 

CLK       :   EXTERNAL  CLOCK 
NRZIN  :   NON-RETURN-ZERO  INPUT  SIGNAL 
QINO,   QIN1,    QIN2,   QIN3    :    FOUR  STATE  VARIABLE  INPUTS 
MR         :  MASTER  RESET 

/OE       :  ASSERTIVE  LOW  OUTPUT  ENABLE 

OUTPUT  SIGNALS 
DSIN,  Q2,   Ql,   QO   :   4-BIT  SHIFT  REGISTER  OUTPUT  SIGNALS 
PHDB     :   POSITIVE  HDB3  SIGNAL 
NHDB     :  NEGATIVE  HDB3  SIGNAL 


EQUATIONS 

;16R4  GENERATES  TWO  HDB  SIGNALS  AND  PROVIDES  A  4-BIT  SHIFT  REGISTER 
1)   A  POSITIVE  HDB3  AND  A  NEGATIVE  HDB3  SIGNALS  ARE  GENERATED 


/PHDB       =  QINO  *  /QIN1 

+  /QINO  *  QIN2  *  QIN3 
+  QIN1  *  QIN2  *  /QIN3 
+  QIN1  *  /QIN2 
+  /QIN2  *  /QIN3 

/NHDB       =  /QINO  *  /QIN1 
+     QIN1  *  /QIN2 
+  /QIN2  *  QIN3 
+  QINO  *  QIN2 


THE  PHDB  IS  GENERATED  AT 
STATE  1,   STATE  4,   OR  STATE  7 
THIS  EQUATION  IS  THE  INVERTING 
LOGIC  OF  THE  PHDB  SIGNAL 


THE  NHDB  IS  GENERATED  AT 
STATE  3,   STATE  9,   OR  ATATE  12 
THIS  EQUATION  IS  THE  INVERTING 
LOGIC  OF  THE  NHDB  SIGNAL 


;(2)   THIS  IS  A  4-BIT  SHIFT  REGISTER  WITH  MASTER  RESET 

/QO 


:=  /NRZIN 
+  MR 


;  LSB  OF  SHIFT  REGISTER 
;  MASTER  RESET 


=  /QO 
+  MR 


/Ql 

/Q2         :=  /Ql 
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MR 


/DSIN     :=  /Q2 
+  MR 


;  MSB  OF  SHIFT  REGISTER 


SIMULATION 

TRACE_ON  MR  CLK  QINO  QIN1  QIN2  QIN3  PHDB  NHDB  NRZIN  DSIN 

SETF     OE  MR     QINO  /QIN1  /QIN2     QIN3     NRZIN  ; RESET  CONDITION 

CLOCKF  CLK 


SETF  /MR  QINO  /QIN1  /QIN2  QIN3  NRZIN 
CLOCKF  CLK 

SETF  /MR  QINO  /QIN1  /QIN2  QIN3  NRZIN 
CLOCKF  CLK 

SETF  /MR  /QINO  /QIN1  /QIN2  /QIN3  /NRZIN 
CLOCKF  CLK 

SETF  /MR  /QINO  /QIN1  /QIN2  QIN3  NRZIN 
CLOCKF  CLK 

SETF  /MR  /QINO  QIN1  /QIN2  QIN3  /NRZIN 
CLOCKF  CLK 

SETF  /MR  /QINO  /QIN1  QIN2  QIN3  /NRZIN 
CLOCKF  CLK 

SETF  /MR  QINO  /QIN1  QIN2  QIN3  NRZIN 
CLOCKF  CLK 

SETF  /MR  QINO  QIN1  QIN2  QIN3  NRZIN 
CLOCKF  CLK 

SETF  /MR  QINO  /QIN1  QIN2  /QIN3  /NRZIN 
CLOCKF  CLK 

SETF  /MR  /QINO  QIN1  /QIN2  /QIN3  NRZIN 
CLOCKF  CLK 

SETF  /MR  QINO  QIN1  /QIN2  /QIN3  /NRZIN 
CLOCKF  CLK 

SETF  /MR  QINO  /QIN1  /QIN2  /QIN3  NRZIN 
CLOCKF  CLK 


; VECTOR  1 
; VECTOR  2 
; VECTOR  3 


; VECTOR  4,  PHDB 
;DSIN  STARTS 

; VECTOR  5 
;DSIN 

; VECTOR  6 
;/DSIN 

; VECTOR  7 
;DSIN 

; VECTOR  8,  PHDB 
;/DSIN 

; VECTOR  9 
;/DSIN 

/VECTOR  10 
;DSIN 

; VECTOR  11 
;DSIN 

; VECTOR  12,  NHDB 
;/DSIN 


TRACE  OFF 
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Title        HDB3  decoder 
Pattern  HDB3DE.pds 
Revision  A 
Author      Cindy  Lee 

Company  Monolithic  Memories  Inc.,  Santa  Clara,  Ca 
Date  3/12/87 

CHIP  HDB3DE  PAL20RS4 


;PIN1  PIN2  PIN3  PIN4  PIN5  PIN6 
CLK      NC       PHDB  NHDB    MR  NC 


;PIN7  PIN8  PIN9  PIN10  PIN11  PIN12 
NC         NC       NC       NC         NC  GND 

;PIN13  PIN14  PIN15  PIN16  PIN17  PIN18 
/OE         ERROR  VERROR    NC        Q3  Q2 

;PIN19  PIN20  PIN21  PIN22  PIN23  PIN24 
Ql  QO         NC         DSOUT  RESET  VCC 


;   INPUT  SIGNALS 

CLK         :   EXTERNAL  CLOCK 
PHDB       :   POSITIVE  HDB3  SIGNAL 
NHDB       :  NEGATIVE  HDB3  SIGNAL 
;       MR  :  MASTER  RESET 

/OE         :  ASSERTIVE  LOW  OUTPUT  ENABLE 

OUTPUT  SIGNALS 

ERROR     :   ERROR  FLAG 
VERROR  :  BIPOLAR  VIOLATION  ERRIR 
Q3,   Q2,  Ql,   QO   :  HDB3  DECODING  STATE  VARIABLE  OUTPUTS 
DSOUT     :   DATA  SERIAL  OUTPUT 
RESET     :  RESET  SIGNAL 


EQUATIONS 

;THE  20RS4  GENERATES  ONE  SIGNAL,   TWO  ERROR  FLAGS 
;  AND  A  HDB3  DECODING  STATE  MACHINE 

;(1)   A  RESET  SIGNAL  IS  GENERATED  TO  CLEAR 
THE  4-BIT  SHIFT  REGISTER 

RESET     =     QO  *  Q2  *  /Q3  *  /PHDB  *  NHDB       ;  STATE  8  OR  STATE  9 

;  WITH  /PHDB  *  NHDB 
+     Ql  *  /Q2   *  Q3   *  PHDB  *  /NHDB        ;   STATE  3  OR  STATE  4 

;  WITH  PHDB  *  /NHDB 

2)   A  VERROR  SIGNAL  IS  GENERATED  TO  INDICATE  THE  VIOLATION  ERROR 

VERROR  =     /QO  *     Ql  *     Q2  *     Q3  ;  STATE  11 

+       QO  *  /Ql  *     Q2  *     Q3  ;  STATE  12 
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CLOCKF  CLK 

SETF  /MR  PHDB 
CLOCKF  CLK 

SETF  /MR  /PHDB 
CLOCKF  CLK 

SETF  /MR  /PHDB 
CLOCKF  CLK 

SETF  /MR  PHDB 
CLOCKF  CLK 

SETF  /MR  /PHDB 
CLOCKF  CLK 

SETF  /MR  /PHDB 
CLOCKF  CLK 

SETF  /MR  /PHDB 
CLOCKF  CLK 

SETF  /MR  PHDB 
CLOCKF  CLK 

SETF  /MR  /PHDB 
CLOCKF  CLK 

SETF  /MR  PHDB 
CLOCKF  CLK 

SETF  /MR  /PHDB 
CLOCKF  CLK 

SETF  /MR  /PHDB 
CLOCKF  CLK 

SETF  /MR  /PHDB 
CLOCKF  CLK 

SETF  /MR  /PHDB 
CLOCKF  CLK 

SETF  /MR  PHDB 
CLOCKF  CLK 

SETF  /MR  /PHDB 
CLOCKF  CLK 

SETF  /MR  PHDB 
CLOCKF  CLK 

SETF  /MR  /PHDB 
CLOCKF  CLK 


NHDB  ; VECTOR 
; ERROR 

/NHDB  ; VECTOR 

/NHDB  ; VECTOR 

/NHDB  ; VECTOR 

/NHDB  ; VECTOR 

/NHDB  ; VECTOR 

/NHDB  ; VECTOR 

/NHDB  ; VECTOR 

NHDB  ; VECTOR 

/NHDB  ; VECTOR 

NHDB  ; VECTOR 

/NHDB  ; VECTOR 

/NHDB  ; VECTOR 

NHDB  ; VECTOR 

/NHDB  ; VECTOR 

NHDB  ; VECTOR 

/NHDB  ; VECTOR 

/NHDB  ; VECTOR 


1,  STATE  0,  0000 

2,  STATE  0,  0000 

3,  STATE  0,  0000 

4,  STATE  1,  1000 

5,  STATE  2,  1001 

6,  STATE  3,  1011 

7,  STATE  4,  1010 

8,  STATE  5,  0010 
; RESET 

9,  STATE  6,  0110 

10,  STATE  1,  1000 

11,  STATE  6,  0110 

12,  STATE  7,  0100 

13,  STATE  8,  0101 

14,  STATE  10,  1111 
; RESET 

15,  STATE  1,  1000 

16,  STATE  6,  0110 

17,  STATE  1.  1000 

18,  STATE  2,  1001 


il  MonoUtHicKB  Memories  £1 


2-401 


HDB3  Line  Coding  Using  PAL  Devices 


ERROR 


Q3 


(3)   A  ERROR  SIGNAL  IS  GENERATED  WHEN  PHDB  AND  NHDB  BOTH  ARE  HIGH 
AT  THE  SAME  TIME, 


NHDB  *  PHDB 


;  BOTH  PHDB  AND  NHDB 
;  ARE  HIGH 


(4)  A  DATA  SERIAL  OUTPUT  SIGNAL  IS  GENERATED 

DSOUT     =  NHDB 
+  PHDB 

(5)  THE  HDB3  DECODING  STATE  MACHINE 


Q2 


Ql 


QO 


/NHDB 
/NHDB 
/NHDB 
/NHDB 
/NHDB 

/NHDB  *  PHDB  *  QO 
NHDB  *  /PHDB  *  Q2 
NHDB  *  /PHDB  *  QO 


/PHDB  *  /Q2  *  Q3  *  /MR 
PHDB  *  /QO  *  /Q3  *  /MR 
/QO  *  Ql  *  /Q2  *  /Q3  *  /MR 
/QO  *  Ql  *  Q2  *  Q3  *  /MR 
/Ql  *  /Q2  *  Q3  *  /MR 

*  Q2  *  /MR 

*  /Q3  *  /MR 

*  Q2  *  /MR 


MSB  STAE  VARIABLE 


/NHDB  *  PHDB 
/NHDB  *  PHDB 
/NHDB  *  PHDB 
NHDB  *  /PHDB 
NHDB  *  /PHDB 
NHDB  *  /PHDB 
/PHDB  *  Q2  * 
/PHDB  *  QO  * 


*  /QO  *  Ql  *  /Q2   *  /Q3   *  /MR 

*  /QO  *  Ql  *  Q2  *  Q3  *  /MR 

*  /Ql  *  /Q2  *  Q3  *  /MR 

*  /QO  *  /Q3  *  /MR 

*  Ql  *  Q3  *  /MR 

*  /Q2  *  Q3  *  /MR 
/Q3  *  /MR 

Q2  *  /MR 


:=  /NHDB  *  /PHDB  *  QO  *  Q3  *  /MR 


+  /NHDB  *  PHDB 
+  /NHDB  *  PHDB 
+  /NHDB  *  PHDB 
+  NHDB  *  /PHDB 
+  NHDB  *  /PHDB 
+  /PHDB  *  QO  * 
+  /PHDB  *  Ql  * 


*  /Q2 

*  /QO 

*  /QO 

*  /QO 

*  /QO 
/Q2  * 
/Q2  * 


Q3  *  /MR 
Ql  *  /Q2  *  /MR 
Ql  *  Q3  *  /MR 
/Q2  *  /MR 
Ql  *  Q3  *  /MR 
Q3  *  /MR 
Q3  *  /MR 


/MR 


LSB  STATE  VARIABLE 


+  /PHDB  *  QO  *  Q2  *  /Q3  *  /MR 


:=  /NHDB  *  /PHDB  *  /Ql  *  /Q2  *  Q3 
+  NHDB  *  /PHDB  *  Q2  *  /Q3  *  /MR 
+  /PHDB  *  /Ql  *  Q2  *  /Q3  *  /MR 
+  /PHDB  *  QO  *  Q2  *  /Q3  *  /MR 
+  NHDB  *  /PHDB  *  QO  *  Q2  *  /MR 


SIMULATION 

TRACE_ON  MR  CLK  PHDB  NHDB  DSOUT  Q3  Q2  Ql  QO  RESET  VERROR  ERROR 
SETF     OE  MR     PHDB     NHDB  ; RESET  CONDITION 
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SETF  /MR  /PHDB  /NHDB 
CLOCKF  CLK 

SETF  /MR  PHDB  /NHDB 
CLOCKF  CLK 

SETF  /MR  /PHDB  NHDB 
CLOCKF  CLK 

SETF  /MR  /PHDB  /NHDB 
CLOCKF  CLK 

SETF  /MR  /PHDB  /NHDB 
CLOCKF  CLK 

SETF  /MR  /PHDB  NHDB 
CLOCKF  CLK 

SETF  /MR  /PHDB  NHDB 
CLOCKF  CLK 

TRACE  OFF 


.•VECTOR  19,   STATE  3,  1011 

.-VECTOR  20,   STATE  5,  0010 
; RESET 

,-VECTOR  21,   STATE  6,  0110 

.-VECTOR  22,   STATE  7,  0100 

;VECTOR  23,   STATE  8,  0101 

,-VECTOR  24,   STATE  9,  0111 

,-VECTOR  25,   STATE  10,  1111 
,- RESET 
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AN-170 


th  the  rapid  increase  in  popularity  of  T1  trunk  links  following 
deregulation  of  AT&T,  the  transparent  nature  of  the  24  data 
must  be  supported  by  handling  of  the  framing  bits, 
must  comply  with  certain  standards,  the  most  common 
which  is  known  as  D4. 


W 
the 

channels 
These 
of 


T1 -Carriers 

More  than  100  million  miles  of  T-carrier  lines  are  installed  in  the 
U.S.  and  Canada.  They  are  full-duplex  digital  communication 
links  with  a  specific  bit  rate  and  format.  T-carrier  lines' 
transmission  quality  and  reliability  exceeds  that  of  analog 
lines.  T-carrier  lines  are  available  in  a  range  of  transmission 
rates  between  1 .544  Mbps  and  274  Mbps,  designated  T1 ,  T1 C, 
T2,  T3,  and  T4. 

The  most  common  standard,  T1  can  be  considered  a  building 
block  for  the  others.  T1  transmits  at  1.544  Mbps.  AT&T 
specifications  define  D1.  D1D,  D2,  D3,  D4  and  ESF  framing 
structures  for  T1  carriers.  These  standards  specify  the 
framing-bit  pattern  and  frame  synchronization.  D1  was 
established  in  1969  and  has  been  superseded  by  the  later 
standards.  D1 D,  D2,  D3,  and  D4  framing  formats  group  twelve 
frames  into  a  superframe.  Today,  the  D4  framing  format  is  the 
most  prevalent.  The  future  format  is  ESF,  also  called  Fe,  which 
groups  twenty-four  frames  into  an  extended  superframe.  It  is 
presently  under  definition,  and  will  be  supported  in  the  future. 


04 


Framing  Format 

The  T1  standard  for  D4  framing  defines  a  superframe  as  a 
group  of  twelve  frames.  A  frame  consists  of  twenty-four  8-bit 
channels  plus  one  framing  bit  for  a  total  of  193  bits.  The  8-bit 
channel  consists  of  one  polarity  bit  and  seven  magnitude  bits; 
however,  in  frames  six  and  twelve,  the  eighth  bit  in  each 
channel  carries  signaling  information.  Since  this  bit  cannot 
transmit  user  data  (either  voice  or  data)  it  is  called  robbed-bit 
signaling.  The  signaling  bit  transmits  requests  for  services  and 
calling  information.  At  the  1.544  Mbps  transmission  rate,  8 
kbps  are  overhead  for  framing;  so  the  user  transmission  rate  is 
1 .536  Mbps.  Because  the  user  transmission  rate  is  divided 
into  twenty-four  channels,  each  channel  transmits  at  64  kbps. 

A  frame  bit  should  occur  every  193rd  bit  in  the  data  stream. 
Frame  detection  occurs  when  a  sequence  of  sampled  frame 
bits  matches  the  frame  pattern.  However,  occasionally  data 
can  produce  the  same  pattern.  False  framing  occurs  when  the 
frame  synchronization  circuitry  mistakes  data  bits  for  the 
framing  bits. 

A  frame-bit  error  occurs  when  the  incoming  frame  bit  does  not 
match  the  expected  framing  bit.  If  two  out  of  four  consecutive 
frame  bits  are  in  error,  then  frame  synchronization  is  "lost." 
Once  frame  synchronization  is  "lost,"  a  search  for  the  frame 
pattern  is  started.  AT&T  T1  standards  specify  the  maximum 
time  to  search  and  re-synchronize  as  50  ms.  This  time  period 
is  called  the  reframe  time. 

For  the  D4  frame  format,  the  framing  bits  are  divided  into  two 


groups:  Ft  and  Fs.  The  terminal  framing  bit  (Ft)  identifies  frame 
boundaries.  The  signaling  frame  bit  (Fs)  identifies  the 
superframe  boundary  and  the  frames  containing  signaling  bits. 
Ft  and  Fs  are  interleaved  to  create  the  12-bit  framing  pattern 
shown  in  Table  1  This  pattern  is  repeated  every  twelve  frames. 


FRAME 

TERMINAL 
FRAMING 
Fl 

SIGNALING 
FRAMING 
Fs 

SUPER- 
FRAMING 
SF 

1 

1 

1 

2 

0 

0 

3 

0 

0 

4 

0 

0 

5 

1 

1 

6 

1 

1 

7 

0 

0 

8 

1 

1 

9 

1 

1 

10 

1 

1 

11 

0 

0 

12 

0 

0 

Table  1.  Framing  Pattern 

Hardware  Implementation 

The  frame  synchronization  logic  was  implemented  in  PAL 
devices  because  current  single-chip  solutions  do  not  offer 
enough  flexibility  in  line  coding.  Using  PAL  devices,  this 
design  has  the  flexibility  to  interface  various  line  coding 
devices. 

The  PAL  devices  implement  the  frame  synchronization  logic, 
which  can  be  divided  into  three  sections.  The  first  section 
detects  the  D4  framing  pattern.  The  second  section  searches 
for  frame  synchronization.  The  third  section  provides  the  su- 
perframe timing.  These  are  implemented  using  four  CMOS  PAL 
devices,  a  shift  register,  and  a  counter  as  shown  in  Figure  1 . 

The  frame  detection  hardware  samples  every  1 93rd  bit  of  the 
data  stream.  The  data  is  sampled  on  the  positive  edge  of 
T1_CK.  Either  a  shift  register  or  a  state  machine  can  be  used 
to  sample  the  framing  bits.  The  data  stream  is  shifted  into  a 
(193x23)-bit  shift  register  with  taps  every  193  bits.  The  taps 
are  fed  into  two  identical  Frame  Pattern  Detection  (FPD)  PAL 
devices.  The  more  taps  sampled,  the  less  likely  a  data  pattern 
will  reproduce  the  framing  pattern.  In  this  design,  twenty-four 
taps  ensure  reasonable  prevention  of  false  framing. 
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Each  FPD  device  monitors  twelve  framing  bits.  If  the  framing 
pattern  is  found,  a  FP  DETB  signal  is  generated.  From  these 
signals,  the  controller  determines  the  correct  framing  pattern. 
The  FPD's  FRM1 B  indicates  the  start  of  a  superframe. 

The  first  FPD's  inputs  are  the  first  eleven  taps  from  the  shift 
register  plus  the  data  input  stream.  The  second  FDP  monitors 
the  remaining  taps  from  the  shift  register  as  shown  in  Figure  1, 

The  control  section  is  a  controller  and  a  counter,  which  search 
for  frame  synchronization.  The  controller,  SYNC  PAL,  uses  the 
193-bit  counter  to  determine  if  the  frame  detection  signals  are 
spaced  every  193  bits.  First  the  controller  searches  for  the 
first  framing  pattern.  During  the  search  operation,  SYNC  PAL 


waits  for  the  FPD's  frame  detection  signals.  When  a  framing 
pattern  is  found,  the  counter  is  reset.  If  193  bits  later,  the  next 
expected  framing  bit  is  found,  the  controller  is  "in  sync."  It 
continues  checking  the  framing  pattern  every  193  bits.  If  the 
framing  pattern  is  not  found,  it  assumes  that  the  incoming 
framing  bit  is  in  error.  If  the  next  four  framing  bits  are  correct, 
the  controller  remains  "in  sync."  If,  during  this  time,  a  second 
frame-bit  error  occurs,  the  framing  is  "lost."  The  controller, 
then,  asserts  the  lost  frame  signal,  NO_FRM.  Once  the  framing 
is  "lost,"  the  controller  begins  searching  for  the  correct  frame 
pattern.  While  the  controller  searches  for  the  correct  frame 
pattern,  the  previous  frame  synchronization  is  maintained  until 
the  controller  re-synchronizes. 


FRM  BIT 


193x23  SHIFT  REGISTER 


12. 


SRD(1-11) 


SRD(O) 


FPD  PAL 


■ FPD1  FPD2 


SYNC  PAL 
PALC20R4Z 


CB(2-0) 


193  COUNTER 


SRD(0-11) 
FPD  PAL 
PALC16L8Z 


NO_FRM 
BYT_CK 
SUNK 


SUPER 
FRAME 
PAL 
PALC16R6Z 


Figure  1.  Frame  Synchronization  Logic 
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Figure  2  shows  the  sync,  state  machine  for  the  controller. 
Signals  FPD1 B  and  FPD2B  (from  the  FPDs)  indicate  a  T1  frame 
pattern  detection,  while  SOF  (from  the  counter)  indicates  193 
counts.  Beginning  in  states  F  and  G,  the  controller  searches 
for  the  frame  pattern.  Once  the  framing  has  been  found,  the 
state  machine  enters  state  A.  At  state  A  SUNK  is  asserted, 
which  indicates  "in  sync."  The  state  machine  remains  in  state  A 
urjtil  a  single  frame-bit  error  occurs.  Upon  the  first  frame-bit 
error,  the  state  machine  enters  state  B.  If  the  next  four  framing 
bits  are  correct,  the  state  machine  sequences  through  states 
C,  D,  and  E,  which  returns  to  state  A.  However,  if  during  this 
time,  a  second  frame  bit  error  occurs,  the  state  machine  enters 
state  F.  Loss  of  T1  carrier  (NO_T1CB)  or  reset  (RSTB)  forces 
the  state  machine  into  the  searching  state. 

SYNC  PAL  performs  two  other  miscellaneous  functions.  When 
a  single  framing-bit  error  is  encountered,  the  device  corrects 
the  bit.  The  corrected  version  of  the  input  data  stream,  BIT,  is 
fed  into  the  shift  register.  The  device  also  generates  the 
BYT  CK.  A  single  pulse  is  asserted  for  every  channel  for 
external  serial-to-parallel  (S/P)  data  conversion.  For  this  imple- 
mentation BYTCK  is  asserted  when  the  counter's  three  least 


significant  bits  are  all  zeros.  Depending  on  the  requirements  of 
the  S/P  converter,  any  combination  of  the  counter's  three  least 
significant  bits  could  generate  BYT_CK  for  the  correct 
alignment  with  the  channel  sample. 

SUPER  FRAME  PAL  generates  the  superframe  timing.  It 
generates  FRM_6  and  FRM_12  by  counting  the  frames.  These 
signals  indicate  the  frames  containing  the  signaling  bits.  The 
CMOS  PAL  device  implements  a  4-bit  counter.  When  the  frame 
is  "in  sync,"  SOF  increments  the  counter,  while  FRM_1  clears 
it.  If  the  controller  is  "out  of  frame  sync,"  the  counter  continues 
counting,  based  on  the  last  known  frame  position. 


Design  Recommendations 

While  this  design  meets  the  D4  specification,  one  design 
improvement  might  be  useful.  A  signal  flagging  single-framing 
bit  errors  could  be  added.  Monitoring  this  signal  indicates  the 
amount  of  random  noise  on  the  line.  This  signal  could  warn  that 
a  line  is  noisy  even  though  frame  synchronization  was  never 
lost. 


FPD1B-SQF  +  FPD2B-SOF 


SOF  ( 


BIT  =  DATA 
LOB  =  SOF 
SUNK  b  1 
NO_FRM  =  0 


FPD1B  *  FPD2B  '  SOF 


FPD1 B  *  FPD2B  *  SOF 


FPD1B*SOF* 
.  FPD2B  •  SOF 


FPD1B "  SOF  + 
FPD2B  •  SOF 


SOF 


FPD1 B  *  SOF  + 
FPD2B  •  SOF 


SOF 


SOF 


BfT  =  DATj  _ 
LDB  =  FPD1B'FPD2B  +  SOF 
SUNK  =  0 
NO_FRM  =  1 


FPD1 B  +  FPD2B 
FPDTB  •  FPD2B  '  SOF 


FPD1 B  1  SOF  + 
FPD2B  ■  SOF 


SUNK  =  1 
NO_FRM  =  t) 


BfT  =  DATA 
LDB  =  SOF 
SUNK  =  1 
NO_FRM  b  0 


NOTE:  "BITA  = FPD1B  *  FPD2B  ■  SOF  *  DATA  + 

SOF*  DATA  +  

FPD1B  *  SOF  *  DATA  + 
FPD2B  *  SOF '  DATA 


Figure  2.  Sync  Slate  Machine 
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TITLE  FRAME_DETECTION_PAL 

PATTERN     Tl  FRAME  DETECTION  PAL  FOR  Tl  INTERFACE 
REVISION  PI. 04 

AUTHOR  STEVE  PATTERSON  AND  THERESA  SHAFER 

COMPANY 

DATE  6/4/87 

This  PAL  device  monitors  12  193rd  bits  in  the  incoming  Tl  NRZ  data  s 
It  detects  any  valid  Frame  Pattern  (start  of  any  Frame)  and  the  start 
;     of  Frame  1. 

CHIP  FPD  PAL16L8 


;  PINS 
;i 

SRDO 
;ii 

SRD9 


2 

SRD1 
12 

FRM1B 


3 

SRD2 
13 

SRD10 


4 

SRD3 
14 

SRD11 


5 

SRD4 

15 

NC 


6  7 
SRD5  SRD6 
16  17 
FRM1  6B  NC 


8 

SRD7 

18 

SOF 


9  10 
SRD8  GND 
19  20 
FP  DETB  VCC 


INPUTS : 


OUTPUTS : 


SRD(ll-O) 
SOF 

FP_DETB 
FRM1B 
FRM1  6B 


FRAMING  BITS 

LAST  KNOWN  START  OF  FRAME 

ACTIVE  LOW  SIGNAL  INDICATING  FRAMING 

PATTERN  DETECTED 
ACTIVE  LOW  SIGNAL  INDICATING  START  OF 

FRAME  1 

ACTIVE  LOW  SIGNAL  INDICATING  FRAME  1 
TO  6  DETECTED 


FRAMING  PATTERNS 


FRM_1 

FRM_2 

FRM_3 

FRM_4 

FRM_5 

FRM_6 

FRM_7 

FRM_8 

FRM_9 

FRM_10 

FRM_11 

FRM  12 


S  S  S 

R  R  R 

D  D  D 

119 

1  0 

1,0,0, 

0,1,0, 

0,0,1, 

1,0,0, 

1,1,0, 

1,1,1, 

0,1,1, 
1,0,1, 
1,1,0, 
0,1,1, 
0,0,1, 
0,0,0, 


SSSSSSSSS 
RRRRRRRRR 
DDDDDDDDD 
876543210 


0,1, 
0,0, 
0,0, 
1,0, 
0,1, 
0,0, 
1,0, 
1,1, 
1,1, 
0,1, 
1,0, 

1,1, 


1,0, 

1,1, 

0,1, 
0,0, 
0,0, 
1,0, 
0,1, 
0,0, 
1,0, 

1,1, 
1,1, 

0,1, 


1,1, 

0,1, 
1,0, 

1,1, 

0,1, 
0,0, 
0,0, 
1,0, 
0,1, 
0,0, 
1,0, 

1,1, 


1,0,0 
1,1,0 

1,1,1 

0,1,1 
1,0,1 
1,1,0 
0,1,1 
0,0,1 
0,0,0 
1,0,0 
0,1,0 
0,0,1 


EQUATIONS 


/FRM1B  =  SOF  *  SRD11  *  /SRD10  *  /SRD9  *  /SRD8  *  SRD7  *  SRD6  *  /SRD5 

*  SRD4  *  SRD3  *  SRD2  *  /SRD1  *  /SRDO 

/FRM1_6B  =  SRD11  *  /SRD10  *  /SRD9  *  /SRD8  *  SRD7  *  SRD6  *  /SRD5     ;  FRM  1 

*  SRD4  *  SRD3  *  SRD2  *  /SRD1  *  /SRDO 


Memories  & 
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/SRDll  *  SRD10  *  /SRD9  *  /SRD8  *  /SRD7  *  SRD6  *  SRD5 

*  /SRD4  *  SRD3  *  SRD2  *  SRD1  *  /SRDO 

/SRDll  *  /SRD10  *  SRD9  *  '/SRD8  *  /SRD7  *  /SRD6  *  SRD5 

*  SRD4  *  /SRD3  *  SRD2  *  SRD1  *  SRDO 

SRDll  *  /SRD10  *  /SRD9  *  SRD8  *  /SRD7  *  /SRD6  *  /SRD5 

*  SRD4  *  SRD3  *  /SRD2  *  SRD1  *  SRDO 

SRDll  *  SRDIO  *  /SRD9  *  /SRD8  *  SRD7  *  /SRD6  *  /SRD5 

*  /SRD4   *  SRD3   *  SRD2   *  /SRD1  *  SRDO 

SRDll   *  SRDIO  *  SRD9  *  /SRD8   *  /SRD7   *  SRD6   *  /SRD5 

*  /SRD4  *  /SRD3  *  SRD2  *  SRD1  *  /SRDO 


*  SRD8  *  /SRD7 

*  /SRD2  *  SRD1 


/FP_DETB  =  /FRM1_6B 

+  /SRDll  *  SRDIO  *  SRD9 

*  /SRD4  *  /SRD3 
+  SRDll  *  /SRDIO  *  SRD9  *  SRD8  *  SRD7  * 

*  SRD4  *  /SRD3  *  /SRD2  *  /SRD1 
+  SRDll  *  SRDIO  *  /SRD9  *  SRD8  *  SRD7  * 

*  /SRD4  *  SRD3  *  /SRD2  *  /SRD1 
+  /SRDll  *  SRDIO  *  SRD9  *  /SRD8  *  SRD7 

*  /SRD4  *  /SRD3  *  SRD2  *  /SRD1 
+  /SRDll  *  /SRDIO  *  SRD9  *  SRD8  *  /SRD7 

*  SRD4  *  /SRD3  *  /SRD2  *  SRD1  * 
+  /SRDll  *  /SRDIO  *  /SRD9  *  SRD8  *  SRD7 

*  SRD4  *  SRD3  *  /SRD2  *  /SRD1  * 


*  /SRD6  *  SRD5 

*  SRDO 

/SRD6  *  /SRD5 

*  SRDO 

SRD6  *  /SRD5 

*  /SRDO 

*  SRD6  *  SRD5 

*  /SRDO 

*  SRD6  *  SRD5 
/SRDO 

*  /SRD6  *  SRD5 
SRDO 


FRM  2 
FRM  3 
FRM  4 
FRM  5 
FRM  6 

FRM  1-6 
FRM  7 

FRM  8 

FRM  9 

FRM  10 

FRM  11 

FRM  12 


SIMULATION 
TRACE_ON 

SOF  SRDll  SRDIO  SRD9  SRD8  SRD7  SRD6  SRD5  SRD4  SRD3  SRD2  SRD1  SRDO 
FP_DETB  FRM1B  FRM1_6B 

SETF  SOF 


SETF 

;   FRAME  1 

SRDll 

/SRDIO  /SRD9 

/SRD8 

SRD7 

SRD6 

/SRD5 

SRD4 

SRD3 

SRD2 

/SRD1  /SRDO 

SETF 

;   FRAME  2 

/SRDll 

SRDIO  /SRD9  /SRD8  /SRD7 

SRD6 

SRD5  /SRD4 

SRD3 

SRD2 

SRD1  /SRDO 

SETF 

;  FRAME  3 

/SRDll 

/SRDIO 

SRD9 

/SRD8 

/SRD7 

/SRD6 

SRD5 

SRD4 

/SRD3 

SRD2 

SRD1  SRDO 

SETF 

;  FRAME  4 

SRDll 

/SRDIO 

/SRD9 

SRD8 

/SRD7 

/SRD6 

/SRD5 

SRD4 

SRD3 

/SRD2 

SRD1  SRDO 

SETF 

;  FRAME  5 

SRDll 

SRDIO 

/SRD9 

/SRD8 

SRD7 

/SRD6 

/SRD5 

/SRD4 

SRD3 

SRD2 

/SRD1  SRDO 

SETF 

;  FRAME  6 

SRDll 

SRDIO 

SRD9 

/SRD8 

/SRD7 

SRD6 

/SRD5 

/SRD4 

/SRD3 

SRD2 

SRD1  /SRDO 

SETF 

;   FRAME  7 

/SRDll 

SRDIO 

SRD9 

SRD8 

/SRD7 

/SRD6 

SRD5 

/SRD4 

/SRD3 

/SRD2 

SRD1  SRDO 

SETF 

;   FRAME  8 

SRDll 

/SRDIO 

SRD9 

SRD8 

SRD7 

/SRD6 

/SRD5 

SRD4 

/SRD3 

/SRD2 

/SRD1  SRDO 

SETF 

;   FRAME  9 

SRDll 

SRDIO 

/SRD9 

SRD8 

SRD7 

SRD6 

/SRD5 

/SRD4 

SRD3 

/SRD2 

/SRD1  /SRDO 

SETF 

;  FRAME  10 

/SRDll 

SRDIO 

SRD9 

/SRD8 

SRD7 

SRD6 

SRD5 

/SRD4 

/SRD3 

SRD2 

/SRD1  /SRDO 
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SETF 


/SRD11 

/SRD10 

SRD9 

SETF 

/SRD11 

/SRD10 

/SRD9 

SETF 

/SRD11 

/SRD10 

/SRD9 

SETF 

/SRD11 

/SRDIO 

SRD9 

SETF 

/SRD11 

/SRD10  /SRD9 

SETF 

/SRD11 

SRDIO 

SRD9 

SETF 

SRD11 

SRDIO 

SRD9 

SETF 

SRD11 

SRDIO 

SRD9 

TRACE  OFF 


SRD8  /SRD7  SRD6 
SRD8     SRD7  /SRD6 

/SRD8  /SRD7  /SRD6 
/SRD8  /SRD7  /SRD6 
/SRD8  /SRD7  /SRD6 
/SRD8  SRD7  /SRD6 
SRD8  /SRD7  SRD6 
SRD8     SRD7  SRD6 


SRD5     SRD4  /SRD3 

SRD5     SRD4  SRD3 

/SRD5  /SRD4  /SRD3 

/SRD5  /SRD4  /SRD3 

/SRD5  /SRD4  /SRD3 

/SRD5  /SRD4  /SRD3 

SRD5     SRD4  SRD3 

SRD5     SRD4  SRD3 


;  FRAME  11 
/SRD2     SRD1  /SRDO 

;  FRAME  12 
/SRD2  /SRD1  SRDO 


;   NOT  FRAMING  PATTERN 
/SRD2  /SRD1  /SRDO 
;   NOT  FRAMING  PATTERN 
/SRD2  /SRD1  /SRDO 
;   NOT  FRAMING  PATTERN 
/SRD2  /SRD1  /SRDO 
;   NOT  FRAMING  PATTERN 
/SRD2  /SRD1  /SRDO 
;   NOT  FRAMING  PATTERN 

SRD2     SRD1  SRDO 
;   NOT  FRAMING  PATTERN 

SRD2     SRD1  SRDO 


&  MonoUiMom Memories  & 
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PALASM  SIMULATION,   V2.22  -  MARKET  RELEASE  (11-19-86) 
(C)    -  COPYRIGHT  MONOLITHIC  MEMORIES  INC,  1986 
PALASM  SIMULATION  SELECTIVE  TRACE  LISTING 


Title         :  FRAME_DETECTION_PAL  Author 
Pattern     :  Tl  FRAME  DETECTION  PAL  FCompany 
Revision  :  PI. 04  Date 


STEVE  PATTERSON  AND  THER 
6/4/87 


FPD 

Page  :  1 

gggggggggg  ggggggggg 
SOF  HHHHHHHHHH  HHHHHHHHH 

SRD11  XHLLHHHLHH  LLLLLLLHH 
SRD10  XLHLLHHHLH  HLLLLLHHH 
SRD9  XLLHLLHHHL  HHLLHLHHH 
SRD8  XLLLHLLHHH  LHHLLLLHH 
SRD7  XHLLLHLLHH  HLHLLLHLH 
SRD6  XHHLLLHLLH  HHLLLLLHH 
SRD5  XLHHLLLHLL  HHHLLLLHH 
SRD4  XHLHHLLLHL  LHHLLLLHH 
SRD3  XHHLHHLLLH  LLHLLLLHH 
SRD2  XHHHLHHLLL  HLLLLLLHH 
SRD1  XLHHHLHHLL  LHLLLLLHH 
SRDO  XLLHHHLHHL  LLHLLLLHH 
FP_DETB  XLLLLLLLLL  LLLHHHHHH 
FRM1B  XLHHHHHHHH  HHHHHHHHH 
FRM1  6B  XLLLLLLHHH  HHHHHHHHH 
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TITLE  SYNC_PAL 

PATTERN  Tl  FRAME  SYNC  PAL  FOR  Tl  INTERFACE 
REVISION  PI. 06 

AUTHOR  STEVE  PATTERSON  AND  THERESA  SHAFER 

COMPANY 

DATE  6/4/87 

;     This  PAL  device  decides  whether  the  Tl  Interface  is  in  Frame  Sync, 
;     provisional  Sync,  or  Out  of  Sync.     It  controls  the  Frame  Sync 
;     process . 


CHIP  SYNC  PAL2  0R4 
;PINS 

;12  3  4  5  6  7  8 

T1_CKB  RSTB  N0_T1CB  DATA  FPD1B  FPD2B  BC2  BC1 
;9  10  11  12 

BCO  SOF  NC  GND 


;13  14  15  16  17  18  19  20 

OEB  NC  NO_FRM  SUNK         LDB  Q0  Ql  Q2 

;21  22  23  24 

BYT  CK  BIT  NC  VCC 


INPUTS : T1_CKB 
RSTB 
NO_TlCB 
FPD(1-2)B 
BC(2-0) 
SOF 
DATA 
OEB 


ACTIVE  LOW  EXTERNAL  Tl  CLOCK 

ACTIVE  LOW  MASTER  RESET 

NO  Tl  SIGNAL  ACTIVE  LOW  INPUT 

ACTIVE  LOW  FRAME  PATTERN  DETECT  INPUTS 

193  COUNTER  3  LSBs  INPUTS  USED  FOR  BYT_CK 

LAST  KNOWN  START  OF  FRAME 

INPUT  DATA  STREAM 

ACTIVE  LOW  OUTPUT  ENABLE  INPUT 


; OUTPUTS: BIT 
LDB 
SUNK 
NO_FRM 
Q(2-0) 
BYT  CK 


NEXT  Tl  BIT  STREAM 

ACTIVE  LOW  LOAD  COUNTER  SIGNAL 

IN  SYNC  STATE  AND  NO  FRAME  BIT  ERROR 

FRAME  LOST,   IN  SEARCH  STATE 

STATE  VARIABLES 

8 -BIT  CHANNEL  SAMPLE  CLOCK 


EQUATIONS 


; STATE  MACHINE  OUTPUTS  AND  STATE  VARIABLES 


/BIT         =  DATA  *  /Q2  *  /Ql  *  Q0  *  FPD1B  *  SOF 
+  DATA  *  /Q2  *  /Ql  *  Q0  *  FPD2B  *  SOF 
+  /DATA  *  /SOF 
+  /DATA  *  /FPD1B  *  /FPD2B 
+  /DATA  *  Q2 
+  /DATA  *  Ql 
+  /DATA  *  /Q0 

/LDB         :=  Q2  *  Ql  *  Q0  *  /FPD1B  *  /FPD2B  +  SOF 

/SUNK       =  /Q2  *  /Ql  *  /Q0  +  Ql  *  Q0  ;   IN  FRAME  SYNC 


£1  MonolHhic  FJfJ  Memories  £1 
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/NO_FRM  =  /Q2  +  /Ql  +  /QO 


SEARCHING  FOR  FRAME  SYNC 


USING  THE  FOLLOWING  STATE  ASSIGNMENTS 


A  =  001 
B  =  010 
C  =  110 
D  =  100 
E  =  101 
F  =  111 
G  =  Oil 
H  =  000 


IN  SYNC  STATE 

SINGLE  FRAME  BIT  ERROR  STATE 


SEARCHING  STATE 
SEARCHING  STATE 
UNUSED  STATE  -  GOES  TO 


/Q2  :=  /FPD2B  *  /FPD1B  *  Ql  *  QO  *  RSTB  *  N0_T1CB 

+  /Q2   *  Ql  *  /SOF  *  RSTB  *  NO_TlCB 
+  /Q2  *  /Ql  *  QO  *  RSTB  *  N0_T1CB 

+  /FPD2B  *  /FPD1B  *  QO  *  SOF  *  RSTB  *  N0_T1CB 

/Ql  :=  /Ql  *  QO  *  /SOF  *  RSTB  *  N0_T1CB 

+  Q2  *  /Ql  *  /SOF  *  RSTB  *  N0_T1CB 

+  /FPD2B  *  /FPD1B  *  Q2  *  /QO  *  SOF  *  RSTB  *  N0_T1CB 

+  /FPD2B  *  /FPD1B  *  Q2  *  /Ql  *  RSTB  *  N0_T1CB 

+  /FPD2B  *  /FPD1B  *  /Q2  *  QO  *  SOF  *  RSTB  *  N0_T1CB 

/QO  :=  Ql  *  /QO  *  /SOF  *  RSTB  *  N0_T1CB 

+  Q2   *  /QO   *  /SOF  *  RSTB  *  N0_T1CB 
+  FPD1B  *  /Q2  *  /Ql  *  qo  *  SOF     *  RSTB  *  N0_T1CB 
+  FPD2B  *  /Q2  *  /Ql  *  QO  *  SOF     *  RSTB  *  N0_T1CB 
+  /FPD2B  *  /FPD1B  *  Ql  *  /QO  *  RSTB  *  NO_TlCB 

;  BYTE  CLOCK 


/BYT_CK  =  BC2  +  BC1  +  BCO 


SIMULATION 

TRACE_ON  T1_CKB  RSTB  N0_T1CB  DATA  FPD1B  FPD2B  SOF 
Q2  Ql  QO  BIT  LDB  SUNK  NO_FRM 
BYT  CK  BC2  BC1  BCO 


SETF  /OEB 

/RSTB  /N0_T1CB 
CLOCKF  T1_CKB 

SETF  RSTB  N0_T1CB 
DATA 

/BC2  /BC1  /BCO 
FPD1B  FPD2B  SOF 
CLOCKF  T1_CKB 

SETF  FPD1B  /FPD2B  SOF 
CLOCKF  Tl  CKB 


;  ENABLE  OUTPUT 
;  RESET  REGISTERS 


;  RESET  REGISTERS 

;  STATE  MACHINE  -  GO  TO  STATE  F 


;  LOOPS  AT  STATE  F 
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SETF  /FPD1B  FPD2B  SOF 
CLOCKF  T1_CKB 

SETF  FPD1B  /FPD2B  /SOF 
CLOCKF  T1_CKB 

SETF  /FPD1B  FPD2B  /SOF 
CLOCKF  T1_CKB 

SETF  FPD1B  FPD2B  /SOF 
CLOCKF  T1_CKB 

SETF  /FPD1B  /FPD2B  SOF 
CLOCKF  T1_CKB 

SETF  FPD1B  FPD2B  SOF 

/BC2  /BC1  BCO 
CLOCKF  T1_CKB 

SETF  /FPD1B  /FPD2B 
/BC2  BC1  /BCO 
CLOCKF  T1_CKB 

SETF  /SOF 

/BC2  BC1  BCO 
CLOCKF  T1_CKB 
SETF  FPD1B  /FPD2B  SOF 

BC2  /BC1  /BCO 
CLOCKF  T1_CKB 

SETF  /FPD1B  /FPD2B  /SOF 

BC2  /BC1  BCO 
CLOCKF  T1_CKB 
SETF  /FPD1B  /FPD2B  SOF 

BC2  BC1  /BCO 
CLOCKF  T1_CKB 

SETF  /SOF 

BC2  BC1  BCO 
CLOCKF  T1_CKB 
SETF  /FPD1B  /FPD2B  SOF 

/BC2  /BC1  /BCO 
CLOCKF  T1_CKB 

SETF  /FPD1B  FPD2B  SOF 

CLOCKF  T1_CKB 

SETF  FPD1B  FPD2B  /SOF 

/BC2  /BC1  BCO 
CLOCKF  T1_CKB 
SETF  /FPD1B  /FPD2B  SOF 
CLOCKF  T1_CKB 

SETF  /SOF 
CLOCKF  Tl  CKB 


;  LOOPS  AT  STATE  F 

;  LOOPS  AT  STATE  F 

;   LOOPS  AT  STATE  F 

;  LOOPS  AT  STATE  F 
;  GO  TO  STATE  G 

;  GO  TO  STATE  F 

;  GO  TO  STATE  G 

;  LOOP  AT  STATE  G 
;  GO  TO  STATE  F 

;  GO  TO  STATE  G 
;  GO  TO  STATE  A 

;  LOOP  AT  STATE  A 

;  LOOP  AT  STATE  A 

;  GO  TO  STATE  B 

;  LOOP  AT  STATE  B 
;  GO  TO  STATE  C 

;   LOOP  AT  STATE  C 


£1 


2-413 


CLOCKF  Tl  CKB 


;  GO  TO  STATE  D 


SETF  /SOF 

CLOCKF  T1_CKB 

SETF  /FPD1B  /FPD2B  SOF 

CLOCKF  T1_CKB 

SETF  /SOF 

CLOCKF  T1_CKB 

SETF  /FPD1B  /FPD2B  SOF 

CLOCKF  T1_CKB 

SETF  FPD1B  /FPD2B  SOF 
CLOCKF  T1_CKB 
SETF  FPD1B  /FPD2B  SOF 
CLOCKF  T1_CKB 

SETF  /FPD1B  /FPD2B  /SOF 
CLOCKF  T1_CKB 
SETF  /FPD1B  /FPD2B  SOF 
CLOCKF  T1_CKB 
SETF  /FPD1B  FPD2B  SOF 
CLOCKF  T1_CKB 
SETF  /FPD1B  /FPD2B  SOF 
FOR  l:=l  TO  1  DO  BEGIN 
CLOCKF  T1_CKB 

END 

SETF  FPD1B  SOF 
CLOCKF  T1_CKB 

SETF  /FPD1B  /FPD2B  /SOF 
CLOCKF  T1_CKB 
SETF  /FPD1B  /FPD2B  SOF 
CLOCKF  T1_CKB 
SETF  /FPD1B  FPD2B  SOF 
CLOCKF  T1_CKB 
SETF  /FPD1B  /FPD2B  SOF 
FOR  I:=l  TO  2  DO  BEGIN 
CLOCKF  T1_CKB 

END 

SETF  FPD1B  /SOF 
CLOCKF  T1_CKB 
SETF  FPD1B  SOF 
CLOCKF  T1_CKB 

SETF  /FPD1B  /FPD2B  /SOF 
CLOCKF  T1_CKB 
SETF  /FPD1B  /FPD2B  SOF 
CLOCKF  T1_CKB 
SETF  /FPD1B  FPD2B  SOF 
CLOCKF  T1_CKB 
SETF  /FPD1B  /FPD2B  SOF 
FOR  I:=l  TO  3  DO  BEGIN 
CLOCKF  Tl  CKB 


;  LOOP  AT  STATE  D 
;  GO  TO  STATE  E 

;  LOOP  AT  STATE  E 
;  GO  TO  STATE  A 

;  GO  TO  STATE  B 
;  GO  TO  STATE  F 

;  GO  TO  STATE  G 
;  GO  TO  STATE  A 
;  GO  TO  STATE  B 

;   GO  TO  STATE  C 

;  GO  TO  STATE  F 

;  GO  TO  STATE  G 
;  GO  TO  STATE  A 
;  GO  TO  STATE  B 

;  GO  TO  STATES  C,D 

;  LOOP  AT  STATE  D 
;  GO  TO  STATE  F 

;  GO  TO  STATE  G 
;  GO  TO  STATE  A 
;  GO  TO  STATE  B 

;  GO  TO  STATES  C,D, 
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END 

SETF  FPD1B  SOF 
CLOCKF  Tl  CKB 


SETF  /DATA 

SETF  /FPD1B  /FPD2B  /SOF 
CLOCKF  T1_CKB 
SETF  /FPD1B  /FPD2B  SOF 
CLOCKF  T1_CKB 
SETF  /FPD1B  FPD2B  SOF 
CLOCKF  T1_CKB 
SETF  /FPD1B  /FPD2B  SOF 
FOR  l:=l  TO  4  DO  BEGIN 
CLOCKF  T1_CKB 

END 

SETF  FPD1B  SOF 
CLOCKF  T1_CKB 
TRACE  OFF 


GO  TO  STATE  F 
INVERT  DATA 
GO  TO  STATE  G 
GO  TO  STATE  A 
GO  TO  STATE  B 


;  GO  TO  STATES  C,D,E,A 


;  GO  TO  STATE  B 
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PttLASM  SIMULATION,    V2.22  -  MARKET  RELEASE    <1 1-19-86) 
(C)    -  CQF'YR  I GHT  MONOLITHIC  MEMORIES   INC.  1986 
PAL ASM  SIMULATION  SELECTIVE   TRACE  LISTING 


Title 
Pattern 
ISfevi  si  on 


SYNC 

Page   :  1 


SYNC   PAL  Author 
tl   FRAME  SYNC  PAL.  FOR  Ti  Company 
F 1.06  Date 


STEVE  PATTERSON  AND  THER 


6/4/87 


q     cq     eg  cq  cq     eg  eg  eg     eg     eg     eg  e 


rrrrnTTTTT-mTTTrn-TTTlTT-rTTTTTTTTiTT-Ti' 

riJTrun;i 


X  X  X  I 
J  X  X  X 
X  X  X  I 

x  x  x  x 


XX 
XX 
XX 


xxxx 
xxxxxx 
xx- 
xx 


■xxxx 

I  I 
1  I  1 

.xxxx 

I  I 

■  I  I 

•XXX* 

1  I 

■  1  I 

■xxxx 

I  I  I  I  I  I  I  I  I  I  I  I  I . 


urn 
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SYNC 

Paae  :  2 


q     eg     eg     eg     eg     cq     eg     eg     eg     eg  e 


T  1  ...  CKB 

RSTB 

N0...T1CB 

DATA 

FPD1B 

FPD2B 

SDF 

Q2 

Dl 

DO 

BIT 

LDB 

SUNK 

NO  _ FRM 

BYT  _  CK 

BC2 

BC1 

BCO 


1 1  U  1 1 1 1 1 1 1  TTTTTT^rTTrrrTTrrTTTn-rrrrm 


u 


u: 


I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I 
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SYNC 
Pace  : 


T 1  CKB 

RBTB 

NO.TtCB 

DATA 

FPD  1  B 

FPD2B 

SOF 

02 

Bl 

QO 

BIT 

LDB 

SUNK 

N0_FRM 

BYT  _  CK 

BC2 

BC1 

BCO 


g     eg     eg     eg     eg     c     eg     eg     eg     eg  eg 

M  M  n  m  1 1 1 1 1 1 1 1 1 1 1 1  n 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 


mm 

n 

m 

m 

m 

4  1- 

i — ^ 

4  'r- 

4  l. 

'.*  ■  i 

+  -r 

!  ! 

- !  ! 
..;  4. 

1  ! 
4  4 

-i — i- 

-1  V 

T  V 

•  ' 
"i  1 

-1  

T" 

-:  i 

1  1 

4  i- 

i 

! 

|  ! 

1  1 

;  ; 

1- 

'•  ■ 

I  ! 

: 

1  I 
)  I 

'<  ! 

i 

-J  : 

!  - 

-:  : 

1  1 

>  ■ 

-i  

1  1 

i  ; 

i  ; 

1  1 

\  1 

;  | 

:  ft 

->— 

- !  ! 
- ,  i 

' 

;  i ; 

I : 

:  r! 

;  ; 

!  i 

!  i 

;  ii 

;  ii 

!  Ii 

;  | 
-»  !■ 

;  l 

i 

j! 

• 

■4  1" 

■ 

4  r 

i  n 

-J  i 

;  w 

'  ' 

;  r; 

!  i; 

u 

'<  ' 

1 — 1  : 

i  ; 
j; 

:  i: 

I   I   I    I   I   I   I   I   I   I   I   I   I   I   I   I   I    I   I   I    I   I   I   I   I    I   I   I   I    L.I    I   I   I   I    I    I    I    I  I. 
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SYNC 

Page 


cq     eg     eg     eg     c     eg     cq     cq     eg  eg 


T1..CKB 

RBTB 

N0_ T1CB 

DATA 

FPD 1  B 

FPD2B 

SOF 

Q2 

01 

00 

BIT 

LDB 

SUNK 

NO  frm 

BYT..CK 
BC2 
BC1 
BCO 


:  i 


U 


h 


rr 


MU!il|li||ii||ii|ii.Mii||H|lii| 


 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1  1 1 1 1 1 1 1 1 
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SYNC 
Page 


eg 


eg     egg     eg     cq  eg 


T1_CKB 

RSTB 

NO  T  1  CE< 

DATA 

FPD1B 

FPD2B 

SDF 

Q2 

Ql 

QO 

BIT 

LDB 

SUNK 

NO.FRM 

BYT_CK 

BC2 

BC1 

BCO 


I  I  Ml  I  I  I  I  I  I  I  I  I  I  I 

runrtniruir 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


14 


n 


1 1 1 1 1 1 1 1 1 1 1 1 1 1  1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
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SYNC 

Page  :  6 


eg 


T1_CKB 

RSTB 

ND..T1CB 

DATA 

FPD1B 

FPD2B 

SOF 

Q2 

SI 

QO 

BIT 

LDB 

SUNK 

NQ_FRM 

BYT_CK 

BC2 
BC1 
BCO 

H 


71 


h 


ST. 

u 


D4  Frame  Synchronization 


PALASM  SIMULATION,  V2.22  -  MARKET  RELEASE  (11-19-86) 
(C)   -  COPYRIGHT  MONOLITHIC  MEMORIES  INC,  1986 
PALASM  SIMULATION  SELECTIVE  TRACE  LISTING 


Title        :  SYNC_PAL  Author 
Pattern     :  Tl  FRAME  SYNC  PAL  FOR  TICompany 
Revision  :  PI. 06  Date 


STEVE  PATTERSON  AND  THER 


6/4/87 


SYNC 
Page  : 

T1_CKB 
RSTB 
N0_T1CB 
DATA 
FPD1B 
FPD2B 
SOF 
Q2 
Ql 
Q0 
BIT 
LDB 
UNK 
0_FRM 
YT_CK 

2 
CI 
CO 


g    eg  eg 

XHHLLHHLLH 

LLLLHHHHHH 

LLLLHHHHHH 

XXXXHHHHHH 

XXXXHHHHHH 

XXXXHHHHLL 

XXXXHHHHHH 

XXHHHHHHHH 

XXHHHHHHHH 

XXHHHHHHHH 

XXXXHHHHHH 

XXXXXXLLLL 

XXLLLLLLLL 

XXHHHHHHHH 

XXXXHHHHHH 

XXXXLLLLLL 

XXXXLLLLLL 

XXXXLLLLLL 


eg  eg  eg 

LLHLLHHLLH 

HHHHHHHHHH 

HHHHHHHHHH 

HHHHHHHHHH 

HLLLHHHHLL 

LHHHLLLLHH 

HHHHLLLLLL 

HHHHHHHHHH 

HHHHHHHHHH 

HHHHHHHHHH 

HHHHHHHHHH 

LLLLLLHHHH 

HHHHHHHHHH 
HHHHHHHHHH 


LLLLLLLLLL 
LLLLLLLLLL 


eg  eg  eg 

LLHLLHHLLH 

HHHHHHHHHH 

HHHHHHHHHH 

HHHHHHHHHH 

LHHHLLLLHH 

HHHHLLLLHH 

LLLLHHHHHH 

HHHHHHLLLL 

HHHHHHHHHH 

HHHHHHHHHH 

HHHHHHHHHH 

HHHHHHLLLL 

LLLLLLLLLL 

HHHHHHHHLL 
LLLLLLLLLL 


eg  eg  c 
HLLHHLLHHL 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
HHLLLLLLLL 
HHLLLLLLLL 
HHHHHHLLLL 
HHHHLLLLLL 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLHH 

HHHHLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
LLHHHHHHHH 
HHLLLLHHHH 


g    eg  eg 

LHHLLHHLLH 

HHHHHHHHHH 

HHHHHHHHHH 

HHHHHHHHHH 

HHHHLLLLLL 

LLLLLLLLLL 

HHHHLLLLHH 

LLHHHHLLLL 

HHHHHHHHHH 

HHHHHHHHHH 

HHHHHHHHHH 

HHLLLLLLLL 

LLLLLLLLLL 

LLHHHHLLLL 

LLLLLLLLLL 

HHHHHHHHHH 

LLLLLLLLHH 

LLLLHHHHLL 


eg  eg  c 
HLLHHLLHHL 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
LLLLLLLLLL 
HHLLLLHHHH 
LLLLLLLLLL 
LLLLLLLLLL 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLHHHHLL 
HHHHHHHHHH 
LLLLLLLLLL 
LLLLLLHHHH 
HHHHHHLLLL 
HHHHHHLLLL 
LLHHHHLLLL 


g    eg  eg 

LHHLLHHLLH 

HHHHHHHHHH 

HHHHHHHHHH 

HHHHHHHHHH 

LLLLHHHHLL 

HHHHHHHHLL 

HHHHLLLLHH 

LLLLLLLLLL 

LLHHHHHHHH 

HHLLLLLLLL 

LLHHHHHHHH 

LLLLLLHHHH 

HHHHHHHHHH 

LLLLLLLLLL 

HHHHLLLLLL 

LLLLLLLLLL 

LLLLLLLLLL 

LLLLHHHHHH 


eg  eg  c 
HLLHHLLHHL 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
LLLLLLLLLL 
HHLLLLHHHH 
HHHHHHHHHH 
HHHHHHHHLL 
LLLLLLLLLL 
HHHHHHHHHH 
LLLLHHHHLL 
HHHHHHHHHH 
LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
HHHHHHHHHH 
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SYNC 

Page  :  3 


g    eg    eg      eg    eg    c      eg    eg      eg    eg  eg 

T1_CKB  LHHLLHHLLH  HLLHHLLHHL  HHLLHHLLHH  LLHHLLHHLI 

RSTB  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH 

N0_T1CB  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH 

DATA  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH 

FPD1B  LLLLLLLLLL  LLLLLLHHHH  HHHLLLLLLL  LLLLLLLLLH 

FPD2B  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL  LHHHHLLLLL 

SOF  LLLLHHHHLL  LLHHHHHHHH  HHHLLLLHHH  HHHHHHHHHH 

Q2  HHHHHHHHHH  HHHHLLLLLL  LHHHHLLLLL  LLLLLLLHHH 

Ql  LLLLLLLLLL  LLLLLLLLHH  HHHHHHHHHL  LLLHHHHHHH 

QO  LLLLLLHHHH  HHHHHHHHLL  LHHHHHHHHH  HHHLLLLLLL 

BIT  HHHHHHHHHH  HHHHHHLLHH  HHHHHHHHHH  HLLHHHHHHH 

LDB  LLHHHHLLLL  HHHHLLLLLL  LLLLLLLLLL  LLLLLLLLLL 

SUNK  HHHHHHHHHH  HHHHHHHHHH  HLLT,T,T,LLLH  HHHHHHHHHH 

NO_FRM  LLLLLLLLLL  LLLLLLLLLL  LHHHHLLLLL  LLLLLLLLLL 

BYT_CK  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL 

BC2  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL 

BC1  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL 

BCO  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH 


SYNC 

Page  :  4 


eg    eg      eg    eg    c      eg    eg    c  g    eg  eg 

T1_CKB  HHLLHHLLHH  LLHHLLHHLH  HLLHHLLHHL  LHHLLHHLLH 

RSTB  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH 

NOJTICB  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH 

DATA  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH 

FPD1B  HHHLLLLLLL  LLLLLLLLLL  LLHHHHHHHH  LLLLLLLLLL 

FPD2B  LLLLLLLLLL  LHHHHLLLLL  LLLLLLLLLL  LLLLLLLLHH 

SOF  HHHLLLLHHH  HHHHHHHHHH  HHLLLLHHHH  LLLLHHHHHH 

Q2  HHHHHLLLLL  LLLLLLLHHH  HHHHHHHHHH  HHLLLLLLLL 

Ql  HHHHHHHHHL  LLLHHHHHHH  LLLLLLLLHH  HHHHHHLLLL 

QO  LHHHHHHHHH  HHHLLLLLLL  LLLLLLLLHH  HHHHHHHHHH 

BIT  HHHHHHHHHH  HLLHHHHHHH  HHHHHHHHHH  HHHHHHHHLL 

LDB  LLLLLLLLLL  LLLLLLLLLL  LLLLHHHHLL  LLLLLLLLLL 

SUNK  HLLLLLLLLH  HHHHHHHHHH  HHHHHHHHLL  LLLLLLHHHH 

NO_FRM  LHHHHLLLLL  LLLLLLLLLL  LLLLLLLLHH  HHLLLLLLLL 

BYT_CK  TiTiTiTiLTiTiTiTiL  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL 

BC2  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL 

BC1  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL 
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SYNC 

Page  :  5 


eg    c    c      eg    egg      eg    eg    eg      c    c  c 

T1_CKB  HLLHHLHHLH  HLLHHLLLHH  LLHHLLHHLL  HHLHHLHHLH 

RSTB  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH 

N0_T1CB  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH 

DATA  HHHHHHHHHH  HHHHHHLLLL  LLLLLLLLLL  LLLLLLLLLL 

FPD1B  LLLLLLLLLL  LLHHHHHLLL  LLLLLLLLLL  LLLLLLLLLL 

FPD2B  HHLLLLLLLL  LLLLLLLLLL  LLLLLHHHHL  LLLLLLLLLL 

SOF  HHHHHHHHHH  HHHHHHHLLL  LHHHHHHHHH  HHHHHHHHHH 

Q2  LLLLHHHHHH  HHHHHHHHHL  LLLLLLLLLL  LHHHHHHHHH 

Ql  HHHHHHHLLL  LLLLHHHHHH  HHHLLLLHHH  HHHHLLLLLL 

QO  LLLLLLLLLL  HHHHHHHHHH  HHHHHHHLLL  LLLLLLLHHH 

BIT  HHHHHHHHHH  HHHHHHLLLL  LLLLLHHLLL  LLLLLLLLLL 

LDB  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL 

SUNK  HHHHHHHHHH  HHHHLLLLLL  LLLHHHHHHH  HHHHHHHHHH 

NO_FRM  LLLLLLLLLL  LLLLHHHHHL  LLLLLLLLLL  LLLLLLLLLL 

BYT_CK  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL 

BC2  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL 

BC1  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL 

BCO  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH 


SYNC 

Page  :  6 


eg  c 

Tl  CKB 

HLLHHL 

RSTB 

HHHHHH 

NO  T1CB 

HHHHHH 

DATA 

LLLLLL 

FPD1B 

LLHHHH 

FPD2B 

LLLLLL 

SOF 

HHHHHH 

Q2 

LLLLLL 

Ql 

LLLLHH 

QO 

HHHHLL 

BIT 

LLHHLL 

LDB 

LLLLLL 

SUNK 

HHHHHH 

NO  FRM 

T.T.T.T.T.T. 

BYT  CK 

T1T1T1T1T1T1 

BC2 

T.T.T.T.T.T. 

BC1 

LLLLLL 

BCO 

HHHHHH 
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TITLE  SUPER_FRAME_PAL 

PATTERN  SUPER  FRAME  PAL  FOR  Tl  INTERFACE 
REVISION  PI. 03 

AUTHOR  STEVE  PATTERSON  AND  THERESA  SHAFER 

COMPANY 

DATE  6/4/87 

This  PAL  device  counts  the  Tl  Frames  and  controls  the  Signal  Bits 
extraction  process,  including  Fly  Wheeling.     It  also  provides  variou 
other  signals  which  indicate  the  frames  with  signal  bits. 
The  counter  is  reset  with  either  RSTB  or  when  frame  detection  is  SUNK 
and  frame  1  occurs  from  two  different  sources  (FRM1  &  SOF) . 


CHIP  SUPER  FRAME  PAL16R6 


;  PINS 
;l 

Tl  CKB 


RSTB 


FRM1B 


SUNK 


SOF 


NC 


NC 


NC 


NC 


10 
GND 


;ii 

OEB 


12 
NC 


13 
Q3 


14 
Q2 


15 
Ql 


16 

Q0 


17 

FRM  6 


18 

FRM  12 


19 
NC 


20 
VCC 


INPUTS : T1_CKB 
RSTB 
SOF 

FRM1B 

SUNK 
OEB 

OUTPUTS:Q(3-0) 
FRM_6 
FRM  12 


ACTIVE  LOW  EXTERNAL  Tl  CLOCK 

ACTIVE  LOW  MASTER  RESET 

ACTIVE  HIGH  INPUT  SIGNAL  INDICATING 

LAST  KNOWN  START  OF  FRAME 
ACTIVE  LOW  INPUT  SIGNAL  INDICATING 

START  OF  FRAME  1 
ACTIVE  HIGH  INPUT  SIGNAL  INDICATING  "IN  FRAME  SYNC" 
ACTIVE  LOW  OUTPUT  ENABLE  INPUT 

STATE  VARIABLES 

CLOCK  SIGNAL  WHICH  INDICATES  SIGNAL  BIT  A 
CLOCK  SIGNAL  WHICH  INDICATES  SIGNAL  BIT  B 


EQUATIONS 


/Q3    :=  /Q2   *  Ql  *  Q0 
+  /Q3   *  /Q2 
+  /Q3   *  /Ql 
+  /Q3   *  /Q0 
+  /FRM1B  *  SOF  *  SUNK 
+  /RSTB 


;  RESET  WHEN  SUNK  AND  FRAME  1  OCCURS 
MASTER  RESET 


/Q2   :=  Q2  *  Ql  *  Q0 
+  /Q2  *  Q3 
+  /Q2  *  /Ql 
+  /Q2  *  /Q0 
+  /FRM1B  *  SOF  *  SUNK 
+  /RSTB 
+  /RSTB 


RESET  WHEN  SUNK  AND  FRAME  1  OCCURS 
MASTER  RESET 

MASTER  RESET 


2-425 


D4  Frame  Synchronization 


/Ql   :=  Ql  *  QO 

+  /Ql  *  /QO 
+  /FRM1B  *  SOF  *  SUNK 


/QO 


:=  QO 

+  /FRM1B  * 
+  /RSTB 


SOF  *  SUNK 


;  RESET  WHEN  SUNK  AND  FRAME  1  OCCURS 


;  RESET  WHEN  SUNK  AND  FRAME  1  OCCURS 
;  MASTER  RESET 


SIGNAL  BIT  EXTRACTION  OUTPUTS 
/FRM_6     :=  Q3  +  /Q2  +  Ql  +  QO 
/FRM_12   :=  /Q3  +     Q2  +  /Ql  +  QO 
SIMULATION 

TRACE_ON  T1_CKB  RSTB  FRM1B  SOF  SUNK 
Q3  Q2  Ql  QO  FRM_6  FRM_12 

SETF  /OEB 

/RSTB 
CLOCKF  T1_CKB 
SETF  RSTB  /SOF  FRM1B  SUNK 
CLOCKF  T1_CKB 

FOR  l:=l  TO  24  DO 
BEGIN 

CLOCKF  T1_CKB 
END 

SETF  /SUNK  SOF  /FRM1B 

CLOCKF  T1_CKB 

SETF  /SUNK  /SOF  /FRM1B 

CLOCKF  T1_CKB 

SETF  /SUNK  SOF  FRM1B 

CLOCKF  T1_CKB 

SETF  /SUNK  /SOF  FRM1B 

CLOCKF  T1_CKB 

SETF  SUNK  SOF  /FRM1B 

CLOCKF  T1_CKB 

SETF  SUNK  /SOF  /FRM1B 

CLOCKF  T1_CKB 

SETF  SUNK  /SOF  FRM1B 

CLOCKF  T1_CKB 

SETF  SUNK  SOF  FRM1B 

CLOCKF  T1_CKB 

TRACE  OFF 


ENABLE  OUTPUT 
RESET  REGISTERS 


;  COUNT 


;  RESET  CONDITIONS 
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PAL ASM  SIMULATION.    V2.22  -  MARKET  RELEASE    (1 1-19-86} 
(C)    -   COPYRIGHT  MONOLITHIC  MEMORIES   INC.  1986 
PALASM  SIMULATION  SELECTIVE  TRACE  LISTING 

Title  :    SUPER_FRAME_PAL  Author        :    STEVE  PATTERSON  AND  THEF 

Pattern      :    SUPER  FRAME  PAL  FOR  Tl    ICompany  : 
Revision    :    PI. 03  Date  :  6/4/37 


SUPER  FRAME 

Page  :  l 


gcgccccccc 


fTTJ'fTT]  !  I  |  I  I  |  I  T  |  I  I  |  I  I  |  I  \  JTFfl  I  |  I  I  |  I  I  |TT 

x^aruirTr-uiririnTTn-ri 


1XXXXX- 

I  i  I  1  I  I 


i  M 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
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SUPER  FRAME 
Page   :  2 


T 1  CKB 

RSTB 

FRM  1  B 

SOF 

SUNK 

Q3 

02 

Qfi 
00 

FRM_6 
FRM  12 


jti  1 1  i  [  1 1  1 1 1 1 1 1 1 1  rnTfTTTTTTrrTTTjrrfTTj 

:^rLnJTr-TrTririj--TnrTriJ-ii 


-L_LX-LJ_I    I    I   I    I  i_l   I   I    I   I   i   I  I   I    I   I   I   I   I  i  I   1   I   I   I   I  II 


SUPER...  FRAME 
Page  :  3 

g     eg     eg     eg     eg     eg     eg     eg  c 


T 1 _CKB 

RSTB 

FRM  IB 

SOF 

SUNK 

Q3 

Q2 

at 

QO 

FRM_6 
FRM..  1  2 

H 


n||ii||ii|i  ii||ii||M||MJ|n7 

:riu"U"l4J^rurururT 


71 


1— U 


JT 
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PALASM  SIMULATION,  V2.22  -  MARKET  RELEASE  (11-19-86) 
(C)    -  COPYRIGHT  MONOLITHIC  MEMORIES  INC,  1986 
PALASM  SIMULATION  SELECTIVE  TRACE  LISTING 

Title         :   SUPER_FRAME_PAL  Author       :   STEVE  PATTERSON  AND  THER 

Pattern     :  SUPER  FRAME  PAL  FOR  Tl  ICompany  : 
Revision  :  PI. 03  Date  :  6/4/87 


SUPER_FRAME 

Page  :  1 

g  eg  c 
T1_CKB  XHHLLHHLHH 
RSTB  LLLLHHHHHH 
FRM1B  XXXXHHHHHH 
SOF  XXXXLLLLLL 
SUNK  XXXXHHHHHH 
Q3  XXLLLLLLLL 
Q2  XXLLLLLLLL 
Ql  XXLLLLLLLH 
QO  XXLLLLHHHL 
FRM_6  XXXXXXLLLL 
FRM  12  XXXXXXLLLL 


c  c  c  c 
LHHLHHLHHL 
HHHHHHHHHH 
HHHHHHHHHH 


HHHHHHHHHH 
LLLLLLLLLL 
LLLLLHHHHH 
HHHHHLLLLL 
LLHHHLLLHH 
LLLLLLLLHH 
LLLLLLLLLL 


c  c  c 
HHLHHLHHLH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
LLLLLLLHHH 
HHHHHHHLLL 
LHHHHHHLLL 
HLLLHHHLLL 
HLLLLLLLLL 
LLLLLLLLLL 


c  c  c 
HLHHLHHLHH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
HHHHHHHHHL 
LLLLLLLLLL 
LLLHHHHHHL 
HHHLLLHHHL 
LLLLLLLLLL 
LLLLLLHHHL 


SUPER_FRAME 

Page  :  2 

c  c  c  c 
T1_CKB  LHHLHHLHHL 
RSTB  HHHHHHHHHH 
FRM1B  HHHHHHHHHH 
SOF  LLLLLLLLLL 
SUNK  HHHHHHHHHH 
Q3  LLLLLLLLLL 
Q2  LLLLLLLLLL 
Ql  LLLLLHHHHH 
QO  LLHHHLLLHH 
FRM_6  LLLLLLLLLL 
FRM 


c  c  c 
HHLHHLHHLH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
LLLLLLLLLL 
LHHHHHHHHH 
HLLLLLLHHH 
HLLLHHHLLL 
LLLLHHHLLL 
LLLLLLLLLL 


c  c  c 
HLHHLHHLHH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
LLLHHHHHHH 
HHHLLLLLLL 
HHHLLLLLLH 
HHHLLLHHHL 

LLLLLLLLLL 


c  c  c  c 
LHHLHHLHHL 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
HHHHHLLLLL 
LLLLLLLLLL 
HHHHHLLLLL 
LLHHHLLLHH 
LLLLLLLLLL 
LLHHHLLLLL 
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SUPER_FRAME 

Page  :  3 

g  eg  eg  eg  eg  eg  eg  eg  c 
T1_CKB  LHHLLHHLLH  HLLHHLLHHL  LHHLLHHLLH  HL 
RSTB  HHHHHHHHHH  HHHHHHHHHH  HHHHHHHHHH  HH 
FRM1B  LLLLLLLLHH  HHHHHHLLLL  LLLLHHHHHH  HH 
SOF  HHHHLLLLHH  HHLLLLHHHH  LLLLLLLLHH  HH 
SUNK  LLLLLLLLLL  LLLLLLHHHH  HHHHHHHHHH  HH 
Q3  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL  LL 

Q2  LLLLLLLLLL  HHHHHHHHLL  LLLLLLLLLL  LL 

Ql  LLHHHHHHHH  LLLLLLLLLL  LLLLLLHHHH  HH 

QO  HHLLLLHHHH  LLLLHHHHLL  LLHHHHLLLL  HH 

FRM_6  LLLLLLLLLL  LLLLHHHHLL  LLLLLLLLLL  LL 
FRM_12  LLLLLLLLLL  LLLLLLLLLL  LLLLLLLLLL  LL 


T1  Extended  Superframe  Provides 
Transmission  Error  Detection 


Principles  of  CRC-6 

In  digital  transmission,  bits  are  grouped  into  frames.  North 
America's  standard,  T-carrier  uses  a  framing  bit  to  group  data 
into  frames.  lnT1  extended-superframe  (ESF  or  Fe),  the  frame- 
bit  position  carries  additional  information  besides  framing.  One 
such  extended  functionality  provides  transmission  line  error 
detection.  In  transmission  channels,  most  errors  are  bursty  in 
nature;  the  cyclic-redundancy-checking  (CRC)  codes  provide 
sufficient  detection  for  these  types  of  errors.  With  CRC,  the 
block  length  and  the  polynomial  determine  the  type  and  percent- 
age of  error  which  can  be  detected.  For  ESF,  the  block  length  is 
one  extended-superframe  or  4,632  bits.  The  T1  Fe  standard 
specifies  the  CRC-6  polynomial.  The  CRC-6  =  X6  +  X  +  1 
polynomial  detects  98.4%  of  one  or  more  errors. 


output  as  a  parallel  word.  The  CHECK_BIT  signal  enables  the 
remainder  to  be  shifted  out  serially;  it  must  be  held  HIGH  until 
the  entire  6-bit  remainder  has  been  shifted  out.  When  CHECK_ 
BIT  =  0,  CRC6  echoes  the  serial  data  input  stream.  CB1 -CB6  are 
the  parallel  remainder  bits  where  CB1  is  the  most  significant  bit 
and  CB6  is  the  least  significant  bit. 

There  are  two  implementations  for  error  detection  at  the 
receiver.  First,  the  transmitted  CRC-6  value  is  fed  serially  along 
with  the  data.  If  a  non-zero  remainder  is  calculated,  an  error  has 
occurred.  In  this  case,  the  ERROR  flag  indicates  at  least  one 
transmission  error.  Second,  the  CRC-6  for  the  extended- 
superframe  can  be  recalculated  and  externally  compared  with 
the  transmitted  CRC-6  value. 


Functional  Description 

Figure  1  shows  a  simple  configuration  of  XOR-gates  and 
registers  which  calculates  the  CRC-6  polynomial. 


Figure  1.  CRC-6  Block  Diagram 


The  low-power  PALC16R6Z  implements  the  CRC-6  polynomial 
for  a  serial  data  stream  (see  Figure  1 ).  At  the  beginning  of  CRC 
generation,  the  remainder  is  preset  to  all  zeros  by  enabling  the 
synchronous  CLR.  After  shifting  in  the  4,632-bit  superframe,  the 
result  is  the  CRC-6  remainder.  This  remainder  is  transmitted  in 
the  check-bit  positions  of  the  subsequent  superframe.  The 
remainder  can  be  shifted  out  serially,  most  significant  bit  first  or 


Figure  2.  CRC-6  Configuration 


PAL*  is  a  registered  trademark  of  Monolithic  Memories 
ZPAL'"  is  a  trademark  of  Monolithic  Memories. 
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PAL®  Device  Design  Specification 

TITLE  CRC6 

PATTERN  CRC  6  ERROR  DETECTION  PAL 

REVISION  1.01 

AUTHOR  THERESA  SHAFER 

COMPANY  MMI 

DATE  9/10/86 

CHIP  CRC 6  PAL16R6 

CK  DATA  /CLR  CHECK_BIT  EOX  NC  NC  NC  NC  GND 
/OE  ERROR  CBl  CB2  CB3  CB4  CB5  CB6  CRC 6  VCC 


THE  CRC- 6  PAL  PERFORMS  ERROR  DETECTION  ON  A  SERIAL  DATA 
STREAM.      CRC-6  PAL  SUPPORTS  THE  Tl  Fe  STANDARD  FOR  ERROR 
DETECTION.   THE  CRC  RESULT  CAN  BE  OUTPUT  EITHER  IN  SERIAL 
OR  IN  PARALLEL. 

CRC-6  =  X**6  +  X  +  1 


INPUTS : 


OUTPUTS ! 


CK 

/OE 

/CLR 

DATA 

CHECK_BIT 
EOX 


CRC  6 
ERROR 

CBl  -  CB6 


EQUATIONS 


EXTERNAL  CLOCK 

ACTIVE  LOW  OUTPUT  ENABLE  SIGNAL 
ACTIVE  LOW  CLEAR  SIGNAL  WHICH  RESETS  THE 
CRC 

SERIAL  DATA  STREAM  INPUT 

ACTIVE  HIGH  SELECT  INPUT  WHICH  SWITCHES 
FROM  SERIAL  DATA  STREAM  TO  CHECK  BITS 
ACTIVE  HIGH  SIGNAL  WHICH  INDICATES  END  OF 
EXTENDED  SUPERFRAME  AND  ENABLES  THE 
ERROR  DETECTION  FLAG 

CRC  SERIAL  OUTPUT 

ACTIVE  HIGH  OUTPUT  FLAG  WHICH  INDICATES 
A  TRANSMISSION  ERROR 
REMAINDER  BITS    (CHECK  BITS) 
CBl  -  MOST  SIGNIFICANT  BIT 
CB6  -  LEAST  SIGNIFICANT  BIT 


CRC-6  PAL 

/CRC 6  =  /CBl  *  CHECK_BIT 
+  /DATA  *  /CHECK_BIT 


/: 

/ 
/< 
/ 
/ 

SRROR  =  EOX  *  /CBl  *  /CB2  *  /CB3  *  /CB4  *  /CB5  *  /CB6 
+  /EOX 

:B1  :=  /CB2  +  CLR 

:B2   :=  /CB3  +  CLR 

:B3   :=  /CB4  +  CLR 

:B4   :=  /CB5  +  CLR 

2- 
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/CB5 


/CB6 


:=  /DATA  *  /CB1  *  /CB6 

+  DATA  *  CB1  *  /CB6 

+  DATA  *  /CB1  *  CB6  *  /CHECK_BIT 

+  /DATA  *  CB1  *  CB6  *  /CHECK_BIT 

+  /CB6  *  CHECK_BIT 

+  CLR 


:=  DATA  *  CB1  *  /CHECK_BIT 
+  /DATA  *  /CB1  *  /CHECK_BIT 


/CB1 
CLR 


*  CHECK  BIT 


SIMULATION 

TRACE_ON  CK  /OE  /CLR  CHECK_BIT  EOX  DATA  CRC6  ERROR 
CB1  CB2  CB3  CB4  CB5  CB6 


SETF  OE 

CLR 
CLOCKF  CK 

;  INITIALIZE 
SETF  DATA 

/CHECK_BIT 

/EOX 

/CLR 
CLOCKF  CK 

;  FIND  PERIOD 

SETF  /DATA 

FOR  J:=  0  TO  70  DO 
BEGIN 
CLOCKF  CK 
END 


ENABLE  OUTPUT 

CLEAR  SHIFT  REGISTER 


CALULATE  CRC-6  FOR  1110010101 


SETF  CLR 

DATA 
CLOCKF  CK 
SETF  /CLR 
DATA 
CLOCKF  CK 
SETF  DATA 
CLOCKF  CK 
SETF  DATA 
CLOCKF  CK 
SETF  /DATA 
CLOCKF  CK 
SETF  /DATA 
CLOCKF  CK 
SETF  DATA 


CLEAR  SHIFT  REGISTER 


Monollthlol 
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CLOCKF  CK 
SETF  /DATA 
CLOCKF  CK 
SETF  DATA 
CLOCKF  CK 
SETF  /DATA 
CLOCKF  CK 
SETF  DATA 
CLOCKF  CK 

CHECK  ERROR  DETECTION 
SETF  EOX  ;     CHECK  ERROR  FLAG 

SETF  /DATA  ;  001001 

CLOCKF  CK 
SETF  /DATA 
CLOCKF  CK 
SETF  DATA 
CLOCKF  CK 
SETF  /DATA 
CLOCKF  CK 
SETF  /DATA 
CLOCKF  CK 
SETF  DATA 
CLOCKF  CK 

;  READ  CRC  RESULT 
SETF  /EOX 
SETF  /DATA 
CLOCKF  CK 
SETF  DATA 
CLOCKF  CK 
SETF  CHECK  BIT 


DATA 

FOR 


FOR  J:=  0  TO  6  DO 
BEGIN 
CLOCKF  CK 
END 


TRACE  OFF 
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Time  Division  Multiplexing 

with  the  LCA  Device  AN-161 


Introduction 

The  Logic  Cell  Array  (LCA  device)  implements  a  multiplexer  and 
counter  used  in  time  division  multiplexing.  With  the  device's 
flexible  I/O  pins,  the  multiplexer  and  counter  are  implemented 
into  a  single  CMOS  device.  This  application  note  covers  time 
division  multiplexing  and  the  design  of  a  multiplexer  and  coun- 
ter. Additional  information  on  how  to  design  with  an  LCA  device 
can  be  found  in  the  LCA  design  methodology  chapter  in  Mono- 
lithic Memories'  LCA  Design  and  Applications  Handbook.  For 
programming  the  LCA  device,  refer  to  "Configuring  the  LCA  De- 
vice", Monolithic  Memories'  Application  Note  1 82. 

Principles  of  Multiplexing 

Multiplexing  efficiently  utilizes  data  communication  lines  by 
combining  multiple  low-speed  signals  into  a  single,  high-speed 
line.  The  two  methods  of  performing  multiplexing  are  Time  Divi- 
sion Multiplexing  (TDM)  and  Frequency  Division  Multiplexing 
(FDM).  TDM  divides  the  transmission  bandwidth  into  equal  time 
slots  where  each  input  signal  is  assigned  one  time  slot  per  time 
cycle.  Once  assigned,  that  time  slot  is  not  used  by  any  other 
input.  Figure  1  shows  a  multiplexer  combining  three  low-speed 
signals  into  one  high-speed  line.  Terminal  A  transmits  during 
the  first  time  slot,  B  transmits  during  the  second  time  slot,  and 
C  transmits  during  the  third  time  slot.  This  sequence  is  repeat- 
ed every  time  cycle.  FDM,  on  the  other  hand,  divides  the  fre- 
quency spectrum  among  logical  channels  where  each  channel 
has  full  bandwidth  of  its  assigned  frequencies.  Analog  sys- 
tems usually  use  FDM. 


Figure  1.  Time  Division  Multiplexing 


There  are  many  classes  of  TDMs  including  bit  interleave,  char- 
acter interleave,  statistical  TDM,  and  T1  multiplexers.  In  bit  in- 
terleaved TDM,  each  input  is  assigned  to  one  time  slot.  Each 
time  slot's  length  is  one  bit.  Character  interleaved  TDM  is  simi- 
lar to  bit  interleave,  except  that  each  time  slot  represents  one 
character.  Statistical  multiplexing  assigns  the  bandwidth  into 
unequal  slots  where  only  the  active  incoming  lines  are  as- 


signed slots.  The  slots  are  of  variable  length,  so  each  input  is 
assigned  the  amount  of  bandwidth  needed.  The  T1  standard 
specifies  twenty-four  64  kbps  channels  multiplexed  on  a  1 .544 
Mbps  high-speed  link. 

Data  Selection/Time  Slot 
Assignment 

To  transmit  data  from  terminal  A  in  the  first  time  slot  as  shown 
in  Figure  1,  data  buffering,  rate  adaption  and  data  selection 
must  be  performed.  Data  buffering  and  rate  adaption  are  re- 
quired since  the  rates  of  the  lines  to  be  multiplexed  are  slower 
than  the  high-speed  link.  A  serializing  FIFO  or  shift  register  can 
implement  the  required  buffering.  Rate  adaption  is  performed 
by  a  clock/shift  scheme  tailored  to  the  exact  application.  Once 
buffered,  the  data  must  be  selected  in  the  proper  order.  Sever- 
al methods  can  be  used  for  data  selection;  one  method  is  a  32- 
to-1  multiplexer.  To  implement  sequential  selection  of  input 
lines,  an  on-board  counter  selects  the  multiplexer's  output. 
However,  for  random  selection  needed  in  statistical  multiplex- 
ing, more  flexibility  is  needed.  A  2-to-1  multiplexer  provides 
this  flexibility  by  selecting  between  the  counter  and  random 
selection  inputs. 

Comparison  of  Design  Methodologies 

The  32-to-1  multiplexer  and  counter  design  can  be  implement- 
ed in  several  ways.  Briefly,  we  will  evaluate  discrete  logic,  PAL 
and  LCA  device  implementations. 

For  this  design,  thirty-eight  inputs,  five  registers,  and  one  out- 
put are  required.  Conventional  PLD  devices  do  not  meet  this 
large  I/O  requirement.  A  single  PLD  device  is  not  suited  for  this 
specific  application  because  flexible  I/O  and  buried  registers 
are  not  supported.  Instead,  the  design  must  be  divided  into 
four  sections.  The  counter  fits  in  a  PALC16R6Z  device  and  the 
32-to-1  multiplexer  requires  three  PALC20L8Z  devices. 

Since  a  large  32-to-1  multiplexer  is  not  available  as  a  discrete 
part,  it  must  be  built  using  smaller  multiplexers.  The  total  chip 
count,  using  discrete  logic,  is  five  devices:  two  74AS850s,  two 
74ALS257s  and  one  74AS867.  A  combination  of  PAL  devices 
and  discrete  logic  devices  is  possible,  however,  it  is  also  a 
multiple  chip  solution. 

Monolithic  Memories'  LCA  device  is  a  high-density  programma- 
ble CMOS  circuit  with  flexible  l/Os.  It  has  forty  pins  which  are 
user-defined  as  either  inputs,  outputs,  or  bidirectional.  The 
LCA  device  meets  the  I/O  requirement  for  this  design.  Since 
the  LCA  circuit  allows  multiple  logic  levels,  implementing  the 
counter  does  not  use  up  output  pins.  Moreover  an  LCA  device 
enables  the  32-to-1  multiplexer  with  counter  to  be  implemented 
in  a  single  low-power  device. 
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Detailed  LCA  Design 

The  M2064  LCA  device  in  a  48-pin  DIP  is  used  in  this  design, 
al  hough  both  PLCC  and  PGA  packages  are  available.  Eight  of 
the  forty-eight  pins  are  reserved  for  programming,  power,  and 
ground,  and  the  remaining  forty  I/O  pins  are  available  to  the 
user.  Table  1  shows  the  efficient  use  of  the  package  pinout. 


PIN 

DESCRIPTION 

TOTAL  PINS 

ADDR(4:0) 

External  Address  Inputs 

5  inputs 

CK 

External  Clock  Input 

1  input 

LD 

Counter  Load  Input 

1  input 

D(31:0) 

32-to-1  Multiplexer 

32  inputs 

Data  Inputs. 

OUT 

Output 

1  output 

TOTAL  I/O 

40  pins 

Table  1.  32-to-1  Multiplexer  Pins 


Figure  2  shows  the  block  diagram  of  the  32-to-1  multiplexer  and 
the  counter.  The  select  lines  of  the  32-to-1  multiplexer  are  ei- 
ther the  address  lines  or  the  output  of  the  counter.  When  LD  is 
deasserted,  the  5-bit  counter  sequentially  selects  each  input. 
Asserting  the  LD  signal  loads  the  counter  with  the  external  ad- 
dresses.It  also  selects  the  external  address  for  the  32-to-1 
multiplexer's  select  lines.  This  permits  random  input  selection 
from  the  external  address  lines  which  supports  statistical  mul- 
exing. 


tip 


ADDR 


LD 

CK 


5-BIT 
>  COUNTER 


D  ^- 


32:1 
MUX 


Figure  2.  Block  Diagram 

The  select  signals  for  the  32-to-1  multiplexer  are  determined  by 
the  LD  signal.  When  LD  =  0,  the  counter  selects  the  multiplex- 
er's output.  When  LD  =  1 ,  the  counter  is  loaded  and  the  exter- 
nal address  selects  the  output.  Also,  the  LD  asynchronously 
loads  the  5-bit  counter.  Table  2  summarizes  the  device  opera- 
tion. 


INPUTS 

OUTPUT 

FUNCTION 

LD  ADDFt(4:0) 

OUT 

Counter  as  Select 
Address  as  Select 
and  Load  Counter 

0  XXXXX 

1  ADDR(4:0) 

D(CNT) 
D(ADDR) 

Table  2.  32-to-1  Multiplexer  Function  Table 

The  design  was  entered  with  FutureNet's  DASH  schematic  cap- 
ture package  using  high-level  macros.  Then  the  schematic 
capture  file  was  translated  into  an  LCA  design  file.  The  transla- 
tion software,  called  PIN2LCA,  partitions  the  design  into  CLBs 
and  generates  the  equations  for  the  CLBs'  configuration.  The 
translation  software  produces  an  unrouted  LCA  design  file. 
This  design  file  requires  final  placement  of  CLBs  and  intercon- 
nect routing  which  was  performed  using  Monolithic  Memories' 
APR  software,  an  automatic  place  and  route  program.  After  the 
design  is  placed  and  routed,  it  was  optimized  with  Monolithic 
Memories'  XACT  Design  Editor  System.  The  XACT  system  pro- 
vides a  graphic  interface  to  manually  optimize  the  CLB  logic 
functions  and  connections. 

The  logic  functions  and  interconnects  of  an  LCA  device  are  es- 
tablished with  CMOS  memory  cells,  so  the  array  is  never  physi- 
cally altered,  although  it  is  physically  programmed.  A  program- 
mable LCA  device  can  be  reconfigured  for  the  prototype.  In  ad- 
dition, it  can  be  reprogrammed  any  number  of  times  in  the  tar- 
get system.  With  reconfigurable  devices  such  as  this  one,  the 
array  can  also  be  programmed  on  power-up,  or  whenever  the 
design  details  need  to  be  changed.  For  volume  production,  a 
hard-array  version  will  be  available. 


Entering  the  Design  with  Futurenet 

The  schematic  entered  in  FutureNet's  DASH  is  comprised  of 
different  components  called  from  the  macro  library  supplied  by 
Monolithic  Memories.  The  M8-1  and  M4-1  macros  are  the  multi- 
plexers which  implement  the  32-to-1  multiplexer  as  shown  in 
Figure  3.  The  5-bit  counter  was  built  from  INV,  C16BPRD, 
XOR2,  and  FDM  macros.  Since  the  C16BPRD  macro  is  only  a 
4-bit  parallel-load  binary  counter,  the  XOR2  and  FDM  macros 
were  added  to  form  another  bit  which  increased  the  length  to 
five  bits.The  GMUX  macro,  a  2-to-1  multiplexer,  selects  be- 
tween the  counter  and  the  external  address  lines. 

To  translate  from  FutureNet  to  LCA,  the  PIN2LCA  program  is  in- 
voked. This  program  generates  an  unrouted  <filename>.LCA 
from  the  FutureNet's  <filename>.PIN.  The  PIN2LCA  software 
partitions  the  design  into  CLBs  and  generates  each  CLB's  con- 
figuration. 

The  PIN2LCA  performs  logic  reduction  by  eliminating  unused 
logic  to  maximize  CLB  utilization.  With  this  design,  for  exam- 
ple, C16BPRD's  reset  logic  is  not  used,  and  PIN2LCA  elimi- 
nates all  the  reset  logic  in  the  C16BPRD  macro.  Another  means 
to  maximize  CLB  utilization  is  by  combining  macro  logic.  At 
times,  this  combined  logic  results  in  a  single  CLB  which  imple- 
ments logic  from  two  different  macros.  For  designs  not  utilizing 
all  the  CLBs,  this  logic  reduction  is  not  required  and  may  actu- 
ally hinder  placement  and  delay  tradeoffs.  If  it  does  produce 
delay  problems,  it  is  possible  to  edit  the  CLBs  in  XACT. 
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At  this  stage  in  the  design  flow,  the  PIN2LCA  software  can 
generate  a  P-SILOS  simulation  file.  Since  the  <filename>.SIM 
was  generated  from  an  unrouted  LCA  design  file,  the  simulation 
only  has  unit  delays  and  does  not  have  actual  routing  delays. 
The  simulation  verifies  the  logic,  but  does  not  give  any  timing 
information.  It  may  be  necessary  to  make  changes  in  DASH'M 
depending  on  the  simulation  results. 


Place  and  Route  Using  APR 

Once  the  logic  is  finalized,  the  LCA  design  must  be  placed  and 
routed.  Monolithic  Memories'  APR  software  performs  the  final 
placement  of  CLBs  and  interconnect  routing.  The  input  to  APR 
is  an  unrouted  <filename>.LCA,  and  a  routed  LCA  design  file 
and  report  document  are  the  outputs. 

At  first  glance,  the  APR  program  seems  cumbersome  to  apply 
to  every  application.  However,  it  can  be  used  to  create  an  effi- 
cient design.  While  the  software  uses  random  placement,  it 
does  allow  tailoring  for  specific  requirements. 

Initially,  the  multiplexer  and  counter  design  did  not  route  com- 
pletely. To  achieve  good  placement  and  100%  routing  comple- 
tion, several  things  were  tried.  Some  worked  well  while  others 
did  not.  An  overview  of  how  100%  routing  completion  was 
achieved  is  explained  below. 

Since  this  design  utilizes  only  53%  of  the  LCA  device,  delay  is 
a  larger  concern  than  CLB  utilization.  In  this  case,  placement 
is  critical  to  achieve  an  efficient  design  with  the  desired  de- 
lays. The  ideal  placement  would  be  for  the  5-bit  counter  and  2- 
to-1  multiplexer  to  be  placed  near  the  ADDR(4:0),  CK,  and  LD 
signals.  The  four  M8-1  macros  should  be  near  the  8-bit  cluster 
of  inputs.  The  remaining  M4-1  macro  should  be  placed  near  the 
OUT  signal. 

In  FutureNet,  the  l/Os  are  assigned  pin  numbers  so  that  the 
signals  are  clustered  into  groups.  These  I/O  assignments  re- 
strict the  APR's  placement  of  the  signals  and  are  found  in  the 
<filename>.SCP.  Other  restrictions  and  options  are  specified 
when  invoking  the  APR  software.  The  options,  "-a",  "-g",  "-e", 
and  "-k",  tailor  the  placement  for  this  specific  design. 

The  APR's  "-a"  option  specifies  the  depth  of  CLB  logic  levels 
which  are  considered  connected.  In  this  design,  "-a2"  was  se- 
lected because  the  C16BPRD  and  M8-1  macros  are  implement- 
ed in  two  CLB  logic  levels. 

The  "-g"  and  "-e"  options  are  related.  The  "-g"  option  specifies 
the  number  of  CLBs  that  should  be  grouped  together.  The  "-e" 
option  determines  the  number  of  CLBs  in  a  group  which  should 
be  evaluated  for  all  possible  placements.  Since  the  C16BPRD 
and  M8-1  macros  contain  four  to  six  CLBs,  the  options,  "-e4" 
and  "-g4",  were  used.  This  particular  value  was  selected  be- 
cause it  was  the  largest  possible  value  which  maintained  the 
e>=g  relationship.  Avoiding  values  where  e<g  insures  as  much 
as  possible  that  the  CLBs  within  each  group  are  placed  in  the 
best  possible  arrangement.  This  maintains  the  groups'  integrity. 


The  "-k"  option  specifies  the  number  of  shapes  per  group.  By 
trial  and  error,  "-k6"  works  well  for  this  design.  With  "-k6",  six  of 
the  best  arrangements  or  shapes  were  saved  for  each  group  of 
CLBs.  Every  combination  of  shaped  group  is  tried  with  all  six 
other  grouped  shapes  and  evaluated  for  best  placement.  Gen- 
erally, the  larger  values  result  in  better  placements,  however, 
run-time  grows  exponentially.  While  larger  values  of  "-k"  were 
tried,  they  did  not  improve  the  placement  significantly. 

Running  APR  with  these  options  and  restrictions  did  not  result 
in  a  100%  routed  design.  Therefore,  more  application-specific 
information  was  required.  Noting  that  the  APR  software  was  not 
fully  utilizing  the  high-level  macro  information,  placement  could 
still  be  optimized.  The  APR's  report  file  shows  that  the  CLBs 
which  comprise  C16BPRD  or  the  M8-1  macros  were  not 
grouped  together.  Instead,  the  CLBs  were  randomly  spread 
over  the  device.  In  a  constraint  file,  <filename>.CST,  CLBs 
from  the  high-level  macros  could  be  grouped  together  (see  Ap- 
pendix A).  If  necessary,  the  exact  CLB  placement  could  be 
specified  in  the  same  file. 

To  generate  the  constraint  file,  CLBs  which  implement  each 
high-level  macro  were  identified  using  the  cross-reference  file 
generated  by  PIN2LCA,  <filename>.CRF.  The  <filename>.CRF 
file  is  organized  into  three  cross-reference  sections:  macros, 
CLBs,  and  lOBs. 

The  macro  cross-reference  section  identifies  and  assigns  a 
hierarchical  symbol  number  to  each  macro.  The  symbol  num- 
ber is  used  to  generate  the  default  CLB  names. 

21-1,  \M8-1. DWG 
Path: 


ASSIGNED" 
SYMBOL  NUMBER 


\USER\THERESA\TDM.DWG(32) 
Title:  M8-1 


The  CLB  cross-reference  section  of  the  <filename>.CRF  con- 
tains the  CLB  names.  For  macros  with  multiple  CLBs,  the  sys- 
tem assigns  default  names.  The  default  CLB  name  consists  of 
two  parts.  The  first  is  the  macro  symbol  number  as  specified  in 
the  macro  section  and  the  second  number  is  the  signal  name. 
By  grouping  CLBs  with  related  symbol  numbers,  all  the  CLBs 
for  a  given  macro  will  be  placed  together. 

CLB  BD  Name  =  '21-D03': 

F  =  signal  '21-D03',  contains: 

symbol  '26-OR(2)\  output  signal  =  '21-D03' 
symbol  '26-AND(3)',  output  signal  =  '26-SO' 
symbol  '27-OR(2)',  output  signal  =  '21 -D01 ' 
symbol  '26-AND)1)',  output  signal  =  '26-S1' 

The  IOB  cross-reference  section  shows  the  IOB  assignments. 
For  this  design,  the  assignments  are  those  specified  in  the  Fu- 
tureNet's  schematic. 

IOB  P30:  Name  =  'D13',  Symbol  =  'PIN(40)' 

I  =  signal  'D13' 
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Monolithic  Memories'  XACT  Design  Editor  System  can  increase 
resource  utilization  and  performance  by  manually  modifying 
the  placement  and  routing.  The  XACT  system  provides  a 
graphic  interface  to  specify  the  CLB  design.  All  CLB  logic  func- 
tions and  connections  can  be  optimized  for  the  designs' 
needs.  With  XACT,  the  designer  can  partition  the  design  and 
optimize  the  placement  of  logic  blocks  to  gain  the  utilization 
and  performance  required. 

The  clock  line  routing  was  optimized  to  reduce  clock  skew.  Us- 
ing the  clock  buffer  resources,  the  common  clock  is  driven  by 
the  global  clock  buffer.  To  optimize  internal  routing,  "long  lines" 
were  used  for  signals  with  large  fanouts  such  as  the  select 
lines  for  the  multiplexers  and  the  LD  signal.  Use  of  the  "long 
lines"  minimizes  the  delay  for  these  control  signals.  The  "long 
lines"  were  selected  by  manually  routing  the  signal  net  through 
the  programmable  interconnect  points  (PIPs). 

On  the  device  itself,  implementing  the  5-bit  counter  and  2-to-1 
multiplexers  requires  ten  CLBs;  and  the  32-to-1  multiplexer  re- 
quires twenty-four  CLBs,  giving  a  total  of  thirty-four  CLBs.  This 
design  utilizes  53%  of  the  LCA  device. 


-implemented  in  a  singletMOS  device.  FutureNet's  DASH 
schematic  capture  with  Monolithic  Memories'  LCA  macro  library 
was  used  to  implement  a  design  in  an  LCA  device.  Placement 
of  CLBs  and  signal  routing  were  performed  by  Monolithic  Mem- 
ories' APR  software.  Optimization,  which  may  be  required  to 
improve  performance,  can  be  done  using  Monolithic  Memories' 
XACT  Design  Editor  System.  This  is  useful  for  placing  and  rout- 
ing critical  signal  nets  such  as  clock  or  control  signals.  More 
information  can  be  found  in  P-SILOS,  FutureNet,  and  LCA  user 
guides  and  handbooks. 

This  design,  developed  with  an  LCA  device,  is  available  upon 
request.  The  FutureNet  drawing,  LCA  design  and  bit  pattern 
files  can  be  provided  for  programming  the  LCA  device  in  an  EP- 
ROM.  Please  ask  for  design  XDES07.LCA. 


Print  World:  THERESA. LCA  (2064PD48-7O)  ,  XACT  1.30,    10:15:46  JUL  31,  1987 
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Figure  3.  FutureNet  Schematic 
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Appendix  A>  Constraint  File 


;    CONSTRAINT  FILE  GROUPING  MACROS   IN  BLOCKS 

DEFINE  GROUP  CNT_GROUP 
3-T3   3-T2  CKB 
C4  C2   C3   CI  CO 
M3  M2; 

DEFINE  GROUP  OUT_GROUP 
CLB1  4  5-D2  3; 

DEFINE  GROUP  Ml_GROUP 
47-SO 

19-SO  19-S1 
13-D03  13-D47; 

DEFINE  GROUP  M2_GROUP 
47-S1 

15-SO  15-S1 
21-D03  21-D47; 

DEFINE  GROUP  M3_GROUP 
27-SO  27-S1 
23-SO  23-S1 

36-  SO  36-Sl; 

DEFINE  GROUP  M4_GROUP 
OUTM4 

42-SO  42-S1 

37-  D47  37-D67 
39-SO; 
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$ 

S  Simulation  file  for  design  'TDM.sim'  ,  type  '2064N48-50' 
S  Created  by  PIN2LCA  Ver  .  1  .Olx  at  10  :  58  :  00  JUL  14,  1987 
S 

!  INPUT  TDM.  sim 


GLOBALRESET-  .  CLK  0  SO  1  SI  S  Initial  pulse  to  reset  latches 

CK.PAD  .  CLK  0050011 0000  .  REP  0 


LD . PAD 

.  CLK  0 

0  40000 

I 

ADDRO . PAD 

.  CLK  0 

0  40000 

0 

41000 

1 

42000 

0 

.  KEr  4UUUU 

+  78000  0 

ADDR1 . PAD 

.  CLK  0 

0  40000 

0 

42000 

1 

44000 

0 

.  REP  40000 

+  78000  1 

ADDR2 . PAD 

.  CLK  0 

0  40000 

0 

44000 

1 

48000 

0 

.  REP  40UU0 

+  78000  1 

ADDR3 . PAD 

.  CLK  0 

0  40000 

0 

48000 

1 

56000 

0 

.  REP  40000 

+  78000  0 

ADDR4 . PAD 

.  CLK  0 

0  40000 

0 

56000 

1 

72000 

0 

.  REP  40000 

+  78000  1 

DO . PAD 

.  CLK  0 

1  1000 

0 

40000 

0 

41000  1 

Dl .PAD 

.  CLK  0 

0  1000 

1 

2000 

0 

40000 

1 

41000  0  42000 

1 

D2  .  PAD 

,  CLK  0 

0  2000 

1 

3000 

0 

40000 

1 

42000  0  43000 

1 

D3  .  PAD 

.  CLK  0 

0  3000 

1 

4000 

0 

40000 

1 

43000  0  44000 

1 

D4 . PAD 

.  CLK  0 

0  4000 

1 

5000 

0 

40000 

1 

44000  0  45000 

1 

D5 . PAD 

.  CLK  0 

0  5000 

1 

6000 

0 

40000 

1 

45000  0  46000 

1 

D6 . PAD 

.  CLK  0 

0  6000 

1 

7000 

0 

40000 

1 

46000  0  47000 

1 

D7 .PAD 

.  CLK  0 

0  7000 

1 

8000 

0 

40000 

1 

47000  0  48000 

1 

D8 . PAD 

.  CLK  0 

0  8000 

1 

9000 

0 

40000 

1 

48000  0  49000 

1 

D9 .PAD 

.  CLK  0 

0  9000 

1 

10000 

0 

40000 

1 

49000  0  50000 

1 

D10 .PAD 

.  CLK  0 

0 10000 

1 

11000 

0 

40000 

1 

50000  0  51000 

1 

Dll .PAD 

.  CLK  0 

0 11000 

1 

12000 

0 

40000 

1 

51000  0  52000 

1 

D12.PAD 

.  CLK  0 

0 12000 

1 

13000 

0 

40000 

1 

52000  0  53000 

1 

D13.PAD 

.  CLK  0 

0 13000 

1 

14000 

0 

40000 

1 

53000  0  54000 

1 

D14.PAD 

.  CLK  0 

0 14000 

1 

15000 

0 

40000 

1 

54000  0  55000 

1 

D15 . PAD 

.  CLK  0 

0 15000 

1 

16000 

0 

40000 

1 

55000  0  56000 

1 

D16.PAD 

.  CLK  0 

0 16000 

1 

17000 

0 

40000 

1 

56000  0  57000 

1 

D17.PAD 

.  CLK  0 

0 17000 

1 

18000 

0 

40000 

1 

57000  0  58000 

1 

D18.PAD 

.  CLK  0 

0 18000 

1 

19000 

0 

40000 

1 

58000  0  59000 

1 

D19 .PAD 

,  CLK  0 

0 19000 

1 

20000 

0 

40000 

1 

59000  0  60000 

1 

D20 . PAD 

.  CLK  0 

0  20000 

1 

21000 

0 

40000 

1 

60000  0  61000 

1 

D21 .PAD 

.  CLK  0 

0  21000 

1 

22000 

0 

40000 

1 

61000  0  62000 

1 

D22.PAD 

.  CLK  0 

0  22000 

1 

23000 

0 

40000 

1 

62000  0  63000 

1 

D23.PAD 

.  CLK  0 

0  23000 

1 

24000 

0 

40000 

1 

63000  0  64000 

1 

D24 . PAD 

.  CLK  0 

0  24000 

1 

25000 

0 

40000 

1 

64000  0  65000 

1 

D25 . PAD 

.  CLK  0 

0  25000 

1 

26000 

0 

40000 

1 

65000  0  66000 

1 

D26 . PAD 

.  CLK  0 

0  26000 

1 

27000 

0 

40000 

1 

66000  0  67000 

1 

D27.PAD 

.CLKO 

0  27000 

1 

28000 

0 

40000 

1 

67000  0  68000 

1 

D28 . PAD 

,  CLK  0 

0  28000 

1 

29000 

0 

40000 

1 

68000  0  69000 

1 

D29 .PAD 

.  CLK  0 

0  29000 

1 

30000 

0 

40000 

1 

69000  0  70000 

1 

D30.PAD 

.  CLK  0 

0  30000 

1 

31000 

0 

40000 

1 

70000  0  71000 

1 

D31 .PAD 

.  CLK  0 

0  31000 

1 

32000 

0 

40000 

1 

71000  0  72000 

1 

.MONITOR  CK.PAD  LD. PAD  ;  ADDR4  .  PAD  ADDR3  .  PAD  ADDR2  .  PAD  ADDR1  .  PAD  ADDRO  .  PAD  ;  ; 

+  C4  C3  C2  CI  CO  ; 

+  DO  .  PAD  Dl  .  PAD  D2  .  PAD  D3  .  PAD  D4  .  PAD  D5  .  PAD  D6  .  PAD  D7  .  PAD  ; 

+  D8  .  PAD  D9  .  PAD  D10  .  PAD  Dll  .  PAD  D12  .  PAD  D13  .  PAD  D14  .  PAD  D15  .  PAD  ; 

+  D16  .  PAD  D17  .  PAD  D18  .  PAD  D19  .  PAD  D20  .  PAD  D21  .  PAD  D22  .  PAD  D23  .  PAD  ; 

+  D24  .  PAD  D25  .  PAD  D26  .  PAD  D27  .  PAD  D28  .  PAD  D29  .  PAD  D30  .  PAD  D31  .  PAD  ;  OUT  .  PAD 

.  TABLE  CK  .  PAD  LD  .  PAD  ;  ADDR4  .  PAD  ADDR3  .  PAD  ADDR2  .  PAD  ADDR1  .  PAD  ADDRO  .  PAD  ;  ; 

+  C4  C3  C2  CI  CO  ; 

+  DO  .  PAD  Dl  .  PAD  D2  .  PAD  D3  .  PAD  D4  .  PAD  D5  .  PAD  D6  .  PAD  D7  .  PAD  ; 

+  D8  .PAD  D9.PAD  D10.PAD  Dll  .PAD  D12.PAD  D13.PAD  D14.PAD  D15.PAD  ; 

+  D16  .  PAD  D17  .  PAD  D18  .  PAD  D19  .  PAD  D20  .  PAD  D21  .  PAD  D22  .  PAD  D23  .  PAD  ; 

+  D24  .  PAD  D25  .  PAD  D26  .  PAD  D27  .  PAD  D28  .  PAD  D29  .  PAD  D30  .  PAD  D31  .  PAD  ;  OUT  .  PAD 
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Introduction 

In  a  Private  Branch  Exchange  (PBX)  or  Central  Office  (CO), 
incoming  digitized  voice  circuits  are  "switched"  or  routed  to 
their  appropriate  destination.  The  "switching"  takes  place  while 
the  data  is  in  serial  or  parallel  format.  An  eight-bit  format  conver- 
ter circuit,  widely  used  in  a  variety  of  PBX  and  CO  architectures, 
translates  the  serial  data  streams  into  parallel  data  streams  and 
vice-versa. 

The  eight-bit  format  converter  circuit  requires  a  large  number  of 
register  elements.  These  elements,  namely  serial  shift  registers, 
can  be  implemented  efficiently  in  an  LCA  device  due  to  its  high 
register  content  and  flexible  structure.  The  LCA  M2018  pro- 
grammable device  contains  one  hundred  configurable  register 
elements,  ninety-nine  of  which  are  used  in  the  implementation 
of  the  converter  circuit.  This  circuit  is  capable  of  running  at  12.5 
MHz  and  18.5  MHz  in  a  50-MHz  and  70-MHz  LCA  device, 
ctively. 


Powerful  software  tools  and  circuit  uniformity  allowed  the 
design  to  be  laid  out  and  simulated  in  just  two  days.  However,  if 
di  ferent  support  logic  becomes  necessary,  PBX  vendors  are 
able  to  generate  a  new  configuration  program  in  the  LCA  re- 
programmable device.  Since  these  devices  are  manufactured  in 
a  CMOS  technology,  a  complete  PBX  system  with  several  LCA 
devices  can  help  keep  the  active  power  consumption  down. 
(Pease  refer  to  the  LCA  Design  and  Applications  Handbook- 
Reference  3  for  basic  information  regarding  structure  and  pro- 
gramming aspects.) 


Overview  of  a  PBX 

Central  Offices  (CO)  and  Private  Branch  Exchanges  (PBX)  pro- 
vide both  telephone  and  data  services.  A  PBX  may  be  consi- 
dered as  a  localized  telephone  exchange  serving  the  intercon- 
nection requirements  of  users  in  office  environments.  Outgoing 
calls  are  directed  out  of  the  PBX  and  routed  through  the  local 
CO  to  the  far  destination.  In  general  terms,  a  PBX  or  a  CO 
consists  of  major  blocks  shown  in  Figure  1 .  Racks  of  peripheral 
equipment  modules  containing  line  cards  provide  access  to 
telephone  units,  or  to  data  adapter  modules  which  provide  data 
communication  services.  Trunk  cards  provide  high-speed  links 
to  host  computers,  or  other  PBXs.  Network  switching  modules 
interconnect  calling  parties  to  answering  parties  via  digital  mul- 
tiplexed links.  Control  modules  containing  a  Central  Processor 
Unit  (CPU),  mass  storage  and  local  memory,  perform  the 
"setup"  and  "tear-down"  of  each  circuit  connection,  as  well  as 
the  monitoring  of  PBX  activities.  Central  resources  are  modules 
of  hardware  and  software  which  operate  in  a  time-shared 
manner.  The  central  resources  include  facilities  such  as  ringing 
tone  generator  and  message  recording  modules. 


MICROPROCESSOR  BUS 


SYSTEM  ACCESS^ 
TERMINAL 


TO  CO., 
PHONE       PBX,  ETC. 


^CONTROL 
I  MODULES 


CENTRAL 
RESOURCES 


Figure  1.  PBX  Communication  and  Control  Architecture 
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Hierarchy  of  Channel  Multiplexing 
in  a  PBX 

The  main  function  of  switching  modules  is  to  interconnect  cir- 
cuits between  two  or  more  parties  such  as  in  a  conference  call. 
A  digital  telephone  unit,  or  a  line  card,  contains  a  Coder/De- 
coder (CODEC)  device.  The  CODEC  device  digitizes  voice  and 
transmits  it  into  a  Pulse  Code  Modulated  (PCM)  data  stream  on 
a  common  serial  data  highway  at  appropriate  time  intervals, 
which  are  assigned  at  the  start  of  the  call. 

Analog  voice  is  sampled  at  8  KHz  and  passes  through  an  eight- 
bit  analog-to-digital  converter.  The  digitized  sample  (eight-bit 
binary  word)  is  transmitted  serially  to  produce  a  serial  data 
stream  of  (8  x  8  =  )  64  Kbps.  Thirty-two  of  these  samples  or 
CODECs  are  normally  connected  to  a  common  highway  operat- 
ing at  (32  x  64  =  )  2.048  Mbps.  Each  CODEC  is  assigned  "Serial 
Time  Slots"  of  eight-bit  duration,  and  sends  its  eight  bits  of  data 
at  the  rate  of  2.048  Mbps  every  8  KHz.  Another  time  slot  is 
available  on  a  separate  highway  for  the  receive  data  stream. 

Therefore,  a  2.048  Mbps  highway  can  support  thirty-two  chan- 
nels or  voice  connections  in  one  direction.  However,  there  are 
several  ways  to  combine  serial  highways  for  additional  voice 
connections.  One  way  is  to  combine  four  highways  into  one 


high-speed  highway  of  8.192  Mbps.  Further  integration  is  possi- 
ble, but  the  aggregate  data  rate  becomes  high  and  leads  to 
implementation  problems  including  timing,  signal  reflection, 
and  radiation,  to  name  a  few. 

Another  way  to  integrate  channels  without  increasing  the  clock 
rate  is  to  convert  eight  serial  data  highways  into  a  single  eight-bit 
parallel  bus.  With  this  method,  the  serial  clock  rate  and  the 
parallel  clock  rate  are  the  same,  2.048  Mbps.  This  means  that 
data  throughput  on  the  eight  serial  highways  of  16.384  Mbps  is 
maintained  in  the  eight-bit  parallel  bus  operating  at  2.048  Mbps. 
This  works  by  careful  alignment  of  the  incoming  parallel  time 
slots  to  serial  time  slots  on  the  serial  highways.  An  eight-bit 
format  converter  scheme,  sometimes  called  a  "Corner  Bender", 
is  widely  used  in  PBX  or  CO  switching  modules.  Four  of  these 
parallel  buses  are  combined  (see  Figure  2)  to  form  a  single 
parallel  system  bus  that  is  thirty-two  bits  wide,  yet  still  operates 
at  2.048  Mbps.  The  multiplexing  architecture  shown  in  Figure  2 
is  used  in  the  Harris  20-20T"  Integrated  Network  Switch  (see 
Reference  1). 

Space  switching  and/or  time  slot  switching  to  interconnect  dif- 
ferent parties  is  performed  either  in  the  serial  highway  format  or 
in  the  parallel  data  stream  format. 
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Figure  2.  Multiplexing  Levels 
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additional  shift  registers  which  provide  proportional  delays  to 
synchronize  the  outputs.  The  principle  of  operation  is  shown  in 
Figure  3,  and  the  circuit  is  shown  in  Figure  4.  Parallel  data  from 
the  parallel  bus  is  loaded  with  each  clock  into  one  of  the  eight 
shift  registers  in  a  rotating  pattern.  The  data  then  shifts  out  of  the 
shift  register  with  each  subsequent  clock.  After  all  eight  registers 
have  been  loaded,  the  first  register  will  be  empty  on  the  next 
clock.  New  data  is  loaded  into  this  register  at  the  same  clock 
edge  as  the  last  bit  shifts  out  to  the  next  stage.  Thus,  a  continu- 


PARALLEL  TIME  SLOTS 


ous  flow  of  data  through  the  registers  is  insured.  The  data  out  of 
the  eight  shift  registers  is  skewed  in  relation  to  each  other  as 
shown  in  Figure  3b.  The  various  space  slots  on  the  highways 
need  to  be  aligned  to  keep  synchronization.  This  is  achieved  by 
proportionally  delaying  the  outputs.  Channel  1  output  is  delayed 
by  seven  clocks  (using  a  seven-bit  serial  shift  register),  channel  2 
is  delayed  by  six  clocks,  etc.  Channel  8  output  does  not  go 
through  any  additional  delay.  The  final  outputs  of  the  delay 
registers  are  aligned  and  data  is  clocked  out  in  a  synchronous 
fashion.  See  Figure  3c. 


01     234567,01  23 

Figure  3a.  Parallel  Bus  (8  Bit),  2.048  Mbyte/sec 


Figure  3b.  After  Parallel  to  Serial  Conversion 

|  i 

BYTE*  0 


Figure  3c.  After  Appropriate  Time  Delay  to  Synchronize 


Figure  3.  Principles  of  "Corner  Bending" 
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Figure  4  also  shows  the  additional  circuitry  needed  to  control 
the  shift  registers.  A  preloadable  three-bit  counter  keeps  count 
of  eight  clock  pulses.  The  parallel-to-serial  bus  conversion  can 
be  "programmed"  to  start  in  any  register  by  setting  the  approp- 
riate binary  value  on  the  counter  preload  inputs  and  applying  a 
pulse  to  the  sync  input.  If  the  loading  sequence  produced  by  the 
counter  is  not  required,  it  can  be  disabled  by  connecting  the 
"clock"  to  "sync"  input.  At  each  positive  clock  edge,  the  register 
loaded  will  depend  upon  the  data  at  the  counter  inputs  on  the 
previous  positive  clock  edge.  The  3-to-8  decode  circuit  produ- 


ces load  pulses  to  latch  parallel  data  into  the  shift  registers. 
Figure  5  shows  the  parallel-to-serial  conversion  data  matrix. 

The  description  above  shows  the  conversion  of  parallel  data  into 
eight  streams  of  serial  data.  However,  the  same  circuit  also 
performs  serial-to-parallel  conversion.  A  serial  eight-bit  data 
stream  on  one  of  the  eight  inputs  appears  as  an  eight-bit  parallel 
word  on  the  eight  outputs.  Successive  parallel  words  appearing 
at  the  eight  outputs  correspond  to  the  serial  data  on  each  of  the 
eight  inputs  in  rotation.  See  Figure  5. 


SYNC 

LOAD  PULSE 

GENERATOR 

o/p  /EN 


H^o  ►  5 


i  ►  4 


Figure  4.  8-Bit  Format  Converter  ("Corner  Bender")  Circuit 
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INPUTS 


DATA 
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D  |  DO |  PI  |  D2|D3  |D4  |D5  |D6  |D7  | 
E  |  EO  |  E1  |E2  |  E3 1  E4  |  E5  |  E6|E7| 


F  |F0|  F1|  F2 1  F3  |  F4  |  FS  |F6~|F7^ 


G  |G0|  Gl]G2lG3|G4lG5|G6|G7| 
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I     I      I  I 


III! 


.H  |  HO |  HI  |H2  |H3  [H4  |H5  |H6~TH7l 
0 
1 
2 
3 
4 
5 
6 
7 


I      I  I 


I     I     1  I 


iHOlGOl  FO  |  EO  j  DO  |  CO  |  BO  |  AO  1 


I  H11  Gil  F1 1  E1  |D1  |  CI  |  B1  |  A1  I 


H2|  G2j  F2[  E2  |D2|C2  |B2  |  A2[ 


I  H3|  G3|  F3  j  E3  |  D3  |  C3  |  B3  ]  A3 


|H4|G4|  F4 1  E4  I  D4  I C4  I  B4  I A4  I 
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PARALLEL  TO  SERIAL  CONVERSION 

AO,  BO,  CO,  DO,  EO,  FO,  GO,  HO  =  IN(l)0,  IN(I)1,  IN(I)2,  IN(I)3,  IN(I)4,  IN(I)5,  IN(I)6,  IN(I)7 
A1,  B1,  CI,  D1,  E1,  F1,  G1,  H1  =  IN(ll)0,  IN(II)1,  IN(II)2,  IN(II)3,  IN(II)4,  IN(II)5,  IN(II)6,  IN(II)7 

SERIAL  TO  PARALLEL  CONVERSION 

AO.  A1,A2,A3.A4,A5,A6.A7  =  IN(l)0,  IN(I)1,  IN(I)2,  IN(I)3,  IN(I)4,  IN(I)5,  IN(I)6,  IN(I)7 
BO,  B1,  B2,  B3.B4,  B5,  B6,  B7  =  IN(ll)0,  IN(II)1,  IN(II)2.  IN(II)3.  IN(II)4,  IN(II)5.  IN(II)6,  IN(II)7 

•  IN(1)X  ARE  THE  INPUT  8-BIT  WORDS 

Figure  5.  Data  Conversion 


Comparison  of  Implementation 
Techniques 

The  "Corner  Bender"  circuit  can  be  implemented  in  several 
ways.  The  efficiency  of  an  LCA  design  can  be  compared  to  that 
of  a  design  in  SSI/MSI  logic  devices  or  Erasable  Programmable 
Logic  devices  (EPLDs).  Table  1  contains  the  comparison  of 
package  counts  with  the  different  alternatives. 

The  circuit  implemented  in  74LSxx  devices  requires  nineteen 
packages.  Alternatively,  sixteen  simple  PLD  devices  (such  as  a 
PAL20R8  or  a  PAL20R4)  are  required  to  implement  the  same 
circuit.  Since  the  circuit  is  register  intensive  and  not  designed  for 
high  speed,  PLDs  are  not  chosen  for  this  application. 

The  functionally  larger  EPLD  devices,  EP1200  and  EP1800,  are 
more  register  intensive  and  contain  twenty-eight  and  fourty- 
eight  register  elements,  respectively.  Nevertheless,  four  EP1 200 
devices,  or  two  EP1800  plus  a  smaller  PLD  device,  is  required  to 
implement  this  design. 

The  entire  Corner  Bender  circuit  fits  neatly  into  a  single  LCA 
201 8  device.  It  uses  ninety-nine  out  of  the  one  hundred  available 
internal  macrocells  and  demonstrates  efficient  implementation 
of  shift  registers  and  small  counters.  An  LSI  device  from  Plessey 
(Reference  2)  is  available  to  implement  the  same  circuit.  How- 
ever, it  is  an  NMOS  device  and  limited  to  speeds  of  under  2  MHz. 


EQUIVALENT  CIRCUIT  IN  SSI/MSI  DEVICES  PACKAGES 

 1  

Parallel  in  serial  out  shift  Registers  74LS165 

8 

Serial  in  Parallel  out  shift  Registers  74LS164 

7 

4-bit  counter  74LS161 

1 

3  to  8  decoder  74LS138 

1 

Three-state  buffers  74LS241 

1 

EQUIVALENT  CIRCUIT  IN  EPLDs 

a)  EP1200                   28  macro  cells  each 

4 

b)  EP1800                    48  macro  cells  each 

2 

plus  PAL  device  e.g.,  16R4 

1 

EQUIVALENT  CIRCUIT  IN  MMI  LCA  DEVICE 

LCA  2018  device                  100  macro  cell 

1 

Table  1.  Implementation  Alternatives  for  the  8-Bit  Format 
Converter  Circuit  "Corner  Bender" 
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"Corner  Bender"  Design  in  an 
LCA  Device 

A  count  of  the  register  elements  in  the  circuit  shows  a  need  of 
sixty-four  elements  for  the  eight-shift  registers,  twenty-eight 
elements  for  the  delay  registers,  three  elements  for  the  counter, 
and  four  elements  for  the  3-to-8  decode  circuit.  Each  configura- 
ble logic  block  (CLB)  in  the  LCA  device  can  produce  two  out- 
puts. Hence,  only  four  CLBs  are  required  for  the  3-to-8  decode 


circuit.  Since,  each  CLB  contains  one  register  element,  the  total 
count  of  CLBs  is  therefore  ninety-nine.  The  LCA  2018  contains 
one  hundred  CLBs.  Twenty-two  input/output  blocks  (lOBs) 
were  used.  Figure  6  shows  the  layout  of  the  design  in  the  LCA 
device.  The  circuit  fits  into  a  68-pin  PLCC  package. 


XACT  1.30,    10:50:36  MAY  5,  1987 


Print  World:  fc.lca  (2018PC68-70) ,  XAC 


I    Sm  ess  ese^  mm  mm^mrn  ^^"raS- aS-^^" 


Figure  6.  Layout  Diagram  of  "Corner  Bender"  Circuit  on  LCA  Device  of  Size  10x10 
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eight-bit  shift  register  in  this  design  is  shown  in  Figure  7.  Input  A 
is  the  decoder  input  and  it  is  ANDed  with  input  B,  the  data  to  be 
shifted.  To  implement  the  eight-bit  shift  registers  from  the  one- 
bit  shift  register,  eight  of  the  one-bit  shift  registers  were  linked 
together  so  that  the  output  of  each  register  became  an  input  to 
the  next  register.  This  is  shown  in  Figure  8  where  each  CLB 
represents  one  bit  of  the  eight-bit  shift  register. 

The  three-bit  counter  and  the  3-to-8  decoder  in  this  design  were 
implemented  inCLBsasshown  in  Figures9and  10. The  counter 
is  a  synchronous  binary  counter  with  ripple  carry  and  parallel 
load.  The  decoder  is  a  standard  3-to-8  decoder.  The  outputs  of 
the  counter  become  inputs  to  the  decoder,  whereas  the  outputs 
of  the  decoder  are  used  to  decode  the  eight  8-bit  shift  registers. 

With  the  XACT  Development  System,  the  designer  can  optimally 
arrange  the  logic  blocks  on  the  LCA  device  in  order  to  minimize 
net  delays  between  each  block.  With  this  in  mind,  the  layout  for 
the  design  is  described  below: 

Four  of  the  eight-bit  parallel-to-serial  shift  registers  were  placed 
starting  from  the  top  left-hand  edge  of  the  LCA  device  (see 
Figure  7).  The  three-bit  counter  (three  CLBs)  and  the  3-to-8 
decoder  (four  CLBs)  were  then  placed  on  the  following  row.  The 
next  four  rows  contained  the  last  four  parallel-to-serial  shift 
registers.  This  allowed  the  shift  register  select  lines  to  have 
minimal  delay  spread  when  accessing  all  eight  shift  registers. 
The  seven  serial-to-serial  shift  registers  were  placed  in  the 
remaining  CLBs  as  uniformly  as  possible. 


■  ^s^tnTr didL^svi£n.eij  iTTCTUb'fc ptuxiinii/can  useiocanntercon- 
nection  resources  which  incur  short  signal  propagation  delays, 
whereas  blocks  placed  furtherapart  must  use  either  "long  lines" 
or  other  interconnection  resources.  Manual  optimization  using 
the  delay  efficient  "long  lines"  was  performed  for  the  most  criti- 
cal net  connections.  After  routing  completion,  the  longest  delay 
between  two  clock  pulses  was  the  delay  for  the  counter  to 
change  state,  the  state  which  is  decoded  via  the  3-to-8  decoder 
and  selects  the  appropriate  shift  register  to  load  the  parallel  data 
(see  Figure  7).  This  delay  was  54  ns,  79  ns,  and  106  ns,  respec- 
tively for  the  70-,  50-,  and  33-MHz  versions  of  the  device.  The 
delays  were  measured  using  the  XACT  simulation  package  by 
invoking  the  timing  delay  calculator.  This  translates  to  a  maxi- 
mum circuit  operating  speed  of  about  1 8.5  MHz  for  the  70-MHz 
version  of  the  device,  or  9.43  MHz  for  the  33  MHz  version. 

Although  fabricated  in  a  CMOS  technology,  the  inputs  to  the 
LCA  device  can  be  made  either  TTL  or  CMOS  compatible.  For 
high  fan-out  CMOS  or  LS  TTL-compatible  loads,  the  output 
buffers  of  the  LCA  device  are  capable  of  driving  4  mA.  Moreover, 
each  output  buffer  can  be  put  into  a  HIGH-Z  state  for  bus-driving 
applications.  This  features  was  also  used  in  the  design  of  the 
eight-bit  format  converter. 

More  information  about  entering  a  design  with  the  XACT  Devel- 
opment System  is  included  in  the  LCA  Design  and  Applications 
Handbook  (Reference  3).  Information  about  configuring  the 
LCA  device  is  described  in  the  "Configuring  the  LCA  Device" 
applications  note. 
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Figure  7.  One  Bit  of  an  8-Bit  Serial  -  Parallel  Shift  Register 


Figure  8.  CLB  Schematic  Output  for  the  8-Bit  Shift  Register 
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Figure  9a.  Decode  Outputs  1  and  2  of  3-to-8  Decoder 
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Figure  9c.  Decode  Outputs  5  and  6  of  3-to-8  Decoder 
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Figure  9b.  Decode  Outputs  3  and  4  of  3-to-6  Decoder 
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Figure  9d.  Decode  Outputs  7  and  8  of  3-to-8  Decoder 


Figure  9.  CLB  Configuration  of  a  3-to-8  Decoder 
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Figure  10b.  2nd  Bit  of  a  3-Bit  Counter 


Figure  10a.  1st  Bit  of  a  3-Bit  Counter 
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Figure  10c.  3rd  Bit  of  a  3-Bit  Counter 


Figure  10.  CLB  Configuration  of  a  3-Bit  Binary  Counter  with  Ripple  Carry 


Summary 

The  LCA  device  has  achieved  a  good  solution  to  the  principles 
used  for  multiplexing  digitally-encoded  voice  channels  in  both 
serial  and  parallel  hierarchies.  These  hierarchies  can  accom- 
modate thousands  of  voice  circuits  in  a  PBX  switching  module. 
The  detailed  design  of  the  eight-bit  (parallel-to-serial  and  serial- 
to-parallel)  format  converter  circuit,  or  Corner  Bender,  deve- 
loped in  Monolithic  Memories'  LCA  M2018  device  was  shown  to 
be  efficiently  implemented.  Despite 'the  fast  development  of  the 
system  using  the  XACT  Design  Editor,  the  final  design  was 
programmed  and  bench  tested  to  verify  functional  integrity. 

The  Corner  Bender  design  is  available  from  Monolithic  Memo- 
ries upon  request.  The  bit  pattern  and  .LCA  file  will  be  provided 
for  programming  the  LCA  device  in  an  EPROM.  Please  ask  for 
design  XDES05.LCA 
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Serial  Data  Link  Controller 


Functional  Description 

This  application  was  first  developed  by  LTT,  Conflans  Ste. 
Honorine,  France.  Part  of  the  design,  reprinted  with  courtesy  of 
LTT,  is  used  to  control  a  serial  data  link  based  upon  a  specialized 
LSI  chip. 

Originally  designed  with  six  standard  SSI/MSI  circuits,  this  same 
function  can  now  be  implemented,  not  only  into  a  single 
PAL20RA10,  but  with  even  more  features  and  better  perfor- 
mance. The  function  can  be  divided  into  three  sub-functions: 

1.  Address  Decoding 

2.  Control  Flags 

3.  Transmission  Speed  Selection 

Up  to  four  address  lines  are  allowed  (eight  were  actually  used), 
plus  two  extra  lines  which  are  special  decoding  controls 
(MEM/IO  selection,  Enable  Control . . .).  Two  flip-flops  load  flag 
conditions,  from  the  address  bus  (A1  and  A2),  providing  hand- 
shake between  the  6850  UART  and  the  communication  lines. 
They  have  a  common  clock  which  also  serves  as  Chip  Select 
(CSO)  for  the  UART. 

The  UART  Transmit  clock  (TXCLK)  can  be  directly  connected  to 
the  Receive  Clock  (CK  or  RXCLK)  or  represents  the  Receive 
Clock  value  divided  by  sixteen.  This  function  was  performed  by 
four  "D"  Flip-Flops  connected  as  a  4-stage  Asynchronous 
Divider.  Since  each  basis  cell  in  a  PAL20RA10  has  four  Product 
Terms  available,  this  function  could  be  implemented  either 
asynchronously  or  synchronously.  In  the  PAL  Design  Specifica- 
tion example,  a  4-bit  synchronous  divider  was  used  instead  of 
the  asynchronous  circuit  shown  in  the  schematic. 


Pin  Description 


1 .  TEST    Allows  preload  function  for  testing. 

2.  SYSRESET    Reset  line  from  microprocessor. 

3.  A2    Address  line  from  address  bus. 

4.  A1    Address  line  from  address  bus. 

5.  HDSHAKE    Handshake  line  (CTS/RTS). 

6.  CK    External  clock. 

7.  E    Enable  line  from  microprocessor. 

8.  AUXDECOD   Extra  decoding  line 

(e.g.  board  level  decoding). 

9.  A3    Address  line  from  address  bus. 

10.  A3    Address  line  from  address  bus. 

11.  A3    Address  line  from  address  bus. 

12.  GND    Reference  power  supply  ground. 

13.  /OE   Output  enable  line. 

14.  A6    Address  line  from  address  bus. 

15.  SPEEDSEL    Speed  selection  line. 

16.  DIV4    MSB  4-bit  synchronous  counter. 

17.  DIV3    3rd  stage  synchronous  counter. 

18.  DIV2    2nd  stage  synchronous  counter. 

19.  DIV1    LSB  4-bit  synchronous  counter. 

20.  SCO    UART  chip  select  line  (CSO). 

.  21.  BLOCREC    Bloc  receive  line. 

22.  DIR  DIV    Direct  or  divided  clock. 

23.  /TPH    External  use  flag. 

24.  VCC    5  V  power  supply. 
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simulation  Hesuits 


Title 
Pattern 
Revision 
Author 


Date 


Serial  Data  Link  Controller 

Link.pds 

A 

Jose  Juntas  /  Kelvin  Chow 

Monolithic  Memories  Inc.,  Santa  Clara,  Ca 

3/1/85 


CHIP  SE_CH_CNTRL  PAL20RA10 

TEST  SYSRESET  A2  Al  HDSHAKE  CK  E  AUXDECOD  A3  A4  A5  GND 
/OE  A6  SPEEDS EL  DIV4  DIV3  DIV2  DIV1  CSO  BLOCREC  DIRDIV 
/TPH  VCC 


/TPH 

/TPH. CLKF 
/TPH.SETF 

DIRDIV 
DIRDIV. CLKF 
DIRDIV. SETF 


CSO 
/DIV1 


/DIV1 . CLKF 
/DIV1.SETF 


CSO 

SYSRESET 


CSO 
'  /HDSHAKE 


;Load  A2  as  flag 
;CLK  W/  ADDR.  decode 
;global  system  reset 

;Load  speed  ratio 
;CLK  W/  ADDR.  decode 
;CLR  by  CTS/RTS  line 


=  /DIRDIV  ;Controlled  by  speed 

+  HDSHAKE  ;option  and  CTS/RTS 

;  line 

=  /A6*A5*A4*A3*AUXDECOD*E 

;UART  address  valid 
:=  DIV1  ;4-bit  synchronous 

;divider  LSB 
=  CK  ;CLK  by  CK(external) 

=  /DIRDIV  ;CLR  by  speed  option 


/DIV2 


/DIV2 . CLKF 
/DIV2 . SETF 


/DIV1*/DIV2 
+  DIV1*DIV2 

-  CK 

=  /DIRDIV 

:=  /DIV2*/DIV3 

+  /DIV1*/DIV3 

+  DIV1*DIV2*DIV3 

=  CK 

=  /DIRDIV 

:=  /DIV3*/DIV* 
+  /DIV2*/DIV4 
+  /DIV1*/DIV4 
+  DIV1*DIV2*DIV3*DIV4 
=  CK 

-  /DIRDIV 

:=  /Al 

-  CSO 

=  /HDSHAKE 


/DIV3 


/DIV3.CLKF 
/DIV3 . SETF 


/DIV4 . CLKF 
/DIV4.SETF 

SPEEDSEL 
SPEEDSEL. CLKF 
SPEEDSEL. SETF 

SIMULATION 


TRACE_ON  A1,A2,A3,A4,A5,A6,E, 

AUXDECOD , SYSRESET , /TPH , HDSHAKE 
CSO, SPEEDSEL, DIRDIV, CK, 
DIV1 , DIV2 , DIV3 , DIV4 

SETF  SYSRESET, /HDSHAKE 


;2ND  stage  of 
; divider 

;CLK  by  CK( external) 
;CLR  by  speed  option 

;3RD  stage  of 
;divider 

;CLK  by  CK(external) 
;CLR  by  speed  option 

;4TH  stage  of 
rdivider  MSB 


;CLK  by  CK(external) 
;CLR  by  speed  option 

;Load  speed  choice 
;CLK  W/  ADDR.  decode 

;CLR  by  CTS/RTS  line 


;Signals  to  be 

; observed 


;Reset  all  regs 


CHECK   /SPEEDSEL, /DIRDIV, TPH 

SETF  /SYSRESET, A1,A2, A3, A4,A5,/A6, HDSHAKE,      ;Set  decode 
E ,  AUXDECOD  :  condition 


CHECK   /SPEEDSEL, DIRDIV 

FOR  TO  15  DO 

BEGIN 
SETF  CK 

SETF  /CK 
END 


.■Check  SPEEDSEL  and 
; DIRDIV  regs 

;This  portion 
jsimulates  divide 
;by  four  counter 


A6 
E 

AUXDECOD 
SYSRESET 
/TPH 
HDSHAKE 
CSO 

SPEEDSEL 

DIRDIV 

CK 

DIV1 

DIV2 

DIV3 

DIV4 


g  g   g  gg 

XXHHHHHHHH 
XXHHHHHHHH 
XXHHHHHHHH 
XXHHHHHHHH 
XXHHHHHHHH 
XXLLLLLLLL 
XXHHHHHHHH 
XXHHHHHHHH 
HHLLLLLLLL 
LLLLHHHHHH 
LLHHHHHHHH 
XXHHHHHHHH 
LLLLLLLLLL 
LLLLHHHHHH 
XXXXXHHLHH 
XLLLLLHHHL 
XLLLLLLLLH 
XLLLLLLLLL 
XLLLLLLLLL 


gg  gg  gg  g 

HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 


HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
LHHLHHLHHL 
LLHHHLLLHH 
HHHHHLLLLL 
LLLLLHHHHH 
LLLLLLLLLL 


g  gg  gg  gg 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
HHHHHHHHHH 

HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
HHLHHLHHLH 
HLLLHHHLLL 
LHHHHHHLLL 
HHHHHHHLLL 
LLLLLLLHHH 


gg  gg  gg 
HHHHHHHHHH 


HHHHHHHHHH 


HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
HLHHLHHLHH 
HHHLLLHHHL 
LLLHHHHHHL 
LLLLLLLLLH 


A4 

A5 
A6 

E 

AUXDECOD 
SYSRESET 
/TPH 
HDSHAKE 
CSO 

SPEEDSEL 

DIRDIV 

CK 

DIV1 
DIV2 
DIV3 
DIV4 


gg  gg  gg  g 

HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
LHHLHHLHHL 
LLHHHLLLHH 
LLLLLHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
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The  ZPAL  PALC20R8Z  family  consists  of  four  devices: 
combinatorial  PALC20L8Z,  and  registered  PALC20R8Z, 
PALC20R6Z,  and  PALC20R4Z.  These  devices  offer  zero 
standby  and  very  low  active  power  dissipation,  along  with 
traditional  architecture  and  functionality  of  standard  PAL 
devices.  These  24-pin  devices  offer  high  speed  (TPD  =  35  ns) 
and  are  ideal  for  applications  in  communications  equipment 
and  instrumentation,  which  are  typically  under  severe  power 
constraints. 


This  note  describes  one  such  application  in  communications 
equipment  for  Quadrature  Amplitude  Modulation  (QAM) 
encoding  and  provides  a  design  based  on  a  PALC20R8Z 
device.  QAM  encoding  technique  is  used  in  standalone  modem 
equipment  which  uses  CMOS  devices.  Monolithic  Memories' 
CMOS  ZPAL  devices  became  a  natural  choice  for  replacing 
glue  logic  or  any  complete  function  that  is  being  implemented  in 
discrete  SSI  components. 

QAM  Encoder 

Data  modems  use  encoders  as  a  means  of  increasing  data 
throughput  within  a  given  bandwidth  of  a  channel.  Quadrature 
Amplitude  Modulation  (QAM)  gives  an  input  data  rate  of  four 
times  the  encoding  rate,  which  is  the  output  rate  of  the 
encoder.  The  encoding  rate  determines  the  channel  bandwidth 
required  by  the  output  signal.  Voice  band  data  modems  have 
become  popular  for  use  over  the  switched  telephone  network 
for  interconnecting  personal  computers  to  distant  computers. 
In  these  modems  the  data  encoder  function  is  usually 
integrated  within  the  modem  chip  set.  However,  higher-speed 
modems  for  20-MHz  digital  radio,  satellite  up/down  links  and 
other  group  band  modems  implement  the  encoder  as  a  dis- 
crete hardware  function  due  to  the  speed  requirements.  A 
PALC20R8Z  device  is  ideally  suited  for  implementing  the  data 
encoder  function  because  of  its  ability  to  implement  state 
machines  efficiently  and  its  low  power  consumption. 

The  PAL  Design  Specification  (QAM.PDS)  file  enclosed  shows 
the  QAM  Encoder  example  implemented  using  the  PALASM®2 
software  state  machine  syntax. 


Modem  Encoding  Techniques: 

Figure  1  shows  the  encoder  function  within  a  typical  modem 
transmitter.  Incoming  data  may  be  scrambled  (to  randomize 
the  data)  and  then  passed  through  the  encoder.  The  output 
streams  (I,  Q)  are  passed  through  pulse-shaping  filters  before 
being  multiplied  by  the  sine  and  cosine  carrier  frequencies. 

Let  us  consider  the  QAM  encoding  technique  as  used  in  the 
voice  band  data  modems.  The  scheme  is  also  widely  used  in 
other  high-speed  applications.  Some  of  the  popular  modem 
standards  for  the  switched  telephone  network  are  Bell  21 2A 
(for  1200-bps  operation),  CCITT  V.22  (for  1200-bps  operation), 
CCITT  V.22bis  (for  2400-bps  operation)  and  CCITT  V.32  (for 
9600-bps  operation). 

According  to  Bell  21 2A,  CCITT  V.22  and  V.22bis  recom- 
mendations, full  duplex  communication  over  a  pair  of  tele- 
phone wires  is  achieved  by  separating  the  bandwidth  occupied 
by  the  signals  traveling  in  opposite  directions,  (see  Figure  2). 


800  1S00  2000  2800 

1200  Hz  24O0HZ 
FREQUENCY  -  Hz 

Figure  2.  Full  Duplex  Communication  by  Bandwidth  Separation 

The  transmit  carrier  frequency  of  one  modem  is  1200  Hz  while 
the  far-end  modem  uses  2400  Hz  as  its  transmit  carrier  fre- 
quency. The  encoding  rate  (or  symbol  rate)  for  each  carrier  is 
600  baud  (equivalent  to  transitions  per  second)  and  the  band- 


TO  A  AND  B  CHANNELS 
OF  OSCILLOSCOPE 


IN-PHASE 
DATA  (I) 


TRANSMIT 

DATA   

STREAM 


DATA 

SCRAMBLER 

ENCODER 

QUADRATURE 
PHASE  DATA  (Q) 


PULSE 
SHAPING 
FILTER 


sin  {Loa) 

I— 4—1 


PULSE 
SHAPING 
FILTER 


COSINE  (oi,;,) 

Figure  1.  Data  Encoder  in  a  Typical  Modem  Transmitter 
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width  occupied  by  the  outgoing  signal  spectrum  is  about  800 
Hz,  centered  around  the  carrier.  Since  the  two  bands  of  trans- 
mission do  not  overlap,  no  interference  takes  place  between 
the  two  channels  of  communications. 

The  transmit  data  throughput  in  each  direction  is  increased 
from  the  600  baud  encoding  rate  by  means  of  encoding  two  or 
four  incoming  data  bits  at  a  time.  This  produces  a  multilevel 
signal  which  is  then  used  to  modulate  the  carriers.  The 
transmit  data  bit  rate  is  then  1200  bits  per  second  (bps)  and 
2400  bps  respectively. 

CCITT  V.22bis  and  V.32  recommendation  modems  employ 
Quadrature  Amplitude  Modulation  (QAM)  technique  for 
modulating  the  carrier  frequency.  The  16-state  QAM  signal 
constellation  can  be  observed  at  the  I  and  Q  outputs  (Figure  1) 
of  the  encoder.  The  diagram  of  Figure  3  is  observed  by 
connecting  the  I  and  Q  outputs  to  the  A  and  B  channels  of  a 
dual  channel  oscilloscope  and  turning  the  timebase  to  the  X-Y 
setting. 

The  data  stream  to  be  transmitted  is  divided  into  groups  of  four 
consecutive  bits  called  quadbits.  The  first  two  bits  are 
encoded  as  a  phase  quadrant  change  relative  to  the  quadrant 
occupied  by  the  preceding  signal  element.  See  Figure  3  and 
Table  1  The  last  two  bits  of  each  quadbit  define  the  amplitude 
of  the  signal  element  in  each  phase  quadrant.  The  left-hand 
bits  in  Table  1  are  the  first  of  each  pair  in  the  data  stream  as  it 
enters  the  encoder  after  the  data  scrambler. 

Bell  212A  and  CCITT  V.22  recommendation  modems  employ 
the  Differential  Quadrature  Phase  Shift  Keying  (DQPSK) 
technique  for  modulating  the  carrier.  The  data  stream  to  be 
transmitted  is  divided  into  groups  of  two  consecutive  bits 
called  dibits.  The  dibits  are  encoded  as  a  phase  quadrant 
change  relative  to  the  quadrant  occupied  by  the  preceding 
signal  element.  The  signal  element  corresponding  to  XX01  (Y0, 
Y1.X2,  X3)  in  the  V.22bis  signal  constellation  (Figure  3)  is 
transmitted  irrespective  of  the  quadrant  occupied.  This  ensu- 
res compatibility  with  the  V.22bis  modems  operating  in  a  fall- 
back mode  of  1200  bps. 


QUADRATURE  PHASE  (Q) 


X 

10,11 


X 

10,10 


B    X  3 

10,01 


X  1 

10,00 


X  C 

00,01 


X 

00,11 


-1 

X 


00,10 


X 

11,10 


X 

11,00 


X 

11,11 


A  X 

11,01 


1 

X 

01,00 


IN  PHASE 

(I) 


X   -3--     X  D 


X 

01,10 


x 

01,11 


A,  B,  C,  D  ARE  HALF  RATE 
POINTS  e.g.,  1200  bps 


Figure  3.  QAM  Signal  Pattern  (Y0,  Y1,  X2,  X3)  e.g.,  2400 
bps  (V.22bis) 


FIRST  TWO  BITS 

PHASE 

OF  QUADBITS  OR  A 

QUADRANT 

DIBIT  (X0,  X1) 

CHANGE 

00 

+  90 

01 

0 

11 

270 

10 

180 

Table  1.  Relative  Quadrant  Change  for  Input  (X0,  X1) 
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Figure  4.  PALC20R8Z  Device  QAM  Encoder 
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require  both  phase  and  amplitude  modulation,  are  traditionally 
used  for  deep-space  satellite  links,  and  digital  radio  communi- 
cation applications. 

PALC20R8Z-based  QAM  Encoder 

Figure  4  shows  a  PALC20R8Z  device  implementing  a  high- 
speed QAM  encoder.  Four  bits  of  transmit  data  stream  are 
input  to  the  device  every  symbol  clock.  For  a  V.22bis  modem, 
the  symbol  clock  rate  is  600  baud  and  the  bit  rate  is  2400  bps. 
However  the  encoder  device  can  run  at  several  megahertz  for 
use  in  deep-space  satellite  communications.  The  design 
utilizes  all  of  the  available  outputs  on  the  PALC20R8Z.  Q0  and 
10  signals  are  held  in  logic  high  state.  The  extra  inputs  on  the 
device  may  be  used  to  control  the  start-up  phase  of  the 
encoder. 


It  is 


Figure  5.  Phase  Quadrant  State  Diagram 


assumed  that  a  quadbit  (X0,  X1,  X2,  X3)  of  incoming 


transmit  data  is  available  every  symbol  clock.  The  first  two 
bits,  X0  and  X1 ,  are  used  to  compute  the  phase  quadrant  of 
the  next  output  symbol.  The  quadrants  are  assigned  states 
(represented  by  Y0  and  Y1 )  and  the  state  transitions  (or  the 
differential  quadrant  change)  is  described  using  a  state 
diagram  shown  in  Figure  5  and  Table  1 .  The  state  transition 
outputs  are  latched  at  the  positive  edge  of  the  first  symbol 
clock.  Equations  for  Y0  and  Y1  could  be  derived  using 
Karnaugh  maps  (Figure  6)  and  the  reduced  equations 
implemented.  Alternatively,  PALASM  2  software  has  a  state 
machine  syntax  which  can  be  used  to  input  the  state  diagram 
directly  (see  the  design  file).  The  Minimize  software  module  of 
PALASM  2  software  is  then  used  to  reduce  the  equations. 
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00 

01 

11 

10 
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00 

01 

11 

10 
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1 
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01 
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1 

01 

1 

1 

11 

1 

11 

1 

10 

R 

1 

10 

1 

n 

Karnaugh  Map 
for  Next  /Y0  State 


Karnaugh  Map 
for  Next  /Y1  State 


Figure  6.  Karnaugh  Maps  for  Next  States  of  /Y0,  /Y1 


The  last  two  bits,  X2  and  X3,  are  latched  at  the  same  time  as 
the  new  states  are  computed  (see  design  file).  The  output  data 
values,  I  and  Q,  are  combinatorial  functions  of  Y0,  Y1 ,  X2,  and 
X3.  These  outputs  are  latched  at  the  positive  edge  of  the  next 
symbol  clock.  Thus  the  computational  delay  through  the 
encoder  is  two  symbol  clocks.  If  the  processing  delay  is 
critical  to  the  design,  it  could  be  reduced  to  one  symbol  clock 
by  using  a  PAL20RA10  device  because  this  device  allows 
independent  clocking  of  registers.  The  phase  quadrant  state 
could  then  be  latched  on  the  positive  edge  of  the  symbol  clock 
and  the  output  could  be  available  on  the  negative  edge  of  the 
clock. 


• 

jii 

0 

ft" 

m 

• 

• 

• 

Figure  7.  Photograph  of  QAM  Pattern 


The  values  +3,  +1,-1,  -3  of  I  and  Q  (Figure  3)  are  represented 
by  the  three-bit  binary  words,  111,  101,  011,  001,  (12  11  10  or 
Q2  Q1  Q0),  respectively.  These  three-bit  values  are  output 
every  symbol.  This  number  scheme  allows  the  use  of  simple 
digital-to-analog  conversion  as  required  in  the  satellite 
applications.  For  these  high-speed  applications  the  pulse- 
shaping  filtering  and  carrier-modulation  functions  are 
performed  in  analog  domain  using  operational  amplifiers  and 
active  components. 

The  PALC20R8Z  device  has  only  eight  registered  outputs, 
whereas  this  design  needs  ten  registered  outputs  (two  for  the 
phase  quadrant  state,  and  eight  for  the  I  and  Q  outputs). 
Fortunately  the  10  and  Q0  are  always  "1"  (as  defined  above) 
and  hence  can  be  tied  to  this  state  at  the  input  to  the  next 
stage.  In  this  case  the  eight  registers  in  the  PALC20R8Z 
device  are  sufficient  to  implement  the  full  encoder  function. 

The  design  file  shows  an  example  of  using  the  state  machine 
syntax  of  PALASM  2  software.  The  QAM  encoder  is  a  standard 


Mealy  type  of  state  machine.  However,  to  keep  the  number  of 
product  terms  per  output  reasonable  (less  than  eight  for  the 
PALC20R8Z  device),  it  is  implemented  as  a  Moore  state 
machine  followed  by  the  output  decode  function  (see 
appendix).  There  is  a  pipeline  delay  between  the  new  state 
values  and  their  corresponding  outputs.  The  differential 
quadrant  change  is  implemented  in  the  State  section  of  the 
QAM.PDS  file  whereas  output  decoding  is  done  in  the 
Equations  section  of  the  same  file. 

In  the  laboratory  verification,  the  I  (10  to  12)  and  Q  (Q0  to  Q2) 
outputs  were  connected  to  two  digital-to-analog  converters. 
The  analog  outputs  were  then  connected  to  the  A  and  B 
channels  of  an  oscilloscope  and  the  X-Y  pattern  was  observed 
(see  Figure  7).  For  high  data-rate  operation  (several  MHz),  the 
PALC20R8Z  device  encoder  together  with  a  data  scrambler 
implemented  in  hardware  facilitates  a  cost-effective  solution 
for  a  modem  transmitter. 
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TITLE 

PATTERN 

REVISION 

AUTHOR 

COMPANY 

DATE 


QAM_ENCODER 

QAM.PDS 

A 

RAJ  PARIPATYADAR,  M. 
MONOLITHIC  MEMORIES 
5  April  1987 


GZOWSKI 
INC. ,    SANTA  CLARA 


CHIP         QAM  PAL2  0R8 

CK  X3   X2  XI  XO  NC  NC  NC  NC  NC  NC  GND 
/OE  NC  X3A  X2A  Yl  YO  12   II  Q2   Ql  NC  VCC 


STATE 

MOORE_ 

_MACHINE 

STATE 

0 

:=  CS1 

state" 

"l 

:=  CS1 

state" 

"2 

:=  CS1 

state" 

"3 

:=  CS1 

■>STATE 
->STATE' 


1  +  CS2  ->STATE_0 

"2  +  CS2  ->STATE_1 

"3  +  CS2  ->STATE_2 

"0  +  CS2  ->STATE  3 


+   CS3    ->STATE_3  +->STATE_2 

+  CS3   ->STATE_0  +->STATE_3 

+  CS3   ->STATE_1  +->STATE_0 

+  CS3   ->STATE   2  +->STATE  1 


Equations  for  the  next  states  represented  by  YO  Yl 
These  outputs  are  latched  on  the  positive  edge  of  the 
first  symbol  clock 


STATE_0  =  /YO  *  /Yl 
STATE_1  =  /YO  *  Yl 
STATE_2  =  YO  *  Yl 
STATE_3   =     YO   *  /Yl 


CONDITIONS 

CS1  =  /XO  *  /XI 

CS2   =  /XO  *  XI 

CS3   =     XO  *  XI 

CS4  =     XO  *  /XI 


EQUATIONS 


;   latch  values  of  X2  and  X3  on  the  same  symbol  clock 
/X3A  :=  /X3 
/X2A  :=  /X2 

;     The  following     four  outputs  are     latched  on  the  positive  edge 
;     of  the  next  symbol  clock.     Therefore  the  ouput  of  the  whole 
;     encoder  is  available  after  the  second  symbol  clock. 
;     The  value  of  10  and  Q0  are  HIGH  and  therefore  input  to  the  next 
;     stage  may  be  tied  HIGH. 

/I2    :=  STATE_0  +  STATE_1 
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/II    :=  STATE_0 *   X3 A 

+  STATE_1*  X2A 

+  STATE_2*/X3A 

+  STATE_3*/X2A 

/Q2    :=  STATE_0  +  STATE_3 


/Ql  : =  STATE_0*  X2A 
+  STATE_1*/X3A 
+  STATE_2*/X2A 
+  STATE  3*  X3A 


SIMULATION 

TRACE_ON         OE     XO     XI     X2     X3     X2a     X3a     YO     Yl  CK 
II     12     Ql  Q2 

SETF        CK     OE  ; Preset  polarity  and  enable  output 

PRLDF       X3A  X2A  /Yl     /YO   12   II  Q2  Ql 

;check  the  state  transitions  from 

SETF  /XO     /XI     /X2      /X3  /CK 

; state  Y£>=0  Y1=0  with  input 
CLOCKF  ;X0=0  and  X1=0 

CLOCKF  ;X2=0  and  X3=0  sets  the  smallest 

CLOCKF  ; vector  amplitude 

CLOCKF  ; output  available  on  the  negative 

;edge  of  clock 


PRLDF  X3A  X2A  /Yl     /YO   12   II  Q2  Ql 

; check  the  state  transitions  from 

SETF  XO     XI  ; state  Y0=0     Y1=0  with  inputs 

CLOCKF  ;X0=1  and  Xl=l 

CLOCKF 
CLOCKF 
CLOCKF 


PRLDF       X3A  X2A  /Yl     /YO  12  II 

SETF         XO  /XI 

CLOCKF 

CLOCKF 


Q2  Ql 

; check  the  state  transitions  from 
; state  Y0=0  Y1=0  with  inputs 
;X0=1  and  X1=0 


PRLDF       X3A  X2A  Yl  /YO   12   II  Q2  Ql 

; check  the  state  transitions  from 
CLOCKF  ; state  Y0=0  Yl=l  with  inputs 

CLOCKF  ;X0=1  and  X1=0 

PRLDF       X3A  X2A  /Yl  /YO   12   II  Q2  Ql 

; check  the  state  transition  from 
SETF         /XO  XI  ; state  Y0=0  Y1=0  with  inputs 

CLOCKF  ;X0=0  and  Xl=l 


Memories 
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CLOCKF 


; cnecK  tne  state  transitions  rrom 
; state  Y0=0  Yl=l  with  inputs 
;X0=0  and  Xl=l 


PRLDF  X3A  X2A  /Yl  YO  12  II  Q2 
CLOCKF 

PRLDF  X3A  X2A  Yl  YO  12  II  Q2 
CLOCKF 


Ql 

; check  the  state  transitions  from 
; state  Y0=1  Y1=0  with  inputs 
;X0=0  and  Xl=l 

Ql 

; check  the  state  transitions  from 
;state  Y0=1  Yl=l  with  inputs 
;X0=0  and  Xl=l 


check  the  4  vector  amplitudes 


PRLDF       X3A  X2A  /Yl  /YO   12   II  Q2  Ql 

:In  state  Y0=0  Y1=0 

SETF         /XO  XI  /X2  /X3 
CLOCKF 
CLOCKF 

SETF         /X2  X3 
CLOCKF 
CLOCKF 


SETF         X2  /X3 

CLOCKF 

CLOCKF 


SETF         X2  X3 

CLOCKF 

CLOCKF 


SETF         /X2  /X3 

CLOCKF 

CLOCKF 


TRACE  OFF 
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PAL  Devices  Implement  the 
Full  V.32  Convolution  Encoder 


The  continuing  drive  for  higher  communication  data  rates  is 
resulting  in  the  development  of  data  modems  complying  with 
CCITT  specification  V.32.  This  standard  uses  the  Quadrature 
Amplitude  Modulation  (QAM)  encoding  scheme  to  achieve  a 
data  rate  of  9600  bps  while  restricting  the  modulation  rate  to 
2400  Hz.  However,  this  high  rate  of  transmission  is  sensitive  to 
random  errors  caused  by  Gaussian  noise  on  the  telephone 
lines.  Forward  error  correction  by  means  of  convolution 
encoding  and  Viterbi  decoding  improves  the  bit-error  rate 
performance  in  the  presence  of  white  Gaussian  noise.  The 
CCITT  V.32  (see  Reference  1)  specification  provides  for  the 
optional  use  of  a  convolution  encoding  scheme  whenever  the 
background  noise  is  high.  A  PAL20X8A  and  a  PAL20RS8  can 
implement  this  complex  encoder  function  in  a  cost-effective 
manner,  thereby  providing  a  simple  way  of  implementing  the 
transmitter  of  a  V.32  modem.  This  design,  using  PAL  devices, 
provides  a  way  of  implementing  a  separate  modem  transmitter 
synchronized  only  to  the  transmit  clock.  The  receiver  of  the 
modem  can  then  be  implemented  on  a  separate  digital  signal 
processing  chip  synchronized  to  the  receive  clock. 

Implementation  of  the 
V.32  Convolution  Encoder 

Figure  1  shows  the  position  of  the  encoder  function  within  the 
simplified  V.32  modem  transmitter.  Incoming  data  is  scrambled 
to  randomize  the  data  and  then  passed  through  the  encoder. 
The  In-phase  and  Out-of-phase  (I,  Q)  output  data  streams  are 
passed  through  pulse-shaping  filters  before  being  multiplied  with 
the  sine  and  cosine  carrier  frequencies.  If  I  and  Q  values  are 
applied  to  the  X  and  the  Y  axes  of  an  oscilloscope  with  the  time 


base  turned  to  the  X-Y  setting,  a  signal  constellation  is 
observed.  This  signal  pattern  (photograph  in  Figure  8)  indicates 
the  modulation  amplitude  and  the  phase  of  the  outgoing  2400  Hz 
carrier  frequency.  The  standard  signal  constellation  for  the  V.32 
is  the  QAM  scheme  and  is  shown  in  Figure  2.  This  scheme 
produces  sixteen  states  or  points  on  the  signal  constellation 
map.  The  diagram  may  be  observed  by  connecting  the  I  and  Q 
outputs  of  the  encoder  (Figure  1)  to  the  A,B  channels  of  an 
oscilloscope  and  turning  the  time  base  to  the  X-Y  setting.  The 
optional  convolutional  encoding  scheme  specified  in  the  V.32 
standard  produces  thirty-two  points  in  the  signal  constellation 
(Figure  3). 

The  points  in  Figure  3  are  closer  to  each  other  than  the  points  in 
the  straight  QAM  scheme  depicted  in  Figure  2,  and  it  may  not  be 
obvious  why  the  second  scheme  should  give  better  performance 
in  the  presence  of  noise.  The  convolution  encoder,  with  its 
implicit  memory,  prevents  adjacent  (neighbor)  points  appearing 
in  succession.  The  resulting  distance  between  two  successive 
points  after  convolution  encoding  is  therefore  bigger  than  the 
distance  between  any  two  points  in  Figure  2.  This  translates  to  a 
more  accurate  decision  of  the  received  state  and  hence  better 
performance  under  high  Gaussian  noise. 

We  now  consider  the  implementation  of  this  modem  encoder 
with  convolution  encoding.  Figure  4  shows  the  block  diagram  of 
the  full  V.32  encoder  with  convolution  encoding.  Figure  5  shows 
the  implementation  of  this  circuit  in  two  PAL  devices,  a 
PAL20X8A  and  a  PAL20RS8.  The  quadrant-change  state 
machine  and  the  convolution  encoder  are  implemented  in  the 
PAL20X8A  and  the  coordinate  mapping  of  the  32-point  signal 
constellation  is  achieved  through  the  PAL20RS8  device. 


CONNECT  TO  A,  B  CHANNELS  OF 
OSCILLOSCOPE  FOR 
CONSTELLATION  PATTERN 


SIN  (Wet) 


IN  -  PHASE 
DATA  I 


TRANSMIT 
DATA  - 
STREAM 


DATA 
SCRAMBLER 


V.32 
CONVOLUTION 
ENCODER 


PULSE 
SHAPING 
FILTER 


t 


PULSE 
SHAPING 
FILTER 


QUADRATURE 
PHASE  DATA  (Q) 

COSINE  (Wet) 

Figure  1 .  V.32  Convolution  Encoder  in  a  Typical  Modem  Transmitter 
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QUADRATURE  PHASE  (Q) 


X  D    X      3+  X 

10,11  10,01 


X 

10,10 


X     1--  X 


A  X 

00,01 


-1 
X 


00,11  00,10 


11,10  11,11 


11,00 


C  X 

11,01 


1 

X 

01,00 


3 

X 

01,10 


X    -3--     X  B 


01.01  01,11 


IN  PHASE 
(I) 


A,  B,  C,  D  ARE  HALF  RATE 
POINTS  e.g.,  1200  bps 

Figure  2.  QAM  Signal  Pattern  (Y1,Y2,X3,X4) 


QUADRATURE 
PHASE  (Q) 


X 

000 


X  X  x 

01000  00101  01010 

X  X  2-L     X  X 

10010  10101  10011  10100 


f  4 

11111     I  11 

X 

00101 

4  x 

10011 

X 

000 

3011 

i 

00001 


X  X  X  X  X 

00000  01111  00010  01101  00011 

—I — X  1  X— j  X  1  X — |  IN  „ 

.4    11001     -2   11110  11010     2    11101  4 

X  X  X  X  X 

00111  01001  00110  01011  00100 

X  X  .2  I      X  X 

10000  10111      T   10001  10110 


XXX 
01110  00001  01100 


x  .44-  x 

11100       |  11011 

Figure  3.  32-Point  Signal  Pattern  (Y0,Y1,Y2,X3,X4) 
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X3- 
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ENCODER 


"i  r 


Y2  (n-1) 


—    D  — 


Y1  (n-1) 


I  I 

INPUT  DATA  STREAM 
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oHIKrrr 


CONVOLUTION  ENCODER 
-  PAL20X8A  


X3n 

X2n 

— »-Q3 

Y2n 

-»Q2 

-»-Q1 
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-»-00 

TABLE  2 
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-»-l3 

— »- 12 
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-»I0 

Figure  4.  Full  V.32  Convolution  Encoder  (9600  bps) 
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Figure  5.  V.32  Convolution  Encoder 
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The  scrambled  data  stream  to  be  transmitted  is  divided  into 
groups  of  four  consecutive  data  bits.  As  shown  in  Figure  4,  the 
first  two  bits  arrive  in  time  X1,  X2  in  each  group  are  first 
differentially  encoded  into  Y1  and  Y2,  according  to  Table  1 . 

These  state  transitions  are  mapped  on  Karnaugh  maps  (Figure 
6),  and  the  reduced  equations  are  derived  and  implemented  in 
the  PAL20X8A  design.  The  equation  for  Y1  turns  out  to  be  a 
single  Exclusive-OR  (XOR)  function  of  the  previous  state  of  Y1 
and  XO.  The  Karnaugh  map  for  Y2  (Figure  6b)  appears  to 
indicate  a  fairly  complex  function.  However,  after  a  number  of 
steps  of  juggling  with  the  Boolean  equations,  the  final  equation 
can  be  expressed  as  an  XOR  function  of  two  product  terms. 
The  PAL20X8  device  offers  an  efficient  implementation  of  this 
equation. 

The  two  differentially  encoded  bits,  Y1  and  Y2,  are  then  used 
as  input  to  a  systematic  convolution  encoder,  which  generates 
a  redundant  bit  YO.  The  convolution  encoder  circuit  of  Figure  4 
contains  five  XOR  functions  and  this  requires  the  use  of  a  PAL 
device  containing  XOR  elements.  The  XORs  feed  into  each 
other.  Combinatorial  values  DO  and  D3  are  produced  (see 
equations  for  pin  14  and  23  of  the  PAL20X8A).  Additionally, 
registered  values  D1  and  D2  are  generated  as  intermediate 
delay  values.  Note  that  some  of  the  XOR  elements  had  to  be 
multiplied  out,  such  that  the  equations  for  D2  and  YO  have  only 
1  XOR  term  in  the  equation.  However,  product  terms  feed  the 
two  inputs  of  the  XOR  gate.  This  arrangement  uses  the  full 
functionality  of  the  PAL20X8A  device. 


INPUTS 

OUTPUTS 

OUTPUTS 

XO  X1 

Y1n-1  Y2n-1 

Y1n  Y2n 

0  0 

0  0 

0  0 

0  0 

0  1 

0  1 

0  0 

1  0 

1  0 

0  0 

1  1 

1  1 

0  1 

0  0 

0  1 

0  1 

0  1 

0  0 

0  1 

1  0 

1  1 

0  1 

1  1 

1  0 

1  0 

0  0 

1  0 

1  0 

0  1 

1  1 

1  0 

1  0 

0  1 

1  0 

1  1 

0  0 

1  1 

0  0 

1  1 

1  1 

0  1 

1  0 

1  1 

1  0 

0  0 

1  1 

1  1 

0  1 

Table  1 .  Differential  Encoding  for  V.32  Signal  Pattern  at  9600  bps 


,  Y1(n-1),  Y2(n-1) 
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Figure  6a.  Karnaugh  Map  for  Y1(n) 
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Figure  6b.  Karnaugh  Map  for  Y2(n) 
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signal  mapping  in  KAL20KS8  Device 

The  extra  bit  YO  and  the  four  information  carrying  bits  Y1,  Y2, 
X2,  and  X3  are  then  mapped  into  the  coordinates  of  the  signal 
element  to  be  transmitted  according  to  the  signal  space 
diagram  shown  in  Figure  3  and  listed  in  Table  2.  The  coordinate 
values  for  I  and  Q  are  each  represented  as  a  4-bit  binary 
number.  The  values  (13,12,11,10)  1100,  1011,  1010,  1001, 
1000,  0111,  0110,  0101,  0100  represent  the  magnitudes  +4, 
+3,  +2,  +1,  0,  -1,  -2,  -3,  -4  respectively.  The  values  for  the  Q 
bits  have  a  similar  representation.  Karnaugh  maps  (Figure  7) 
are  produced  for  each  of  the  outputs  13  to  10  and  Q3  to  Q0,  and 
the  reduced  equations  are  implemented  in  the  PAL  device 
design.  These  eight  output  equations  are  dependent  on  only 
five  outputs,  Y0,  Y1,  Y2,  X2,  and  X3,  but  one  of  the  outputs 
(12)  requires  nine  product  terms.  A  standard  PAL  device  (e.g.. 
PAL20R8)  has  only  eight  product  terms.  A  PAL  device  (such 
as  PAL20RS8)  which  has  sixteen  product  terms  shared 
between  pairs  of  outputs  would  be  fine.  The  PAL20RS8  is 
ideally  suited  to  this  application  because  it  allows  the  nine 
product  terms  required  for  calculating  12  to  be  allocated  to  this 
output  (on  pin  17)  and  the  other  seven  product  terms  to  be 
allocated  to  the  neighboring  signal  13  (on  pin  18). 

Results 

The  output  of  the  first  PAL20X8A  is  available  after  the  first  half  of 
the  symbol  clock  and  the  final  output  is  available  from  the 
second  PAL20RS8  after  the  second  half  of  the  symbol  clock. 
Thus  a  delay  of  only  one  symbol  clock  is  introduced  by  the  full 
encoder.  The  two  PAL  device  designs  have  been  verified  and 
the  32-state  constellation  of  Figure  3  was  observed  on  an 
oscilloscope.  The  encoding  rate  was  2400  baud,  but  the  PAL 
device  design  will  work  at  10  MHz  for  applications  that  may 
require  this  speed. 

Summary 

The  full  V.32  encoder  with  convolution  encoding  has  been 
implemented  in  two  PAL  devices  as  shown  in  Figure  5.  Both  the 
differential  quadrant  encoder  state  machine  and  the 
convolution  encoder  fit  into  the  PAL20X8A  and  the  signal  co- 
ordinate mapping  is  achieved  through  the  PAL20RS8.  This  PAL 
device  design  for  the  encoder  provides  a  superior  way  of 
implementing  a  V.32  modem  transmitter  synchronized  only  to 
the  transmit  clock. 
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;V.32  trellis  encoder  PAL#1,  20X8 

;This  PAL  device  performs  the  quadrant  change  based  on  the  first 
;two  bits  and  the  the  convolution  encoding  to  produce  the  output  yO. 

; 

TITLE  V32_1.PDS 

PATTERN  V32_ENCODER 

REVISION  A 

AUTHOR  RAJ  PARI PAT YADAR 

COMPANY  MMI,   SANTA  CLARA 

DATE  APRIL  3  1987 

CHIP     TREL1  PAL20X8 

;PINS         1         2       3       4         5         6         7         8         9         10       11  12 
CK       X3     X2     XI       XO       NC       NC       NC       NC       NC       NC  GND 
INS         13       14       15       16       17       18       19       20       21       22       23  24 
/OE    /DO    /Dl    /D2     NC      /YOn  /Yin  /Y2n  X2n    X3n    /D3  VCC 

EQUATIONS 

;These  equations  implement  the  differential  quadrant  encoder 
;on  the  bits  XO  and  XI.  The  outputs  are  Yin  and  Y2n. 

Yin  :=  Yin  :+:  XO 

Y2n  :=  (XO  *Yln)    :+:      (Xl*/Y2n  +  /Xl*Y2n) 

;These  equations  produce  the  extra  output  YOn.     Intermediate  values 
;D0,D1,D2  AND  D3  need  to  be  calculated  first.  Dl  and  D2  are 
(•registered  outputs. 

Dl  :=  YOn 

D2  :=   (D0*/D1  +  /D0*D1) 

:+:    (Yln*/Y2n  +  /Yln*Y2n) 

YOn  :=  Yin  *  YOn  :+:  D3 


DO 


=  Y0n*D2*/V2n  +  Y0n*/D2*Y2n 


D3  =  D2*/Y2n  +  /D2*Y2n 


ie  following  equations  are  just  registered  storage  of  the  bits 
;X2  and  X3 

/X2n        :=  /X2  ; internal  node  equation 

/X3n        :=  /X3  ; internal  node  equation 


TRACE_ON  XO  XI  Y2n  Yin  YOn 
SETF  OE 

PRLDF  Dl  D2  YOn  /Yin  /Y2n  X2n 
SETF         XO     /XI     /X2  /X3 


CK     DO     Dl     D2     D3     X2n  X3n 

initialize  Output-Enable 
X3n 

; state  Yln=0  Y2n=0  with  input 

; PRELOAD  VALUES  ARE  OPPOSITE 

;TO  THOSE  INTENDED,   TO  GET  AROUND 

;A  BUG  IN  PALASM2  V2 . 2 

; check  the  state  transitions  from 


CLOCKF 
CLOCKF 
CLOCKF 
CLOCKF 


;X0=1    and  X1=0 

;X2=0  and  X3=0  sets  the  smallest 
; vector  amplitude 
; output  available  on  the  negative 
;edge  of  clock 


SETF        XO  XI 

PRLDF      Dl  D2  YOn  /Yin  /Y2n  X2n  X3n 

; Check  the  state  transitions  from 
CLOCKF  ; state  Yln=0    Y2n=0  with  inputs 

CLOCKF  ;X0=1  and  Xl=l 

CLOCKF 
CLOCKF 


SETF         /XO  XI 

PRLDF       Dl  D2  YOn  /Yin  /Y2n  X2n 

CLOCKF 
CLOCKF 


X3n 

; check  the  state  transitions  from 
; state  Yln=0  Y2n=0  with  inputs 
;X0=0  and  Xl=l 


PRLDF      Dl  D2  YOn  Yin  /Y2n  X2n  X3n 

; check  the  state  transitions  from 
CLOCKF  ; state  Yln=l  Y2n=0  with  inputs 

CLOCKF  ;X0=0  and  Xl=l 


SETF        /XO  /XI 

PRLDF      Dl  D2  YOn  /Yin  /Y2n  X2n  X3n 

; check  the  state  transition  from 
CLOCKF  ; state  Yln=0  Y2n=0  with  inputs 

;X0=0  and  X1=0 

PRLDF      Dl  D2  YOn  /Yin  Y2n  X2n  X3n 

; check  the  state  transitions  from 
CLOCKF  /state  Yln=0  Y2n=l  with  inputs 

;X0=0  and  X1=0 
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PRLDF      Dl  D2  YOn  Yin  /Y2n  X2n  X3n 

; check  the  state  transitions  from 
CLOCKF  ; state  Yln=l  Y2n=0  with  inputs 

;X0=0  and  X1=0 

PRLDF      Dl  D2  YOn  Yin  Y2n  X2n  X3n 

; check  the  state  transitions  from 
CLOCKF  ; state  Yln=l  Y2n=l  with  inputs 

;X0=0  and  X1=0 

TRACE_OFF 
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;V.32  trellis  encoder  PAL#2,  20RS8. 

;This  PAL  performs  the  signal    mapping  onto  the  3  2 

;state  constellation  according  to    CCITT  V.32,   9600  bps 

/•specification. 

TITLE  V32_ENCODER 
PATTERN  V32_2.PDS 
REVISION  A 

AUTHOR  RAJ  PARI PAT YADAR 
COMPANY  MMI,  SANTA  CLARA 
DATE  APRIL  1  1987 

CHIP     TREL2  PAL20RS8 


; PINS       1  2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

CLK  X3a 

X2a 

/Y2 

/VI 

/Y0 

NC 

NC 

NC 

NC 

SCKIN 

GND 

;PINS       13  14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

/OE  INVSCK 

10 

11 

12 

13 

Q0 

Ql 

Q2 

Q3 

SCK 

VCC 

EQUATIONS 

13  :=  /Yl*/Y2*X3a  ;IN-PHASE  OUTPUT 

+  /Y0*/Yl*X2a*/X3a  ;(I0~I3) 

+  /Y0*Y2*/X2a 

+  Yl*/Y2*X2a 

+  Y0*Y1*/Y2*/X3a 

+  Y0*Yl*Y2*/X2a*X3a 

+  Y0*/Y1*Y2*/X3a 

12  :=  /Y0*/Yl*/X2a*/X3a 

+  /Y0*/Yl*X2a*X3a 
+  /Y0*Yl*Y2*X2a 
+  /Y0*Y1*/Y2*/X2a 
+  Yl*/¥2*/X2a*X3a 

+  Y0*Y1*Y2*/X3a 

+  Y0*Y2*X2a*X3a 

+  Y0*/Yl*Y2*X3a 

+  Y0*/Yl*/Y2*/X3a 

II  :=  /Y0*Y1 

+  Y0*Y2 

10  :=  Y0 


Q3  :=  /Y0*/Yl*/Y2*/X3a 

+  /Yl*/Y2*X2a 

+  /Y0*Y2*/X2a*X3a 

+  /Y0*Yl*Y2*X3a 

+  Yl*/Y2*/X3a 

+  Y0*Yl*/X2a*X3a 

+  Y0*Yl*Y2*X2a 


; QUADRATURE  OUTPUT 
; (Q0-Q3) 
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Q2 


+  Y0*/Y1*Y2*/X2a 

:=  /Yl*/V2*/X2a*X3a 

+  /Y0*Y2*/X2a*/X3a 

+  /Y0*Y2*X2a*/X3a 

+  /Yl*Y2*X2a 

+  /Y0*Yl*/V2*X3a 

+  Y0*Y1*/X2a*/X3a 

+  Y0*Yl*X2a*X3a 

+  Y0*/Yl*/Y2*/X2a 


Ql 


QO 


+ 
+ 


/Y0*/Y1*Y2 
/Y0*Y1*/Y2 
Y0*/Y1 

/YO 


INVSCK     =  /SCKIN 
SCK  =  SCKIN 

SIMULATION 

TRACE_ON     YO     Yl     Y2  X2a 
10     II     12  13 

SETF     OE  /CLK 


X3a  CLK 
QO       Ql  Q2 


; GENERATE  /SYMBOL  CLOCK 
; GENERATE  SYMBOL  CLOCK 

Q3 


SETF 

/YO 

/Yl 

/Y2 

/X2A 

/X3A 

CLOCKF 

SETF 

/YO 

/Yl 

/Y2 

/X2A 

X3A 

CLOCKF 

SETF 

/YO 

/Yl 

/Y2 

X2A 

/X3A 

CLOCKF 

SETF 

/YO 

/Yl 

/Y2 

X2A 

X3A 

CLOCKF 

SETF 

/YO 

/Yl 

Y2 

/X2A 

/X3A 

CLOCKF 

SETF 

/YO 

/Yl 

Y2 

/X2A 

X3A 

CLOCKF 

SETF 

/YO 

/Yl 

Y2 

X2A 

/X3A 

CLOCKF 

SETF 

/YO 

/Yl 

Y2 

X2A 

X3A 

CLOCKF 

SETF 

/YO 

Yl 

/Y2 

/X2A 

/X3A 

CLOCKF 

SETF 

/YO 

Yl 

/Y2 

/X2A 

X3A 

CLOCKF 

SETF 

/YO 

Yl 

/Y2 

X2A 

/X3A 

CLOCKF 

SETF 

/YO 

Yl 

/Y2 

X2A 

X3A 

CLOCKF 
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SETF 

/YO 

Yl 

Y2 

/X2A 

X3A 

CLOCKF 

SETF 

/YO 

Yl 

Y2 

X2A 

/X3A 

CLOCKF 

SETF 

/YO 

Yl 

Y2 

X2A 

X3A 

CLOCKF 

SETF 

YO 

/Yl 

/Y2 

/X2A 

/X3A 

CLOCKF 

SETF 

YO 

/Yl 

/Y2 

/X2A 

X3A 

CLOCKF 

SETF 

YO 

/Yl 

/Y2 

X2A 

/X3A 

CLOCKF 

SETF 

YO 

/Yl 

/Y2 

X2A 

X3A 

CLOCKF 

SETF 

YO 

/Yl 

Y2 

/X2A 

/X3A 

CLOCKF 

SETF 

YO 

/Yl 

Y2 

/X2A 

X3A 

CLOCKF 

SETF 

YO 

/Yl 

Y2 

X2A 

/X3A 

CLOCKF 

SETF 

YO 

/Yl 

Y2 

X2A 

X3A 

CLOCKF 

SETF 

YO 

Yl 

/Y2 

/X2A 

/X3A 

CLOCKF 

SETF 

YO 

Yl 

/Y2 

/X2A 

X3A 

CLOCKF 

SETF 

YO 

Yl 

/Y2 

X2A 

/X3A 

CLOCKF 

SETF 

YO 

Yl 

/Y2 

X2A 

X3A 

CLOCKF 

SETF 

YO 

Yl 

Y2 

/X2A 

/X3A 

CLOCKF 

SETF 

YO 

Yl 

Y2 

/X2A 

X3A 

CLOCKF 

SETF 

YO 

Yl 

Y2 

X2A 

/X3A 

CLOCKF 

SETF 

YO 

Yl 

Y2 

X2A 

X3A 

CLOCKF 
TRACE  OFF 
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Applications  of  4B3T  Line  Coding 

Line  coding  devices  are  required  in  line  terminals  and  line  re- 
peaters to  convert  binary  data  into  a  proper  coded  format  on  the 
transmission  lines  and  vice  versa  (see  Figure  1 ). 

4B3T  line  coding  is  used  for  the  Integrated  Services  Digital  Net- 
work (ISDN)  in  Germany.  Also,  4B3T  is  used  on  T-carrier  trans- 
mission lines  for  two  24-channel  DS-1  signals  (forty-eight  chan- 
nels). Because  4B3T  coding  decreases  transmission 
requirements  by  twenty-five  percent,  transmission  rates  for  two 
24-channel  DS-1  signals  are  only  fifty  percent  more  than  one  24- 
channel  DS-1  signals.  The  4B3T  is  used  on  the  T148  transmis- 
sion line  developed  by  ITT  Telecommunication. 

Bipolar  Codes 

Bipolar  coding  is  a  popular  coding  scheme  over  long  transmis- 
sion lines.  Bipolar  codes  use  three  different  voltages  (+5  V,  0  V, 
and  -5  V)  to  represent  logic  one  (1)  and  logic  zero  (0)  (see  Fig- 
ure 2a).  Logic-one  signals  alternate  between  +5  V  and  -5  V  to  al- 
low the  mean  DC  level  to  be  integrated  to  zero  volts.  Therefore, 
the  bipolar  code  can  compensate  for  DC  wander  on  the  trans- 
mission lines.  However,  the  bipolar  code  does  not  take  advan- 
tage of  using  three  logic  levels.  Thus,  binary  bipolar  code  is 
called  a  pseudo-ternary  code. 

Ternary  Codes 

Ternary  code  takes  advantage  of  all  three  logic  levels.  Each  bit 
of  the  ternary  code  represents  a  positive  logic  one  (+),  a  logic 


zero  (0),  or  a  negative  logic  one  (-)  (see  Figure  2b).  These  three 
logic  levels  are  mapped  to  the  three  voltage  levels. 

On  the  transmission  lines,  the  power  of  three  (3n)  codes  are  car- 
ried using  the  ternary  code  and  the  power  of  two  (2n)  codes  are 
carried  using  the  binary  code.  For  example,  two  bits  of  binary  (0, 

I )  codes  only  provide  four  combinations  of  data  (00,  01 ,  1 0,  and 

II)  .  But  two  bits  of  ternary  (-,  0,  +)  codes  provide  nine  combina- 
tions of  data  (00,  0-,  -0,  --,  -+,  0+,  +0,  +-,  and  ++).  The  ternary 
code  is  more  efficient  than  the  binary  code  because  more  infor- 
mation can  be  represented  on  the  transmission  lines.  Three  bi- 
polar bits  are  capable  of  representing  eight  codes.  For  ternary 
code,  only  two  bits  are  required  to  represent  eight  codes  as 
shown  in  Table  1 . 

Comparison  of  Binary  Codes  and 
Ternary  Codes 

4B3T  is  a  ternary  code  which  maps  four  continuous  binary  bits 
into  three  ternary  bits.  To  transmit  twenty  binary  data  bits,  twenty 
bipolar  bits  are  needed.  But  with  ternary  code  only  fifteen  bits 
are  needed.  A  twenty-five  percent  saving  on  bits  is  made  using 
4B3T  line  coding  on  the  transmission  lines.  Without  increasing 
the  transmission  rate,  ternary  coding  carries  more  information 
than  bipolar  coding.  Four  binary  bits  on  the  computer  system  will 
run  at  the  same  rate  as  three  ternary  bits  on  the  transmission 
lines  using  the  4B3T  line  coding.  For  example,  a  group  of  four  bi- 
nary bits  in  a  10-MHz  computer  system.  Substituted  to  a  group 
of  three  ternary  bits  at  7.5  MHz  on  a  transmission  line. 
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Figure  1 .  Line  Coding  Devices  in  a  Line  Terminal  and  a  Line  Repeater 


0        10  10 

Figure  2a.  Bipolar  Codes 


0        +  0-0 

Figure  2b.  Ternary  Codes 
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CODE 

BINARY  CODE 

(0,1) 

TERNARY  CODE 

(-,0,+) 

NUMBER 

R1 

Rn 

DU 

1  1 

1  u 

0 

0 

0 

0 

0 

0 

1 

0 

0 

1 

0 

+ 

2 

0 

1 

0 

+ 

0 

3 

0 

1 

1 

+ 

+ 

4 

1 

0 

0 

0 

5 

1 

0 

1 

+ 

6 

1 

1 

0 

+ 

7 

1 

1 

1 

Table  1.  Eight  Codes  Represented  by  Three  Bipolar  Bits  or 
Two  Ternary  Bits 

4B3T  Line  Coding 

Four  binary  bits  represent  sixteen  combinations  of  data,  but 
three  ternary  bits  represent  twenty-seven  combinations  of  data. 
Except  for  all  zeros  code,  the  remaining  combinations  of  ternary 
codes  can  be  categorized  into  three  groups:  zero,  positive,  and 
negative  accumulated  disparity.  A  zero-accumulated  disparity  is 
a  set  of  ternary  codes  that  have  the  same  number  of  positive 
codes  and  negative  codes,  e.g.,  the  accumulated  disparity  of  0-+ 
is  zero  (0).  A  positive  accumulated  disparity  is  a  set  of  ternary 
codes  that  have  more  positive  codes  than  negative  codes,  e.g., 
the  accumulated  disparity  of  the  +-+  is  positive  (+1 ).  A  negative 
accumulated  disparity  is  a  set  of  ternary  codes  that  have  more 
negative  codes  than  positive  codes,  e.g.,  the  accumulated  dis- 
parity of  the  0-  is  negative  (-2).  The  4B3T  line  coding  with  accu- 
mulated disparity  is  shown  in  Table  2.  Six  sets  of  four  binary  bits 
have  zero-accumulated  disparity.  The  remaining  ten  sets  of  four 
binary  bits  map  into  words  with  accumulated  disparity. 

Four  binary  bits  can  be  converted  into  any  one  of  four  possible 
ternary  codes.  This  is  normally  represented  by  four  columns,  as 
shown  in  Table  3.  Figure  3  is  the  state  diagram  of  the  next  col- 
umn generation.  Four  possible  columns  are  generated  for  next 
ternary  codes.  The  column  number  of  the  next  word  depends  on 

o 


< 


COLUMN 

1 


COLUMN 

2 


COLUMN 
4 


COLUMN 

3 


Figure  3. 4B3T  Next  Column  Generation  State  Diagram 


the  previous  column  and  the  current  accumulated  disparity.  If 
the  current  disparity  is  zero,  the  number  of  the  next  column  is  the 
same  as  the  previous  one.  If  the  current  disparity  is  positive,  the 
column  number  for  the  next  word  is  found  by  adding  the  current 
disparity.  Likewise,  if  the  current  disparity  is  negative,  the  num- 
ber for  the  the  next  column  is  found  by  subtracting  the  current 
disparity.  In  column  1 ,  the  next  word  may  only  have  positive  or 
zero  disparity  since  the  maximum  of  -3  negative  disparity  has 
been  reached.  Similarly,  in  column  4,  the  next  word  will  have 
negative  or  zero  disparity. 


4-BIT 
BINARY 

WUnU 

3-BIT  TERNARY  WORD 
(ACCUMULATED  DISPARITY) 

POSITIVE 

ZERO 

NEGATIVE 

0001 

0-  + 

01 1 1 

-0  + 

UlUU 

-  +0 

0010 

+  -0 

1011 

+  0- 

1110 

0  +  - 

1001 

+  -  + 

0011 

00  + 

--0 

1101 

0  +  0 

-0- 

1000 

+  00 

0-- 

0110 

-  +  + 

-  -  + 

1010 

+  +  - 

+  -  - 

1111 

+  +  0 

00- 

0000 

+  0  + 

0-0 

0101 

0  +  + 

-00 

1100 

+  +  + 

-  +  - 

Table  2.  4B3T  Line  Coding  with  Accumulated  Disparity 

The  4B3T  line  coding  is  represented  in  the  four  different  col- 
umns of  Table  3  and  shows  that  a  4-bit  binary  word  can  be  con- 
verted to  any  of  four  different  sets  of  a  3-bit  ternary  word.  Each 
set  of  a  3-bit  ternary  word  includes  the  substituted  3-bit  ternary 
code  and  the  column  number  for  the  next  ternary  word. 


2-476 


PLD  Devices  Implement  4B3T  Line  Transcoder 


4-BIT 
BINARY 
WORD 

COLUMN 

1 

COLUMN 
2 

COLUMN 

3 

COLUMN 
4 

3-BIT 
TERNARY 
WORD 

* 

3-BIT 
TERNARY 
WORD 

* 

3-BIT 
TERNARY 
WORD 

• 

3-BIT 
TERNARY 
WORD 

* 

0001 

0-  + 

1 

0  -  + 

2 

0  -  + 

3 

0-  + 

4 

0111 

-0  + 

1 

-0  + 

2 

-0  + 

3 

-  0  + 

4 

0100 

-  +0 

1 

-  +  0 

2 

-  +  0 

3 

-  +0 

4 

0010 

+  -0 

1 

+  -0 

2 

+  -  0 

3 

+  -  0 

4 

1011 

+  0- 

1 

+  0- 

2 

+  0  - 

3 

+  0- 

4 

1110 

0  +  - 

1 

0+  - 

2 

0+  - 

3 

0+  - 

4 

1001 

+  -  + 

2 

+  -  + 

3 

+  -  + 

4 



1 

0011 

00  + 

2 

00  + 

3 

00  + 

4 

-  -0 

2 

1101 

0  +  0 

2 

0  +  0 

3 

0  +  0 

4 

-0- 

2 

1000 

+  00 

2 

+  00 

3 

+  00 

4 

0  -  - 

2 

0110 

-  +  + 

2 

-  +  + 

3 

-  -  + 

2 

-  -  + 

3 

1010 

+  +  - 

2 

+  +  - 

3 

+  -  - 

2 

+  -  - 

3 

1111 

+  +  0 

3 

00  - 

1 

00  - 

2 

00- 

3 

0000 

+  0  + 

3 

0  -  0 

1 

0  -  0 

2 

0  -  0 

3 

0101 

0  +  + 

3 

-  00 

1 

-  00 

2 

-  00 

3 

1100 

+  +  + 

A 

-  +  - 

1 

-  +  - 

2 

-  +  - 

3 

Denotes  next  new  column. 


Table  3.  4B3T  Line  Coding 


4B3T  Line  Encoder  Implementation 

A  4B3T  encoder  converts  binary  data  into  two  signals:  a  positive 
ternary  signal  and  a  negative  ternary  signal.  These  two  signals 
generate  a  three-voltage  level  signal  for  the  transmission  line. 
The  4B3T  encoder  divides  the  incoming  binary  data  stream  into 
4-bit  blocks.  This  4-bit  word  is  converted  to  a  3-bit  ternary  word. 
The  4-bit  binary  word  and  the  number  of  the  column  generate  a 
ternary  word. 

Figure  4  is  the  block  diagram  of  the  4B3T  encoder.  It  consists  of 
a  4-bit  serial-in  parallel-out  shift  register,  a  2-bit  counter,  a  6-bit 
input  and  an  8-bit  output  look-up  table  converter,  and  two  3-bit 
parallel-in  serial-out  shift  registers.  Two  Programmable  Array 
Logic  (PAL)  devices  and  one  registered  PROM  device  imple- 
ment the  4B3T  encoder  (see  Figure  5).  The  first  PALC16R6Z 
performs  a  4-bit  serial-in  parallel-out  shift  register  and  a  2-bit 
counter.  A  4-bit  shift  register  (SR3,  SR2,  SR1,  and  SRO)  con- 
verts the  serial  data  into  a  4-bit  parallel  word.  A  2-bit  counter  pro- 
vides the  code  conversion's  clock  rate  which  is  one-quarter  the 
speed  of  the  serial-in  parallel-out  shift  register's  clock  rate.  The 
63RS481A  implements  the  6-input  and  8-output  look-up  table 
converter.  The  4B3T  encoding  conversion  table  is  shown  in  Fig- 
ure 6.  The  4-bit  binary  word  (B3,  B2,  B1 ,  and  B0)  and  2-bit  cur- 
rent column  numbers  (C1 ,  CO)  generate  8-bit  output  (PT2,  PT1 , 
PT0,  NT2,  NT1,  NT0,  NC1,  and  NC0).  NC1  and  NC0  are  the 
next  column  numbers.  They  are  fed  into  the  63RS481A  as  cur- 
rent column  numbers  for  the  next  operation.  The  remaining  six 
outputs  are  divided  into  two  groups.  One  group  is  the  positive 
ternary  signals  (PT2,  PT1,  and  PT0).  The  second  group  is  the 


negative  ternary  signals  (NT2,  NT1,  and  NT0).  The  second 
PALC16R6Z  provides  two  3-bit  parallel-in  serial-out  shift  regis- 
ters at  three-quarters  external  clock  rate.  PT2,  PT1 ,  and  PT0  are 
serialized  to  generate  the  positive  ternary  signal  (P4B3T).  Like- 
wise, NT2,  NT1 ,  and  NT0  are  serialized  to  produce  the  negative 
ternary  signal  (N4B3T). 

4B3T  Line  Decoder  Implementation 

A  4B3T  decoder  detects  three  continuous  pairs  of  positive  ter- 
nary signals  (P4B3T)  and  negative  ternary  signals  (N4B3T)  and 
converts  them  to  4-bit  binary  words.  Figure  7  is  the  block  dia- 
gram of  the  4B3T  decoder.  It  includes  a  4B3T  decoding  state 
machine  and  an  NRZ  data  generation. 

A  5-bit  state  machine  generates  four  inputs  for  the  parallel-in  se- 
rial-out shift  register  and  a  load  signal.  A  PALC22V10  is  used  for 
the  4B3T  decoding  state  machine  because  it  has  enough  prod- 
uct terms  and  flexible  outputs.  The  PALC22V10's  output  has 
programmable  register  bypass  and  programmable  output  polari- 
ty features.  Q4  generates  shift/load  signal  for  the  4-bit  parallel-in 
serial-out  shift  register  (HCT  195).  This  shift/load  signal  will  be 
assertive  low  only  at  every  three  clock  cycles.  The  remaining 
four  register  outputs  (Q3,  Q2,  Q1 ,  and  Q0)  are  parallel  inputs  for 
the  4-bit  parallel-in  serial-out  shift  register  (see  Figure  8).  This 
shift  register  clocks  at  four-thirds  external  clock  rate.  The  decod- 
ing state  machine  diagram  is  shown  in  Figure  9.  An  error  flag  is 
generated  when  positive  ternary  signals  and  negative  ternary 
signals  both  are  high.  The  error  flag  also  detects  three  continu- 
ous pairs  of  zero  ternary  signals. 
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Figure  5. 4B3T  Encoder  Implemented  by  PLD  Devices 
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Figure  7.  Block  Diagram  of  4B3T  Line  Decoding 


Figure  8.  4B3T  Decoder  Implemented  by  a  PAL  Device  and  a  Shift  Register 
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Kevision  A 

Author      Cindy  Lee 

Company    Monolithic  Memories 

Date  4/2/87 

CHIP  EN4B3T1  PAL16R6 

;PIN1  PIN2  PIN3     PIN4  PIN5  PIN6  PIN7  PIN8  PIN9  PIN10 
CLK      MR       NRZIN  NC       NC      NC       NC       NC       NC  GND 

;PIN11  PIN12  PIN13  PIN14  PIN15  PIN16  PIN17  PIN18  PIN19  PIN20 
/OE         CK4_LD  CQO         CQl       BO         Bl         B2         B3         NC  VCC 

INPUT  SIGNALS 

CLK  :   EXTENAL  CLOCK 

MR  :  MASTER  RESET 

NRZIN         :  NON-RETURN-ZERO  INPUT  SIGNAL 
/OE  :  ASSERTIVE  LOW  OUTPUT  ENABLE 

OUTPUT  SIGNALS 

CK4_LD       :   1/4*  EXTERNAL  CLOCK  AND  LOAD  SIGNAL 
CQO,   CQl   :   2-BIT  COUNTER  STATE  VARIABLES 
BO,   Bl,   B2,   B3   :   FOUR  PARALLEL  BINARY  DATA  OUTPUTS 


EQUATIONS 

;THE  16R6  GENERATES : 

;(1)   A  4-BIT  SERIAL-IN  PARALLEL-OUT  SHIFT  REGISTER 

/BO               :=  /Bl  ;   LSB  OF  4-BIT  PARALLEL  OUTPUTS 

+  MR  ;  MASTER  RESET 

/Bl  :=  /B2 

+  MR 

/B2  :=  /B3 

+  MR 

/B3               :=  /NRZIN  ;  MSB  OF  4-BIT  PARALLEL  OUTPUTS 
+  MR 

;(2)   A  1/4  CLOCK  AND  LOAD/SHIFT  SIGNAL 

/CQO             :=  /CQl  ;   LSB  OF  2-BIT  COUNTER 

+  MR  ;  MASTER  RESET 

/CQl             :=  CQO  ;  MSB  OF  2-BIT  COUNTER 
+  MR 

/CK4_LD         =  CQO  ;   1/4  *  EXTERNAL  CLOCK 

+  CQl  ;  AND  A  LOAD/SHIFT  SIGNAL 
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SIMULATION 

TRACE_ON     MR  CLK  NRZIN 

SETF  MR  OE 
CLOCKF  CLK 

SETF  NRZIN 
CLOCKF  CLK 


BO  Bl  B2  B3  CQO  CQ1  CK4_LD 
;  RESET  CONDITION 


SETF  /MR  NRZIN 
CLOCKF  CLK 

SETF  /NRZIN 
CLOCKF  CLK 

SETF  /NRZIN 
CLOCKF  CLK 

SETF  NRZIN 
CLOCKF  CLK 


;  DISABLE  THE  MASTER  RESET 
;  SHIFT  A  POSITIVE  NRZ  DATA 

;  SHIFT  TWO  NEGATIVE  NRZ  DATA 


;   SHIFT  TWO  POSITIVE  NRZ  DATA 


SETF  NRZIN 
CLOCKF  CLK 


SETF  /NRZIN 
CLOCKF  CLK 


;   SHIFT  A  NEGATIVE  NRZ  DATA 


SETF  NRZIN 
CLOCKF  CLK 


;  SHIFT  A  POSITIVE  NRZ  DATA 


SETF  /NRZIN 
CLOCKF  CLK 


;   SHIFT  THREE  NEGATIVE  NRZ  DATA 
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CLOCKF  CLK 

SETF  /NRZIN 
CLOCKF  CLK 


SETF  NRZIN 
CLOCKF  CLK 


;  SHIFT  A  POSITIVE  NRZ  DATA 


SETF  /NRZIN 
CLOCKF  CLK 

SETF  NRZIN 
CLOCKF  CLK 

SETF  NRZIN 
CLOCKF  CLK 

SETF  NRZIN 
CLOCKF  CLK 


;  SHIFT  A  NEGATIVE  NRZ  DATA 


;   SHIFT  THREE  POSITIVE  NRZ  DATA 


TRACE  OFF 


m 


2-483 


PLD  Devices  Implement  4B3T  Line  Transcoder 


PLE9R8     ;   PROM  63RS481A 
TABLE . PLE 

6-BIT  TO  8-BIT  TABLE  CONVERTER 
MM I ,   SANTA  CLARA,  CA 

;PIN     8     7     6     5     4  3 
AO  Al  A2  A3  A4  A5 
; ADD  CO  CI  B3  B2  BI  BO 


PLE  DESIGN  SPECIFICATION 
CINDY  LEE  3/15/87 


;PIN     9     10     11     13     14     15     16  17 
Ql     Q2     Q3     Q4     Q5     Q6     Q7  Q8 
; DAT  PTO  PT1  PT2  NTO  NT1  NT2  NCI  NCO 


;   BO  Bl  B2  B3  CI  CO 


PTO 


PT1  :  = 


NTO 


NT1 


/BO 

/Bl 

/B2 

/B3 

* 

/CI 

* 

/CO 

;  o 

0 

0 

0 

0 

0 

+ 

/BO 

/Bl 

B2 

/B3 

;  o 

0 

1 

0 

X 

X 

+ 

BO 

* 

/Bl 

* 

/CI 

;  1 

0 

X 

X 

0 

X 

+ 

BO 

* 

/Bl 

* 

CI 

* 

/CO 

;  1 

0 

X 

X 

1 

0 

+ 

BO 

/Bl 

* 

B2 

CI 

CO 

;  1 

0 

1 

X 

1 

1 

+ 

BO 

Bl 

/B2 

* 

/B3 

/CI 

* 

/CO 

;  1 

1 

0 

0 

0 

0 

+ 

BO 

Bl 

B2 

B3 

* 

//-II 

/CI 

/CO 

;  1 

1 

1 

1 

0 

0 

Bl 

/B2 

/B3 

>  X 

1 

0 

0 

X 

X 

+ 

/BO 

Bl 

/B2 

B3 

/CI 

/CO 

/'  o 

1 

0 

1 

0 

0 

+ 

/Tin 
/  DU 

bJL 

/B3 

/CI 

i  0 

1 

1 

0 

0 

X 

+ 

BO 

* 

/Bl 

* 

B2 

* 

/B3 

* 

/CI 

;  l 

0 

1 

0 

0 

X 

+ 

BO 

* 

Bl 

* 

/B2 

* 

B3 

* 

/CI 

;  1 

1 

0 

1 

0 

X 

T 

□  rt 

it 

Dl 

/no 

D-J 

* 

PI 

/on 

1  J. 

± 

n 

J. 

u 

+ 

BO 

* 

Bl 

* 

B2 

* 

/B3 

;  1 

1 

1 

o 

x 

x 

+ 

BO 

* 

Bl 

* 

B2 

* 

B3 

* 

/CI 

* 

/CO 

;  1 

1 

1 

1 

0 

0 

/BO 

* 

/Bl 

* 

/B2 

* 

/B3 

* 

/CI 

* 

/CO 

;  o 

0 

0 

0 

0 

0 

+ 

/BO 

* 

/Bl 

* 

/B2 

* 

B3 

;  o 

0 

0 

1 

X 

X 

+ 

/BO 

* 

/Bl 

* 

B2 

* 

B3 

* 

/CI 

;  o 

0 

1 

1 

0 

X 

+ 

/BO 

* 

/Bl 

* 

B2 

* 

B3 

* 

CI 

* 

/CO 

;  o 

0 

1 

1 

1 

0 

+ 

/BO 

* 

Bl 

* 

/B2 

* 

B3 

* 

/CI 

* 

/CO 

;  o 

1 

0 

1 

0 

0 

+ 

/BO 

* 

Bl 

* 

B2 

;  o 

1 

1 

X 

X 

X 

+ 

BO 

* 

/Bl 

* 

/B2 

* 

B3 

* 

/CI 

;  1 

0 

0 

1 

0 

X 

+ 

BO 

* 

/Bl 

* 

/B2 

* 

B3 

* 

CI 

* 

/CO 

;  1 

0 

0 

1 

1 

0 

+ 

BO 

* 

Bl 

* 

/B2 

* 

/B3 

* 

/CI 

* 

/CO 

;  l 

1 

0 

0 

0 

0 

/BO 

* 

/Bl 

* 

B2 

* 

B3 

* 

CI 

* 

CO 

;  o 

0 

1 

1 

1 

1 

+ 

/BO 

* 

Bl 

* 

/B2 

* 

/B3 

;  o 

1 

0 

0 

X 

X 

+ 

/BO 

* 

Bl 

* 

/B2 

* 

B3 

* 

/CI 

* 

CO 

;  o 

1 

0 

1 

0 

1 

+ 

/BO 

* 

Bl 

* 

/B2 

* 

B3 

* 

CI 

;  o 

1 

0 

1 

1 

X 

+ 

/BO 

* 

Bl 

* 

B2 

;  o 

1 

1 

X 

X 

X 

+ 

BO 

* 

/B2 

* 

B3 

* 

CI 

* 

CO 

;  1 

X 

0 

1 

1 

1 

+ 

BO 

* 

Bl 

* 

/B2 

* 

/B3 

* 

CI 

;  1 

1 

0 

0 

1 

X 

+ 

BO 

* 

Bl 

* 

/B2 

* 

/B3 

* 

/CI 

* 

CO 

;  1 

1 

0 

0 

0 

1 

/BO 

* 

/Bl 

* 

/B2 

* 

/B3 

* 

/CI 

* 

CO 

;  0 

0 

0 

0 

0 

1 

+ 

/BO 

* 

/Bl 

* 

/B2 

* 

/B3 

* 

CI 

;  o 

0 

0 

0 

1 

X 

PLD  Devices  Implement  4B3T  Line  Transcoder 


+ 

/Bl 

/B2 

B3 

X 

0 

0 

1 

X 

X 

+ 

/BO 

* 

/Bl 

* 

B2 

* 

/B3 

0 

0 

1 

0 

X 

X 

+ 

/BO 

* 

/Bl 

* 

B2 

* 

B3 

* 

CI 

* 

CO 

0 

0 

1 

1 

1 

1 

+ 

/BO 

* 

Bl 

* 

B2 

* 

/B3 

* 

CI 

. 

0 

1 

1 

0 

1 

X 

+ 

BO 

* 

/Bl 

* 

/B2 

* 

/B3 

* 

CI 

* 

CO 

. 

1 

0 

0 

0 

1 

1 

+ 

BO 

* 

/Bl 

* 

B2 

* 

/B3 

* 

CI 

1 

1 

0 

1 

0 

1 

X 

BO 

* 

/Bl 

* 

/B2 

* 

CI 

* 

CO 

. 

1 

0 

0 

X 

1 

1 

+ 

BO 

* 

/Bl 

* 

B2 

1 

0 

1 

X 

X 

X 

+ 

BO 

* 

Bl 

/B2 

* 

/B3 

/CI 

* 

CO 

# 

1 

1 

0 

0 

0 

1 

+ 

BO 

* 

Bl 

* 

/B2 

* 

/B3 

* 

CI 

1 

1 

1 

0 

0 

1 

X 

+ 

BO 

* 

Bl 

* 

/B2 

* 

B3 

* 

CI 

* 

CO 

1 

1 

1 

0 

1 

1 

1 

+ 

BO 

* 

Bl 

* 

B2 

* 

/B3 

• 

I 

1 

1 

1 

0 

X 

X 

+ 

BO 

* 

Bl 

* 

B2 

* 

B3 

* 

/CI 

* 

CO 

1 

1 

1 

1 

0 

1 

+ 

BO 

* 

Bl 

* 

B2 

* 

B3 

* 

CI 

. 

1 

1 

1 

1 

1 

X 

— 

/BO 

* 

/Bl 

* 

/B2 

* 

B3 

* 

CI 

0 

0 

0 

1 

1 

0 

+ 

/BO 

* 

/Bl 

* 

B2 

* 

/B3 

* 

CI 

. 

0 

0 

1 

0 

1 

X 

+ 

/BO 

* 

Bl 

* 

/B2 

* 

/B3 

* 

CI 

I 

0 

1 

0 

0 

1 

X 

+ 

/BO 

* 

Bl 

* 

B2 

* 

B3 

* 

CI 

i 

0 

1 

1 

1 

1 

X 

+ 

BO 

* 

/Bl 

* 

B2 

* 

B3 

* 

CI 

1 

1 

0 

1 

1 

1 

X 

+ 

BO 

* 

Bl 

* 

B2 

* 

/B3 

* 

CI 

1 

1 

1 

1 

0 

1 

X 

+ 

/BO 

* 

/Bl 

* 

/B2 

* 

/B3 

* 

/CI 

* 

/CO 

0 

0 

0 

0 

0 

0 

+ 

/BO 

* 

/Bl 

* 

/B2 

* 

/B3 

* 

CI 

* 

CO 

1 

0 

0 

0 

0 

1 

1 

+ 

/BO 

* 

/Bl 

* 

B2 

* 

B3 

* 

/CI 

* 

CO 

1 

0 

0 

1 

1 

0 

1 

+ 

/BO 

* 

/Bl 

* 

B2 

* 

B3 

* 

CI 

* 

/CO 

0 

0 

1 

1 

1 

0 

+ 

/BO 

* 

Bl 

* 

/B2 

* 

B3 

* 

/CI 

* 

/CO 

. 

0 

1 

0 

1 

0 

0 

+ 

/BO 

* 

Bl 

* 

/B2 

* 

B3 

* 

CI 

* 

CO 

0 

1 

0 

1 

1 

1 

+ 

/BO 

* 

Bl 

* 

B2 

* 

/B3 

* 

CO 

0 

1 

1 

0 

X 

1 

+ 

BO 

* 

/Bl 

* 

/B2 

* 

/B3 

* 

CI 

* 

/CO 

/ 

• 

1 

0 

0 

0 

1 

0 

+ 

BO 

* 

/Bl 

* 

/B2 

* 

/B3 

* 

/CI 

* 

CO 

• 

1 

0 

0 

0 

0 

1 

+ 

BO 

/B2 

* 

B3 

* 

/CI 

* 

CO 

; 

1 

X 

0 

1 

0 

1 

+ 

BO 

* 

/B2 

* 

B3 

* 

CI 

* 

/CO 

i 

1 

X 

0 

1 

1 

0 

+ 

BO 

* 

/Bl 

* 

B2 

* 

/B3 

* 

CO 

1 

0 

1 

0 

X 

1 

+ 

BO 

* 

Bl 

* 

/B2 

* 

/B3 

* 

/CI 

* 

/CO 

2 

1 

1 

0 

0 

0 

0 

+ 

BO 

* 

Bl 

* 

/B2 

* 

/B3 

* 

CI 

* 

CO 

f 

1 

1 

0 

0 

1 

1 

+ 

BO 

* 

Bl 

* 

B2 

* 

B3 

* 

/CI 

* 

/CO 

1 

1 

1 

1 

1 

0 

0 

+ 

BO 

* 

Bl 

* 

B2 

* 

B3 

* 

CI 

* 

CO 

1 

1 

1 

1 

1 

1 

/BO 

* 

/Bl 

* 

/B2 

* 

B3 

* 

CO 

1 

0 

0 

0 

1 

X 

1 

+ 

/BO 

* 

/Bl 

* 

B2 

* 

/B3 

* 

CO 

7 

0 

0 

1 

0 

X 

1 

+ 

/BO 

* 

Bl 

* 

/B2 

* 

/B3 

* 

CO 

t 

0 

1 

0 

0 

X 

1 

+ 

/BO 

* 

Bl 

* 

B2 

* 

B3 

* 

CO 

i 

0 

1 

1 

1 

X 

1 

+ 

BO 

* 

/Bl 

* 

B2 

* 

B3 

* 

CO 

; 

1 

0 

1 

1 

X 

1 

+ 

BO 

* 

Bl 

* 

B2 

* 

/B3 

* 

CO 

1 

1 

1 

1 

0 

X 

1 

+ 

/BO 

* 

/Bl 

* 

/B2 

* 

/B3 

* 

CI 

* 

/CO 

1 

0 

0 

0 

0 

1 

0 

+ 

/BO 

* 

/Bl 

* 

B2 

* 

B3 

* 

/CO 

; 

0 

0 

1 

1 

X 

0 

+ 

/BO 

* 

/Bl 

* 

B2 

* 

B3 

* 

CI 

* 

CO 

; 

0 

0 

1 

1 

1 

1 

+ 

/BO 

* 

Bl 

* 

/B2 

* 

B3 

* 

CI 

* 

/CO 

• 
/ 

0 

1 

0 

1 

1 

0 

+ 

/BO 

* 

Bl 

* 

B2 

* 

/B3 

* 

/CO 

1 

0 

1 

1 

0 

X 

0 

+ 

BO 

* 

/Bl 

* 

/B2 

* 

/B3 

* 

/CO 

# 

1 

0 

0 

0 

X 

0 

+ 

BO 

* 

/Bl 

* 

/B2 

* 

/B3 

* 

CI 

i 

1 

0 

0 

0 

1 

X 

+ 

BO 

* 

/B2 

* 

B3 

* 

/CO 

1 

1 

X 

0 

1 

X 

0 

+ 

BO 

* 

/Bl 

* 

B2 

* 

/B3 

* 

/CO 

1 

1 

0 

1 

0 

X 

0 

+ 

BO 

* 

Bl 

* 

/B2 

* 

/B3 

* 

/CO 

1 

1 

1 

0 

0 

X 

0 

+ 

BO 

* 

Bl 

* 

/B2 

* 

B3 

* 

CI 

* 

CO 

t 

1 

1 

0 

1 

1 

1 

+ 

BO 

* 

Bl 

* 

B2 

* 

B3 

* 

CI 

* 

/CO 

; 

1 

1 

1 

1 

1 

0 

t\  MonollthlaJSKR  Memories  £1 


2-485 


BO  Bl  B2  B3  CI  CO     PTO  PT1  PT2  NTO  NT1  NT2  NCI  NCO 


; BINARY  INPUT,   COLUMN,         OUTPUT  NEW  COLUMN 

;BO  Bl  B2   B3  CI  CO,  PTO  PT1  PT2  NTO  NT1  NT2  NCI  NCO 
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DESCRIPTION 


THIS  REGISTERED  PROM  (63RS481A)  IMPLEMENTS  A  6-TO-8  TABLE 
CONVERTER.  SIX  INPUTS  ARE  BO,  Bl,  B2 ,  B3  (BINARY  INPUTS), 
AND  CI,  CO  (CURRENT  COLUMN  NUMBER).  BECAUSE  THESE  FOUR 
PARALLEL  BINARY  INPUTS  ARE  FROM  THE  4-BIT  SERIAL-IN 
PARALLEL-OUT  SHIFT  REGISTER,  THIS  PROM  (63RS481A)  MUST 
OPERATE  AT  A  QUARTER  OF  THE  SHIFT  REGISTER'S  RATE.  EIGHT 
OUTPUTS  ARE  PTO,  PT1,  PT2,  NTO,  NT1,  NT2,  NCI,  AND  NCO.  THE 
PTO,  PT1,  AND  PT2  ARE  POSITIVE  TERNARY  SIGNALS  WHICH  FORMS 
THREE  SERIAL  P4B3T  SIGNAL  THROUGH  A  3-BIT  PARALLEL- IN 
SERIAL-OUT  SHIFT  REGISTER.  LIKEWISE,  NTO,  NT1,  AND  NT2  ARE 
NEGATIVE  TERNARY  SIGNALS.  NCI  AND  NCO  ARE  NEXT  COLUMN 
NUMBERS . 
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Title         4B3T  ENCODER  LOGIC2 

Pattern  EN4B3T2.pds 

Revision  A 

Author      Cindy  Lee 

Company    Monolithic  Memories 

Date  3/5/87 

CHIP  EN4B3T2  PAL16R6 

;PIN1  PIN2  PIN3  PIN4  PIN5  PIN6  PIN7  PIN8  PIN9  PINir 
CLK      MR       PTO     PT1     PT2     NTO     NT1     NT 2     LOAD  GND 

;PIN11  PIN12  PIN13  PIN14  PIN15  PIN16  PIN17  PIN18  PIN19  PIN20 
/OE         NC         NS2       NS1       N4B3T  PS 2       PS1       P4B3T  NC  VCC 

;     INPUT  SIGNALS 

CLK         :   EXTERNAL  CLOCK 

MR  :  MASTER  RESET 

;  PTO,  PT1,  PT2  :  THREE  POSITIVE  TERNARY  SIGNALS 
;  NTO,   NT1,   NT 2     :   THREE  NEGATIVE  TERNARY  SIGNALS 

LOAD       :   LOAD  SIGNAL  FOR  TWO  SHIFT  REGISTERS 

/OE         :   ASSERTIVE  LOW  OUTPUT  ENABLE 

;     OUTPUT  SIGNALS 

;     NS2,  NS1   :   NEGATIVE  TERNARY  SHIFT  REGISTER  VARIABLES 

N4B3T     :   OUTPUT  SIGNAL  OF  NEGATIVE  TERNARY  SHIFT  REGISTER 

;     PS 2 ,   PS1   :   NPOSITIVE  TERNARY  SHIFT  REGISTER  VARIABLES 

P4B3T     :   OUTPUT  SIGNAL  OF  POSITIVE  TERNARY  SHIFT  REGISTER 

EQUATIONS 

;THE  16R6  GENERATES: 

;(1)   A  3-BIT  PARALLEL-IN  SERIAL-OUT  SHIFT  REGISTER 
FOR  POSITIVE  TERNARY  SIGNAL 

/P4B3T       :=  /PTO  *  /PS1  ;  SHIFT  REGISTER  OUTPUT 


+ 
+ 
+ 


/LOAD  *  /PS1 
/PTO  *  LOAD 
MR 


/PS1 


/PT1  *  /PS2 
/LOAD  *  /PS2 
/PT1  *  LOAD 
MR 


+ 
+ 
+ 


/PS2 


/PT2 

/LOAD 

MR 


+ 
+ 


;(2)   A  3-BIT  PARALLEL-IN  SERIAL-OUT  SHIFT  REGISTER 
FOR  NEGATIVE  TERNARY  SIGNAL 


/N4B3T       :=  /NTO  *  /NS1  ;  SHIFT  REGISTER  OUTPUT 

+  /LOAD  *  /NS1 
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+  /NTO  *  LOAD 
+  MR 

/NS1  :=  /NT1  *  /NS2 

+  /LOAD  *  /NS2 
+  /NT1  *  LOAD 
+  MR 


/NS2 


:=  /NT2 
+  /LOAD 
+  MR 


SIMULATION 

TRACE_ON    MR  CLK  LOAD  PT2  PT1  PTO  P4B3T  NT2  NT1  NTO  N4B3T 


SETF  /LOAD  MR  OE 
SETF  /MR 

SETF  LOAD  PT2  PT1  PTO  NT 2  NT1  NTO 
CLOCKF 


;  RESET  CONDITION 
;   DISABLE  MASTER  RESET 

;  LOAD  INPUT  DATA 


SETF  /LOAD  /PT2  /PT1  PTO  /NT2  /NT1  NTO  ;  SHIFT  DATA 
CLOCKF 


SETF  PT2  /PT1  PTO  NT2  /NT1  NTO 
CLOCKF 


ETF  LOAD  PT2  /PT1  PTO  NT2  /NT1  NTO 
LOCKF 

3etf  /load  pt2  /pt1  pto  nt 2  /nt1  nto 
:lockf 


SETF  /PT2  PT1  /PTO  NT 2  NT1  /NTO 
CLOCKF 


SHIFT  DATA 
LOAD  INPUT  DATA 
SHIFT  DATA 
SHIFT  DATA 


SETF  LOAD  PT2  /PT1  PTO  /NT2  NT1  NTO  ;  LOAD  INPUT  DATA 
CLOCKF 


ETF  /LOAD  PT2  /PT1  PTO  /NT2  NT1  NTO  ;  SHIFT  DATA 
LOCKF 


SETF  PT2  /PT1  PTO  /NT2  NT1  NTO 
CLOCKF 


;  SHIFT  DATA 


SETF  LOAD  PT2  PT1  /PTO  /NT2  /NT1  /NTO  ;  LOAD  INPUT  DATA 
CLOCKF 

SETF  /LOAD  PT2  /PT1  /PTO  NT2  /NT1  /NTO   ;   SHIFT  DATA 
CLOCKF 


SETF  PT2  /PT1  /PTO  NT 2  /NT1  /NTO 
CLOCKF 


;  SHIFT  DATA 


SETF  /LOAD  PT2  /PT1  /PTO  NT2  /NT1  /NTO  ;  SHIFT  DATA 
CLOCKF 

SETF  PT2  /PT1  /PTO  NT 2  /NT1  /NTO  ;   SHIFT  DATA 

CLOCKF 

TRACE  OFF 
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Revision  A 

Author      Cindy  Lee 

Company    Monolithic  Memories 

Date  3/18/87 

CHIP  DE4B3T  PAL22V10 

;PIN1  PIN2  PIN3  PIN4     PIN5  PIN6 
/CLK     NC       MR       P4B3T  N4B3T  NC 

;PIN7  PIN8  PIN9  PIN10  PIN11  PIN12 
NC         NC       NC       NC         NC  GND 

;PIN13  PIN14  PIN15  PIN16  PIN17  PIN18 
NC  ERROR  NC        /Q4       Q3  Q2 

;PIN19  PIN20  PIN21  PIN22  PIN23  PIN24 
Ql  QO         NC         NC        NC  VCC 

GLOBAL 


INPUT  SIGNALS 

/CLK     :  ASSERTIVE  LOW  EXTERNAL  CLOCK 
MR         :  MASTER  RESET 
P4B3T  :   POSITIVE  TERNARY  SIGNAL 
N4B3T  :  NEGATIVE  TERNARY  SIGNAL 

OUTPUT  SIGNALS 

ERROR  :   ERROR  FLAG 

/Q4       :  ASSERTIVE  LOW  Q4  SIGNAL  WHICH  IS  A  /LOAD  SIGNAL 
FOR  A  SHIFT  REGISTER 
Q3,   Q2,   Ql,   QO   :   4B3T  DECODING  STATE  VARIABLES 


EQUATIONS 

; ERROR  SIGNAL  IS  GENERATED  WHEN  P4B3T  AND  N4B3T  SIGNALS  BOTH 
; ARE  HIGH,   OR  P4B3T  AND  N4B3T  SIGNALS  BOTH  ARE  LOW  AT  THE 
.•TRANSITION  FROM  STATE4  TO  STATEO. 

ERROR  =  P4B3T  *  N4B3T  ;  P4B3T  AND  N4B3T  ARE  HIGH 

+  /Q4  *  /Q3  *  /Q2  *  /Ql  *  /QO  *  /P4B3T  *  /N4B3T 

;/Q4  IA  A  ASSERTIVE  LOW  LOAD  SIGNAL.   THE  FOUR  PARALLEL  DATA 
; ARE  LOADED  INTO  4-BIT  PARALLEL-IN  SERIAL-OUT  SHIFT  REGISTER 
;WHEN  LAOD  IS  LOW,   OTHERWISE,   THE  DATA  IS  SHIFTED  OUT. 

Q4  :=  /N4B3T  *  P4B3T  *  /Ql  *  Q3  *  /Q4         ;   LOAD  SIGNAL 

+  /N4B3T  *  /QO  *  Q2  *  /Q3  *  /Q4 
+  /N4B3T  *  /QO  *  Ql  *  Q3  *  /Q4 
+  /N4B3T  *  /Ql  *  Q2  *  /Q3  *  /Q4 
+  N4B3T  *  /P4B3T  *  /Ql  *  Q3  *  /Q4 


i\  Monolithic  FjJ  Memories 
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+  /P4B3T  *  /QO  *  Q2  *  /Q3  *  /Q4 
+  /P4B3T  *  /QO  *  Ql  *  Q3  *  /Q4 
+  /P4B3T  *  /Ql  *  /Q2  *  Q3  *  /Q4 
+  /P4B3T  *  /Ql  *  Q2  *  /Q3  *  /Q4 
+  /P4B3T  *  QO  *  /Ql  *  Q3   *  /Q4 
GLOBAL. RSTF  =  MR 

;Q3,  Q2,  Ql,  AND  QO  ARE  THE  FOUR  BINARY  SIGNALS  WHICH  ARE 
.'CONVERTED  FROM  THTREE  TERNARY  CODE  WORD. 


Q3  :=  /N4B3T  *  P4B3T  *  /QO  *  /Ql  *  /Q2 

+  /N4B3T  *  P4B3T  *  QO  *  /Q2   *  Q3 
+  /N4B3T  *  QO  *  Ql  *  /Q2  *  /Q4 
+  /N4B3T  *  Ql  *  /Q2  *  Q3   *  /Q4 
+  /N4B3T  *  P4B3T  *  Q4 
+  /P4B3T  *  /QO  *  /Q2   *  Q3   *  /Q4 
+  N4B3T  *  /P4B3T  *  /QO  *  Q2   *  /Q4 
+  /P4B3T  *  /QO  *  Ql  *  Q3   *  /Q4 
+  N4B3T  *  /P4B3T  *  /Ql  *  Q2  *  /Q4 
+  N4B3T  *  /P4B3T  *  /Ql  *  Q3   *  /Q4 
+  /P4B3T  *  QO  *  Ql  *  /Q2   *  /Q4 


Q2  :=  /N4B3T  *  /QO  *  Ql  *  Q3  *  /Q4 

+  /N4B3T  *  P4B3T  *  Q2  *  /Q3  *  /Q4 

+  /N4B3T  *  QO  *  Ql  *  /Q3  *  /Q4 

+  N4B3T  *  /P4B3T  *  /QO  *  /Ql  *  /Q3 

+  /P4B3T  *  /QO  *  Q2  *  /Q3  *  /Q4 

+  N4B3T  *  /P4B3T  *  /QO  *  Q2 

+  /P4B3T  *  QO  *  Ql  *  /Q3  *  /Q4 

+  N4B3T  *  /P4B3T  *  Q4 


*  /Q3 
;  MSB  OF  THE  FOUR 
;   BINARY  SIGNALS 


Ql 


:=  /N4B3T  *  /QO  *  /Ql  *  /Q2  *  /Q3 
P4B3T  *  /QO  *  /Ql  *  Q2 
QO  *  /Ql  *  Q2  *  /Q3 
P4B3T  *  QO  *  Ql  *  /Q2 
P4B3T  *  Q2  *  /Q3 
Q4 

/QO  *  /Ql  *  /Q2  *  /Q3 
/QO  *  Ql  *  /Q2  *  Q3 
N4B3T  *  /P4B3T  *  /QO  *  Q3 
/P4B3T  *  QO  *  /Ql  *  /Q2  *  Q3 
/P4B3T  *  Q4 


/N4B3T 
/N4B3T 
/N4B3T 
/N4B3T 
/N4B3T 
/P4B3T 
/P4B3T 


QO 


:=  /N4B3T 
+  /N4B3T 
+  /N4B3T 
+  /N4B3T 
+  /N4B3T 
+  /N4B3T 
+  N4B3T  * 
+  /P4B3T 
+  N4B3T  * 
+  N4B3T  * 
+  /P4B3T 


/P4B3T  *  /QO  *  Ql 
/QO  *  /Ql  *  /Q3 

Ql  *  Q2  *  Q3 

*  QO  *  /Ql  * 

*  /Ql  *  Q2  * 


/QO  * 
P4B3T 
P4B3T 
Q4 
/P4B3T 

*  /Ql  * 
/P4B3T 
/P4B3T 

*  Q4 


*  Q3 


Q3 
Q3 


*  /QO  *  /Ql 
Q2  *  /Q3 

*  QO  *  Ql  *  /Q2 

*  QO  *  Q2  *  /Q3 


LSB  OF  THE  FOUR 
BINARY  SIGNALS 


2-492 


PLD  Devices  Implement  4B3T  Line  Transcoder 


SIMULATION 

TRACE_ON    MR  /CLK  P4B3T  N4B3T  /Q4  Q3  Q2  Ql  QO  ERROR 
;TO  RESET  WHOLE  DEVICES 

SETF  MR  ; STATE  #  =  Q4 (LOAD)   Q3  Q2  Ql  QO  &  ERROR 

; TEST  VECTOR  #1  TO  CHECK  SO,   SI,   S4 ,   S13,  AND  S16 


SETF  /MR  P4B3T  N4B3T 
CLOCKF  CLK 

SETF  /P4B3T  /N4B3T 
CLOCKF  CLK 

SETF  /P4B3T  /N4B3T 
CLOCKF  CLK 

SETF  /P4B3T  N4B3T 
CLOCKF  CLK 

SETF  /P4B3T  /N4B3T 
CLOCKF  CLK 

SETF  /P4B3T  /N4B3T 
CLOCKF  CLK 

SETF  P4B3T  /N4B3T 
CLOCKF  CLK 

SETF  /P4B3T  /N4B3T 
CLOCKF  CLK 

SETF  /P4B3T  /N4B3T 
CLOCKF  CLK 

SETF  /P4B3T  /N4B3T 
CLOCKF  CLK 


;S0  =  0  0  0  0  0,  ERROR 
.•INITIAL  CONDITION 

;S1  =00011 

;S4  =  0  1  1  0  0 

;S13  =11111 

;S1  =  0  0  0  1  1 

;S4  ==01100 

;S14  =10  0  11 

;S1  =  0  0  0  1  1 

;S4  =  0  1  1  0  0 

; SO  =  0  0  0  0  0,  ERROR 


; TEST  VECTOR  #2  TO  CHECK  SI,   S5,   S15,   S16,  AND  S17 

;S1  =00011 


SETF  /P4B3T  /N4B3T 
CLOCKF  CLK 

SETF  /P4B3T  N4B3T 
CLOCKF  CLK 

SETF  /P4B3T  /N4B3T 
CLOCKF  CLK 


SETF  /P4B3T  /N4B3T 
CLOCKF  CLK 

SETF  /P4B3T  N4B3T 
CLOCKF  CLK 


;S5  =  0  1  1  0  1 
;S15  =10  0  0  0 

;S1  =  0  0  0  1  1 
;S5  =  0  1  1  0  1 


^MonollthlcRfiEil 
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CLOCKF  CLK 


SETF  /P4B3T  /N4B3T 
CLOCKF  CLK 


;S1  =  0  0  0  1  1 


SETF  /P4B3T  N4B3T 


;S5  =  0  1  1  0  1 


CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ;S17  =10  0  0  1 

CLOCKF  CLK 

; TEST  VECTOR  #3  TO  CHECK  SI,   S6,   S18,   S19,  AND  S20 


CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ;S6  =  0  1  1  1  0 

CLOCKF  CLK 

SETF  /P4B3T  /N4B3T  ;S18  =1110  1 

CLOCKF  CLK 

SETF  /P4B3T  /N4B3T  ; SI  =  0  0  0  1  1 

CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ;S6  =  0  1  1  1  0 

CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S19  =11110 

CLOCKF  CLK 

SETF  /P4B3T  /N4B3T  ;S1  =  0  0  0  1  1 

CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ;S6  =  0  1  1  1  0 

CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ;S20  =10  10  1 

CLOCKF  CLK 

; TEST  VECTOR  #4  TO  CHECK  S2,   S7,   S20,   S18,  AND  S21 

SETF  /P4B3T  N4B3T  ;S2  =  0  0  1  1  1 

CLOCKF  CLK 

SETF  /P4B3T  /N4B3T  ;S7  =  0  0  1  0  0 

CLOCKF  CLK 

SETF  /P4B3T  /N4B3T  ;S20  =10   10  1 

CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S2  =  0  0  1  1  1 

CLOCKF  CLK 


SETF  /P4B3T  /N4B3T 


;si 


=  0  0  0  1  1 
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SETF  /P4B3T  /N4B3T  ;S7  =  0  0  1  0  0 

CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S18  =1110  1 

CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S2  =  0  0  1  1  1 

CLOCKF  CLK 

SETF  /P4B3T  /N4B3T  ;S7  =  0  0  1  0  0 

CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ;S21  =10  111 

CLOCKF  CLK 

;TEST  VECTOR  #5  TO  CHECK  SO,   S2,   S7,   S8,   S22,   AND  S23 

SETF  /P4B3T  N4B3T  ;S2  =  0  0  1  1  1 

CLOCKF  CLK 

SETF  /P4B3T  /N4B3T  ;S7  =  0  0  1  0  0 

CLOCKF  CLK 

SETF  P4B3T  N4B3T  ;S0  =  0  0  0  0  0,  ERROR 

CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S2  =  0  0   1   1  1 

CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S8  =  0  0  1  0  1 

CLOCKF  CLK 

SETF  /P4B3T  /N4B3T  ;S14  =10  0  11 

CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S2  =  0  0  1  1  1 

CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S8  =  0  0  1  0  1 

CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S22  =110  0  1 

CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S2  =  0  0  1  1  1 

CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S8  =  0  0  1  0  1 

CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ;S23  =10  110 

CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S2  =  0  0  1  1  1 

CLOCKF  CLK 


*T1  Monolithic  ["JJ  Memories  £1 
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SETF  /P4B3T  N4B3T  ;S8  =  0  0  1  0  1 

CLOCKF  CLK 

SETF  P4B3T  N4B3T  ; SO  =  0  0  0  0  0,  ERROR 

CLOCKF  CLK 

;TEST  VECTOR  #6  TO  CHECK  S2 ,   S9,   S24,   S25,  AND  S23 

SETF  /P4B3T  N4B3T  ;S2  =  0  0  1  1  1 

CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ;S9  =00110 

CLOCKF  CLK 

SETF  /P4B3T  /N4B3T  ;S24  =10  10  0 

CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S2  =  0  0  1  1  1 

CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ;S9  =  0  0  1  1  0 

CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S25  =1110  0 

CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S2  =  0  0  1  1  1 

CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ;S9  =  0  0  1  1  0 

CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ;S23  =10  110 

CLOCKF  CLK 

;TEST  VECTOR  #7  TO  CHECK  S3,   S10,   S16,   S26,  AND  S15 

SETF  P4B3T  /N4B3T  ; S3  =  0  1  0  1  1 

CLOCKF  CLK 

SETF  /P4B3T  /N4B3T  ;S10  =01000 

CLOCKF  CLK 

SETF  /P4B3T  /N4B3T  ;S16  =110  0  0 

CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ;S3  =  0   1  0   1  1 

CLOCKF  CLK 

SETF  /P4B3T  /N4B3T  ;S10  =01011 

CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S26  =110  11 

CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ;S3  =01011 

CLOCKF  CLK 
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SETF  /P4B3T  /N4B3T  ;S10  =01000 

CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ;S15  =10  0  0  0 

CLOCKF  CLK 

;TEST  VECTOR  #8  TO  CHECK  S3,   Sll,   S27,   S28,    *ND  S22 

SETF  P4B3T  /N4B3T  ; S3  =  0  1  0  1  1 
CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S11  =01001 
CLOCKF  CLK 

SETF  /P4B3T  /N4B3T  ;S27  =10  0  10 
CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ; S3  =  0  1  0  1  1 
CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S11  =01001 
CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S28  =110  10 
CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ; S3  =  0  1  0  1  1 
CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S11  =01001 
CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ;S22  =110  0  1 
CLOCKF  CLK 

; TEST  VECTOR  #9  TO  CHECK  SO,   S3,   S12,   S13,   S28,   AND  S25 

SETF  P4B3T  /N4B3T  ; S3  =  0  1  0  1  1 

CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ;S12  =01010 

CLOCKF  CLK 

SETF  /P4B3T  /N4B3T  ;S13  =11111 

CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ; S3  =  0  1  0  1  1 

CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ;S12  =01010 

CLOCKF  CLK 

SETF  /P4B3T  N4B3T  ;S28  =110  10 

CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ; S3  =  0  1  0  1  1 

CLOCKF  CLK 
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SETF  P4B3T  /N4B3T  ;S12  =01010 

CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ;S25  =1110  0 

CLOCKF  CLK 

SETF  P4B3T  /N4B3T  ;S3  =  0  1  0  1  1 

CLOCKF  CLK 

SETF  P4B3T  N4B3T  ;S0  =  0  0  0  0  0,  ERROR 

CLOCKF  CLK 

TRACE_OFF 
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Manchester  Encoding 

The  PALC22V10  has  been  programmed  to  give  a  Manchester 
encoded  output  for  eight  digital  TTL-level  inputs  or  a  bit-serial 
TTL  level  input.  The  reason  for  using  such  encoding  is  to 
remove  the  DC  component  from  a  digital  waveform  prior  to 
transmission.  If  the  data  were  transmitted  without  encoding, 
'DC  wander'  could  result. 

A  transmitted  binary  signal  consists  of  a  series  of  High  and 
Low  voltage  pulses.  If  the  signal  consists  of  primarily  High 
pulses,  a  mean  DC  component  will  exist.  If  the  signal  then 
becomes  primarily  Low  pulses,  the  mean  DC  component  will 
shift,  or  wander.  This  can  cause  transmission  problems  on 
capacitive  lines.  A  Manchester  encoded  signal  cannot  have  a 
DC  component. 

The  principle  of  Manchester  encoding  is  to  introduce  phase 
shifting  in  a  carrier  wave,  which  is  continuously  switching, 
such  that  logic  input  changes  are  represented  by  a  phase  shift 


of  that  carrier.  If  data  at  the  input  of  an  encoder  switches  from 
a  logic  High  to  a  logic  Low,  then  a  phase  shift  of  180  degrees 
(equivalent  to  a  polarity  shift)  is  performed.  When  a  Low-to- 
High  transition  occurs  at  the  input,  the  carrier  phase  is 
switched  back  by  180  degrees  to  the  original  phase. 

For  example,  a  logic  High  could  be  encoded  as  a  zero  phase 
shift,  and  a  logic  Low  as  a  180-degree  phase  shift  from  a 
reference.  The  result  in  the  encoded  signal  is  that  two 
successive  Highs  or  Lows  represent  either  a  positive  or 
negative  data  transition  (Figure  1).  When  the  data  in  the  figure 
toggles  Low  to  High  the  encoded  output  gives  two  successive 
High  levels  as  the  phase  of  the  carrier  shifts  by  180  degrees. 
The  transition  of  data  from  High  to  Low  will  generate  two 
successive  Low  outputs  in  the  encoded  signal,  returning  the 
signal  to  the  original  phase. 


1_ 


MANCHESTER  1 


:jirLrLRjuiJi_rLrLn_n 


Figure  1 .  A  Manchester  Encoded  Signal  from  a  Digital  TTL  Input 


Applications 

Manchester  encoding  is  applied  to  local  area  networks,  such 
as  Ethernet,  and  can  be  used  in  storage  of  data  on  magnetic 
media,  or  in  other  transmissions  including  infrared.  Both  data 
and  clock  are  imbedded  in  the  Manchester  encoded  signal. 
Clock  information  may  easily  be  extracted  from  the  encoded 


waveform 
clock  can 


by  using  phase-locked  loop  techniques,  and  the 
be  used  to  recover  the  encoded  data.  Thus,  the  data 


can  be  resynchronized  on  reception. 


Circuit 

Figure  2  shows  a  schematic  diagram  of  the  Manchester 
encoder  circuit.  The  design  has  been  made  to  be  compatible 
with  either  serial  or  parallel  data  output  from  standard  UART 
(Universal  Asynchronous  Receiver/Transmitter)  circuits.  The 
clock  input  to  the  system  will  come  from  the  clock  input  to  the 
UART  circuit,  which  is  typically  sixteen  times  the  data  rate. 
The  divide-by-sixteen  circuit  will  generate  a  square  wave 
output  at  R3,  the  original  data  rate  as  applied  to  the  UART.  This 
signal  is  available  as  an  output  from  the  system.  On  a  PCB,  the 
R3  signal  can  be  used  to  decode  the  data. 


£1  MonoUthioM  Memory  <H 
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DIVIDE  BY 

DIVIDE  BY 

>  SIXTEEN 

EIGHT 

R0R1R2R3 

RA   RB  RC 

CLOCK 
INPUT 


D6 
D5 
D4 
D3 
D2 
D1 
DO 


CLOCK  INPUT  DIVIDED  BY  SIXTEEN 


3E> 


R1  - 

R2  - 
R3  - 


R3  - 
RA- 
RB  - 

RC- 


MANCHESTER 
ENCODED  OUTPUT 


RST 


READY  OUTPUT 
FLIP-FLOP 


RST 

— o- 


Figure  2.  Manchester  Encoder  Circuit 


A  further  counter  synchronously  clocked  is  a  divide-by-eight 
counter  with  outputs  RA,  RB  and  RC.  These  outputs  will  select, 
in  the  case  of  a  parallel  data  input,  one  of  the  individual  data 
lines  D0-D7.  The  selected  data  line  is  routed  through  the 
multiplexer  and  applied  to  the  input  of  an  eXclusive-OR  (XOR) 
gate.  One  of  the  inputs  to  the  XOR  gate  is  the  data  clock  rate, 
available  at  the  R3  flip-flop  output.  The  function  of  the  XOR 
gate  has  been  configured  in  the  PALC22V10  from  a  'sum-of- 
products'  architecture. 

The  phase  encoded  signal  has  been  derived  at  the  output  of 
flip-flop  RX.  Inverting  the  data  gives  a  180-degree  phase  shift 
from  the  reference  established  at  the  R3  output.  The  selected 
data  input  line  determines  whether  or  not  phase  (polarity) 
inversion  should  take  place.  If  the  data  input  is  Low,  no 
inversion  takes  place;  if  High,  the  reference  signal  is  inverted 
to  give  the  required  phase  shift. 

Equations 

The  general  output  equation  is: 

RX  :=  R3*DATA  +  R3*DATA  (=  R3  :+:  DATA).  (1) 

The  data  input  is  selected  by  a  one-of-eight  condition  of  the 
state  machine  RA,  RB,  and  RC.  RA  is  the  clock  reference 
divided  by  two,  RB  is  the  clock  divided  by  four  and  RC  by  eight. 


For  a  parallel  input,  data  DO  is  selected  to  control  the  output 
polarity  at  RX  when  RA'RB'RC  is  True,  D1  when  RA*RB*RC  is 
True,  D2  when  RA*RB*RC  is  True,  etc.  This  procedure 
serializes  the  parallel  data.  The  Manchester  encoded  signal  is 
created  by  this  dynamic  control  of  the  inverting  input  to  the 
XOR  configuration  through  the  selected  data  input. 

There  is  a  serial  select  input  to  the  multiplexer  circuit  such  that 
when  driven  active  High,  serial  data  applied  to  the  DO  input  will 
encode  the  carrier  signal  on  a  bit-by-bit  basis.  Input  lines  D1- 
D7  are  ignored  when  the  encoder  is  used  in  this  mode.  The  SER 
mode  should  be  hard-wired  Low  for  parallel  data  input. 

To  enable  external  handshake  to  be  accomplished  in  either 
serial  or  parallel  mode,  a  RDY  output  is  available.  Wheri  a 
complete  byte  or  bit  is  encoded  and  transmitted  to  the  output 
RX,  the  RDY  pin  will  go  High  to  request  more  data.  Note  that  the 
Manchester  encoder  does  not  latch  the  data;  it  must  remain 
valid  for  the  duration  of  the  encoding.  The  logic  circuit  required 
for  the  RDY  pin  is  shown  as  a  sum  of  product  terms  as  applied 
to  the  'D'  input  of  the  RDY  flip-flop. 

The  data  that  is  applied  to  the  input  is  available  in  a  serial  form 
at  the  DTST  output  and  can  be  used  for  test  purposes.  DTST  is 
simply  the  serialized  output  of  the  data  input  D0-D7. 
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PAL  Device  Design  Considerations 

The  PALC22V10  was  chosen  for  this  application  because  of 
the  large  number  of  product  terms  required  to  encode  the  XOR 
function.  Up  to  sixteen  product  terms  can  be  used  to  create  an 
XOR  function  on  eight  data  inputs.  The  PALC22V10  also  offers 
low  power  with  TTL  compatibility,  reducing  power  supply 
requirements  while  increasing  reliability. 

The  flip-flops  RO,  R1,  R2,  R3,  RA,  RB,  and  RC  form  a  long 
binary  count  chain.  The  output  from  R3  forms  the  reference 
phase  output  which  is  the  clock  input  divided  by  sixteen.  As 
discussed,  RA,  RB,  and  RC  form  a  binary  state  machine  to 
select  D0-D7  data  inputs  individually. 


Design  Procedure 

The  fol  owing  figures  show  the  design  procedure  used  to  derive 
equations  for  the  binary  divide-by-sixteen  counter  R0-R3.  The 
Boolean  equations  have  been  derived  using  the  PALASM®  2 
software  operators. 

Figure  3  shows  the  counter  states  in  hexadecimal  and  binary 
formats.  Figure  4  shows  two  Karnaugh  maps,  the  state 
assignment  and  state  transition;  Figure  4a  shows  the 
hexadecimal  value  of  the  current  state  of  the  counter,  and 
Figure  4b  shows  the  next  state.  For  example,  the  location  in 
the  state  assignment  map  of  hex  'A'  would  be  represented  by 
the  next  state  hex  'B'  in  the  transition  map. 


State 

RO 

R1 

R2 

R3 

0 

0 

0 

0 

0 

1 

1 

0 

0 

0 

2 

0 

0 

0 

3 

1 

0 

0 

4 

0 

1 

0 

5 

1 

1 

0 

6 

0 

1 

0 

7 

1 

1 

0 

8 

0 

0 

9 

1 

0 

A 

0 

0 

B 

1 

0 

C 

0 

1 

D 

1 

1 

E 

0 

1 

F 

1 

1 

Figure  3.  Binary  Counter  Truth  Table.  The  Conditions  of  the 
Flip-flops  RO,  R1 ,  R2,  and  R3  Are  Shown  in  the  Truth 
Table  with  the  Hexadecimal  Value  of  the  Binary 
Weighting 


v     R1  RO 
R2\  10 

11 

01 

00 
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B 
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1 
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4a.  State  Assignment  Map 


R1  R0 
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B 
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00 
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Figure  4.  The  State  Assignment  Map  (4a)  Shows  the  Current 
Hexadecimal  Value  in  the  Appropriate  Position  in  the 
Map.  The  State  Transition  Map  (4b)  Shows  Next 
State.  For  Example,  State  A  in  the  Assignment  Map 
is  Shown  as  B  in  the  Transition  Map  Because  that  is 
the  Next  Count  in  the  Sequence 


Figure  5  shows  the  binary  weighting  of  the  hexadecimal  values 
of  the  flip-flops  R0,  R1 ,  R2,  and  R3  of  the  transition  map.  From 
this  map,  individual  Karnaugh  maps  are  drawn  for  each 
individual  flip-flop,  and  equations  derived  by  performing 
minimization  for  each  flip-flop. 


v     R1  R0 
R2\  10 

11 

01 

00 

10 

1101 

0011 

0101 

1001 

11 

1111 

0000 

0111 

1011 

01 

1110 

0001 

0110 

1010 

00 

1100 

0010 

0100 

1000 

Figure  5.  State  Transition  Map  with  the  Binary  Weighting  of 
the  Hexadecimal  Entries.  The  Register  Significance 
of  the  Entry  is  from  Left  to  R^ght.  For  Example,  1 101 
Represents  the  Binary  Weighting  of  R0  R1  /R2  R3 
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Figure  6  shows  the  Karnaugh  maps  and  the  equations  tor  either 
true  or  complement  outputs.  For  true  outputs,  minimization  is 
performed  on  the  logic  One  entries  in  the  map.  Where  an 
inverting  buffer  is  used  between  the  internal  flip-flop  and  the 
outside  world,  minimization  is  performed  on  the  logic  Zero 
entries  in  the  map  (shown  bolder).  This  is  because  the  internal 
flip-flop's  output  is  set  High,  but  the  state  of  the  output  pin  is  a 
logic  Low  due  to  the  inverting  action  of  the  buffer.  With  the 
output  configuration  of  the  PALC22V10,  it  is  possible  to 
choose  either  inverted  or  non-inverted  outputs,  so  either 
equation  will  be  suitable. 


/R0:=R0      (1)  /R1  :=  R0"R1  + /R07R1  (3) 

R0:=/R0       (2)  HI  := /R0*R1  +  R07R1  (4) 

6a.  Map  for  Flip-flop  RO.  6b.  Map  for  Flip-flop  R1 . 


The  equations  derived  for  a  general  counter  of  length  N 
become: 

on    :=  qn*qn-1*qn-2*...q2*q1*q0 
+  qn-qnTi 

+  QN'QI 
+  QN'QO 
using  the  inverting  option,  or: 

QN      :=  QN*QN-1*QN-2*...Q2*Q1*Q0 
+  QN'QNM 
+  QN'5FF2 


+  QN*Q1 
+  QN*QO 
for  the  non-inverting  option. 

This  generalized  equation  requires  N  +  1  product  terms  for  a 
general  flip-flop,  QN.  For  example,  Q7,  in  a  binary  count 
sequence,  will  require  a  summation  of  eight  product  terms.  The 
complete  design  specification  is  shown  in  the  PAL  device 
Design  Specification  of  Pattern  04. 


v     R1  RO 
R3R2\.  10, 
10 


XD 


11 


Vo  |Uy 


/R2  :=  R2'R1'R0  +  /R27R1  *  /R27R0  (5) 
R2  :=  /R2*R1'R0 1  R27R1  +  R27R0  (6) 


6c.  Map  for  Flip-flop  R2. 


;R3  :=  R3'R2*R1"R0  *  /R37R2 
+  /R37R1  +/R37R0  (7) 

R3  :=  /R3-R2-R1-R0  +  R37R2 
♦  R37R1  ♦  R37R0  (8) 

6d.  Map  for  Flip-flop  R3. 


Design  Results 

Figure  7  is  an  oscilloscope  photograph  of  the  outputs  of  the 
programmed  device.  The  top  trace  is  the  RDY  signal.  It  goes 
High  to  indicate  that  a  complete  data  byte  has  been 
transmitted  and  that  the  next  data  byte  may  be  presented  to 
the  data  inputs  D0-D7.  The  second  trace  is  DTST,  the  serial 
data  stream  D0-D7  of  the  parallel  data  input  D0-D7.  The  bottom 
trace  is  the  Manchester  encoded  waveform  of  the  parallel  data 
input,  or  RX.  The  markers  show  one  complete  frame  of 
information. 


Figure  6.  The  Karnaugh  Map  of  Figure  5.  Broken  Down  for 
Individual  Flip-flops  RO,  R1,  R2,  and  R3. 
Minimization  is  Performed  for  Each  Individual  Flip- 
flop 


DTST 


2  4   g  PHASE   C  0  4 


The  PALASM  2  software  operators  describing  the  equation 

are: 

*  logical  AND 

+  logical  OR 

/    logical  inversion 

:=  registered  output  equation. 


Figure  7.  Oscilloscope  Trace  of  Programmed  Device.  The 
Manchester  Encoded  Output  is  RX 
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THE  PALC22V10  HAS  BEEN  DESIGNED  TO  FUNCTION  AS  A  MANCHESTER 
ENCODER  CIRCUIT.     THE  BINARY  DATA  INPUT  WILL  DETERMINE  WHETHER 
THE  TRANSMISSION  SIGNAL  SHOULD  BE  IN  PHASE  WITH  A  CLOCK 
REFERENCE  OR  SHIFTED  BY   180   DEGREES  FROM  THAT  REFERENCE. 
THE  CHARACTERISTIC  OF  A  MANCHESTER  ENCODED  SIGNAL   IS  A 
CHANGE  OF  PHASE  FOR  EACH  TRANSITION  OF  THE  DATA  INPUT. 
ASSUMING  THAT    'X'    IS  THE  PHASE  OF  THE  CARRIER  THAT  ENCODES 
A  LOGIC  LOW  INPUT,   WHEN  THE  INPUT  DATA  TRANSITIONS 
TO  A  LOGIC  HIGH  THE  PHASE  OF  THE  CARRIER  WILL  BE  SHIFTED 
TO   ' X  +  180'.      IT  WILL  CHANGE  BACK  TO   'X'   ONLY  WHEN  THE 
DATA  INPUT  TOGGLES  FROM  A  LOGIC  HIGH  TO  A  LOGIC  LOW. 
THE  KEY  FEATURES  OF  THE  ENCODED  SIGNAL  ARE: 

1.  DATA  AND  CLOCK  FREQUENCY  INFORMATION  IS 
CONTAINED  IN  THE  ENCODED  SIGNAL.  CLOCK 
INFORMATION  MAY  BE  EXTRACTED  BY  USING  A 
PHASE  LOCKED  LOOP  IN  THE  RECEIVER. 

2.  NO  DC  COMPONENT  IS  CONTAINED  IN  THE 
PHASE  ENCODED  SIGNAL. 

THE  APPLICATIONS  OF  PHASE  ENCODED  SIGNALS  CAN  BE  IN  SHORT 
COMMUNICATIONS  LINKS,  SUCH  AS  ETHERNET  OR  INFRARED  TRANS- 
MISSIONS,  OR  IN  STORAGE  ON  SOME  MAGNETIC  MEDIA. 

THERE  ARE  EIGHT  DATA  INPUTS,   DO  TO  D7 .      DATA  PRESENT  ON 
THESE  PINS  WILL  BE  ENCODED  INTO  A  BI-PHASE  SIGNAL  AT  THE 
OUTPUT  QX,   WHEN  THE  SER   (SERIAL)    INPUT  DATA  PIN  IS  CONFIGURED 
TO  A  LOGIC  ZERO.      TO  CONVERT  A  SERIAL  DATA  STREAM  OF  LOGIC 
ONES  AND  ZEROS   INTO  A  PHASE  ENCODED  SIGNAL  THE  SER  INPUT 
IS  TIED  HIGH,    AND  THE  DATA   IS  APPLIED  TO  THE  DO  INPUT 
OF  THE  PAL  DEVICE.     A  CLOCK  INPUT  OF  16  TIMES  THE  DATA  RATE 
IS  REQUIRED  FOR  THE  PARALLEL  OR  SERIAL  ENCODING  OF  THAT 
DATA,    SO  THE  DESIGN  MAY  BE  USED  WITH  MANY  POPULAR  UART  LSI 
CIRCUITS.      THERE  IS  ONE  HANDSHAKE  OUTPUT  FROM  THE  DEVICE, 
RDY.      THIS  SIGNAL  GOES  HIGH  WHEN  THE  NEXT  DATA  BYTE  OR  BIT 
IS  REQUIRED  TO  BE  SET  UP  AT  THE  INPUT. 
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CHIP  MANENC  PAL22V10 


;PIN 

1 

2 

3 

4 

5 

6 

CLK 

/RST 

DO 

Dl 

D2 

D3 

;PIN 

7 

8 

9 

10 

11 

12 

D4 

D5 

D6 

D7 

NC 

GND 

;PIN 

13 

14 

15 

16 

17 

18 

SER 

RDY 

/DTST 

RC 

RB 

RX 

;PIN 

19 

20 

21 

22 

23 

24 

RA 

R3 

R2 

Rl 

R0 

VCC 

GLOBAL 


STRING  SO 
STRING  SI 
STRING  S2 
STRING  S3 
STRING  S4 
STRING  S5 
STRING  S6 
STRING  S7 


'/RC*/RB*/RA' 
'/RC*/RB*  ra 1 
'/RC*  RB*/RA ' 
'/RC*  RB*  RA' 
'  RC*/RB*/RA' 
'  RC*/RB*  RA' 
1  RC*  RB*/RA ' 
'    RC*   RB*  RA' 


EQUATIONS 
GLOBAL. SETF 


RST 


/R0 
/Rl 

/R2 
/R3 

RDY 


RO 


:=  Rl*  RO 
+  /R1*/R0 

:=     R2*  Rl*  RO 
+  /R2*/R1 
+  /R2*  /RO 

:=     R3*  R2*  Rl*  RO 
+  /R3*/R2 
+  /R3*  /Rl 
+  /R3*  /RO 

:=  RC*RB*RA*R3*R2*R1*R0*/SER 
+  R3*R2*R1*R0*  SER 


GLOBAL  TERM 
ENABLES  RESET. 
STRING  DECLARATIONS. 
ENCODE  STATE  0 
ENCODE  STATE  1 
ENCODE  STATE  2 
ENCODE  STATE  3 
ENCODE  STATE  4 
ENCODE  STATE  5 
ENCODE  STATE  6 
ENCODE  STATE  7 
FOR  PARALLEL  INPUT 

50  SELECTS  DO  INPUT, 

51  SELECTS  Dl  INPUT, 
ETC. 


SET  INITIAL 
CONDITIONS,  RESET. 
RO   DIVIDES  THE  CLOCK 
INPUT  BY  2. 
Rl  DIVIDES  THE  CLOCK 
BY  4. 

R2  DIVIDES  THE  CLOCK 
BY  8. 


R3  DIVIDES  THE  CLOCK 
BY  16. 


RDY  OUTPUT  PERFORMS 
A  HANDSHAKE  OPERATION. 
WHEN  HIGH,   NEW  DATA 
MAY   BE  APPLIED. 
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/RA 


/RB 


/RC 


/RX 


RA*  R3 

*   R2*   Rl*  R0*/SER 

+ 

/RA*/R3 

*  /SER 

+ 

/RA* 

/R2*  /SER 

+ 

/RA* 

/Rl*  /SER 

+ 

/RA* 

/R0*/SER 

+ 

SER 

RB* 
/RB* 
/RB* 
/RB* 
/RB* 

/RB*/RA* 


/DTST: 


RA*  R3*  R2*  Rl*  R0*/SER 
/R3*  /SER 
/R2*  /SER 
/Rl*  /SER 
/R0*/SER 
/SER 
SER 


RC*  RB*  RA*  R3*  R2*  Rl*  R0*/SER 


/RC* 
/RC* 
/RC* 
/RC* 
/RC* 
/RC*/RB* 


/R3' 


/R2 


/RA* 


/SER 
/SER 
/Rl*  /SER 
/R0*/SER 
/SER 
/SER 
SER 


S0*D0 

S1*D1*/SER 
S2*D2*/SER 
S3*D3*/SER 
S4*D4*/SER 
S5*D5*/SER 
S6*D6*/SER 
S7*D7*/SER 

S0*/R3*D0 

S1*/R3*D1*/SER 

S2*/R3*D2*/SER 

S3*/R3*D3*/SER 

S4*/R3*D4*/SER 

S5*/R3*D5*/SER 

S6*/R3*D6*/SER 

S7*/R3*D7*/SER 


S0*R3*/D0 
S1*R3*/D1 
S2*R3*/D2 
S3*R3*/D3 
S4*R3*/D4 
S5*R3*/D5 
S6*R3*/D6 
S7*R3*/D7 


*/SER 
*/SER 
*/SER 
*/SER 
*/SER 
*/SER 
*/SER 


COUNTER   [RA,   RB,   RC ] 
COUNTS  THE  NUMBER 
OF  BITS  TRANSMITTED 
WHEN  A  BYTE  IS 
APPLIED  TO  THE 
DO  -  D7  DATA  INPUTS. 
THE  LOGIC  CONDITION 
OF  THE  SELECTED 
INPUT  IS  MULTIPLEXED 
TO  THE  RX  REGISTERED 
OUTPUT  BY  THE   ' ONE  OF 
EIGHT'    STATE  SELECTOR, 
IN  THE  DIVIDE  BY  EIGHT 
COUNTER  RA,   RB,   RC.  FOR 
SERIAL  MODE  /RA*/RB*/RC 
SELECTS  DATA  INPUT  DO, 
RA*/RB*/RC  SELECTS  Dl , 
ETC.,   WHILE  /SER  SELECTS 
PARALLEL  OPERATION.  IF 
SER   IS  ACTIVE,  SERIAL 
ENCODING   IS  SELECTED. 
THE  SERIAL  DATA  APPLIED 
TO  THE  DO   INPUT  IS 
ENCODED.      Dl  -  D7 
INPUTS  ARE  DESELECTED. 
DO  -  D7   IS  SERIALIZED 
FOR  PARALLEL  MODE. 
FOR  SERIAL  MODE,    DO  ONLY 
IS  CLOCKED  THROUGH  TO 
THE  DTST  OUTPUT. 
/DTST  OUTPUT  IS  THE 
SERIALIZED  ENCODING 
OF  THE  DATA  INPUT 
DO  -  D7,   OR  DO. 
THE  RX  OUTPUT  PROVIDES 
THE  MANCHESTER  ENCODED 
OUTPUT  OF  THE  REFERENCE 
FREQUENCY,   WHICH  OCCURS 
AT  THE  R3  OUTPUT.  THE 
PHASE  AT  R3   IS  SHIFTED 
AT  THE  RX  OUTPUT   IF  THE 
SELECTED  DATA   INPUT  PO- 
LARITY CHANGES.      IF  /SER 
IS  INACTIVE,   DO  -  D7 
INPUTS  ARE  SELECTED.  IF 
/SER   IS  ACTIVE,    ONLY  THE 
SERIAL  INPUT  APPLIED  TO 
DO   IS  SELECTED. 
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SIMULATION 

TRACE_ON  CLK  RO  Rl  R2  R3  RA  RB  RC  RX 
/DTST  RDY  DO  Dl  D2  D3  D4  D5 
D6  D7  SER  /RST 

SETF  /CLK  RST  DO  Dl  /D2  D3  D4  /D5 
/D6  /D7  /SER 

SETF  RST 

CLOCKF  CLK 

SETF  /RST 

FOR  I    :=  1  TO  128  DO 

BEGIN  CLOCKF  CLK 

END 

SETF  SER  DO 

FOR  I    :=  1  TO  3  2  DO 

BEGIN  CLOCKF  CLK 

END 

TRACE  OFF 


TRACE  ALL  SIGNALS. 

SET   INITIAL  CONDITIONS 
FOR  DATA  &  CONTROL  IN- 
PUTS.  PARALLEL  MODE  SET. 
RESET  ACTIVE  TO 
INITIALIZE  THE 
REGISTERS 
CLOCK  8  DATA  BITS 
THROUGH  TO  R3  &  RX 
OUTPUTS . 

SET  SERIAL  MODE  VIA 
SERIAL  CONTROL  PIN. 
CLOCK  SYSTEM  TO  SELECT 
SERIAL  OPERATION. 
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Peripherals  are  integral  to  any  computer  system.  They  range 
from  basic  keyboards  and  printers  to  mass  storage  disks/tapes. 
In  this  section  we  will  focus  on  mass  storage  systems  such  as  disk 
drives  and  tape  drives. 

A  typical  mass  storage  subsystem  can  be  thought  of  in  terms  of 
two  major  sections:  the  controller  and  the  storage  device.  The 
controller  provides  an  interface  between  the  host  and  the  storage 
device.  It  may  control  one  or  several  storage  devices  at  a  time. 
Common  storage  devices  are  disk  drives  (Winchester  and  floppy) 
and  tape  drives  (reel  and  cartridge). 

The  discussion  that  follows  will  be  oriented  toward  the  storage 
device  being  a  disk  drive  although  it  could  just  as  easily  have  been 
done  using  tape  drives. 

Figure  1  shows  two  configurations  of  disk  storage  subsystems.  In 
Figure  1 A  the  storage  device  is  connected  to  the  host  via  a  string 
controller.  Communication  between  the  storage  device  and  the 
controller  is  accomplished  through  standardized  interface  proto- 
cols. Examples  of  common  disk  drive  interfaces  in  use  today  are 
SMD  (Storage  Module  Device),  ESMD  (Enhanced  SMD),  ESDI 
(Enhanced  Small  Disk  Interface),  ST506,  and  SA465.  An  appli- 
cation example  of  an  ESDI  interface  controller  is  shown  on  page 
2-509.  An  intelligent  storage  device  is  created  when  the  string 
controller  function  is  embedded  into  the  storage  device  (Figure 
1B).  Intelligent  storage  devices  typically  communicate  tothe  host 
through  a  host  adapter  using  either  SCSI  (Small  Computer 


System  Int 
protocols, 


erf  ace)  or  IPI  (Intelligent  Peripheral  Interface)  interface 


ESMD 
SMD 
ESDI 


ST506 

STRING 

/1SA465 

STORAGE 

CONTROLLER 

\  

V 

DEVICE 

a.  Siring  Controller  With  a  Storage  Device 


V 


IPI 

A  SCSI  N 

HOST 

c- K, 

ADAPTER 

INTELLIGENT 
STORAGE  DEVICE 


b.  Host  Adapter  With  an  Intelligent  Storage  Device 


Figure  1. 


Controller 

The  controller  function  can  be  conceptually  divided  into  several 
sections:  the  Host  Bus  Interface,  the  Data  Buffer,  the  Kernel  and 
the  Device  Interface. 


ESMD 
SMD 
ESDI 
ST506 
,  SA465 


KERNEL 

HOST  BUS 

DEVICE 

-v 

INTERFACE 

INTERFACE 

DATA 

BUFFER 

Figure  2. 

The  Host  Bus  Interface  provides  the  connections  to  the  host  CPU 
and  main  memory.  It  can  serve  as  either  a  slave  or  a  master  on 
the  host  bus.  When  the  controller  is  a  bus  slave,  the  host  CPU 
must  be  able  to  access  the  controller,  pass  commands,  and 
obtain  subsystem  status  using  established  host  bus  protocol. 
The  controller  behaves  as  a  bus  master  while  transferring  data  in 
either  direction,  or  posting  status.  A  bus  master  must  be  able  to 
arbitrate  for  the  host  bus  and  execute  the  necessary  handshake 
protocols. 

The  Data  Buffer  stores  the  data  being  read  from  or  written  to  the 
disk  and  checks  for  data  integrity.  The  physical  buffer  can  serve 
several  purposes:  data  transfer  rate  adaptor  between  the  host 
and  the  disk,  a  temporary  storage  for  data  going  to  the  host  until 
it  can  be  verified  error  free,  and  a  cache.  Depending  on  the 
complexity  of  the  data  buff  er  it  may  be  as  small  as  one  byte  or  as 
large  as  a  complete  track  of  data. 

Data  integrity  can  be  monitored  for  datatransfers  from  the  host  as 
well  as  from  the  disk  drive.  The  sophistication  used  to  monitor 
data  integrity  varies  significantly  from  simple  single  bit  error 
detection  (parity)  to  large  multiple  bit  error  detection  and  correc- 
tion. The  host  may  elect  to  transfer  data  with  a  parity  bit  for  each 
byte.  Data  error  detection  and  correction  schemes  are  employed 
between  the  controller  and  the  disk  drive.  As  storage  densities 
have  increased,  so  have  the  size  of  the  errors.  The  demand  for 
more  capable  error  detection  and  correction  codes  that  can  more 
accurately  detect  and  correct  larger  number  of  bits  in  error  has 
increased  accordingly. 
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Such  error  detection  codes  as  CRC  code  are  satisfactory  if  the 
error  occurs  in  the  datatransmission  channel  and  the  data  can  be 
recovered  by  retransmission.  Data  errors  caused  by  media 
defects  require  the  capability  of  both  error  detection  and  correc- 
tion. As  the  nature  of  media  errors  has  changed,  so  havethe  error 
detection  and  correction  codes  being  used.  Fire  Codes  and 
computer  generated  codes  have  been  the  most  popular  codes. 
Symbol  oriented  codes,  such  as  Reed-Solomon  codes,  are 
emerging  as  the  solutions  for  the  future. 

The  Kernel  is  the  brain  of  the  controller  and  the  manager  of  the 
subsystem.  It  translates  commandsfrom  the  host  into  commands 
for  the  storage  device.  It  receives  requests  from  the  operating 
system  through  the  host  interface  and  notifies  the  host  when  the 
request  is  completed.  To  execute  the  host's  request  for  data,  the 
kernel  may  have  to  access  the  storage  device  or  it  may  only  have 
to  access  a  cache.  When  accessing  the  storage  device  the  kernel 
sends  commands  according  to  the  device  interface  protocols. 

The  Device  Interface  communicates  directly  with  the  storage 
device.  Using  the  particular  interface  protocols,  the  device  inter- 
face sends  commands,  retrieves  status,  and  transfers  data 
between  the  controller  and  the  storage  device.  The  device  inter- 
faces shown  in  Figure  3  are  predominantly  disk  and  would  have 
to  be  changed  to  accommodate  tape  drives.  An  application 
example  of  a  tape  drive  interface  controller  is  shown  on 
page  2-519. 

Storage  Device 

The  storage  device  can  also  be  divided  into  several  sections:  the 
Interface  &  Control,  the  Servo  and  the  Read/Write. 


ESMD 
SMD 
ESDI 
ST506 
.  SA465 „ 


SERVO 

INTERFACE 

&  CONTROL 

READ /WRITE 

The  Servo  is  responsible  for  the  movement  of  the  media  and  the 
positioning  of  the  read/write  heads.  For  disk  drives  D.C.  brush- 
less  motors  are  predominantly  used  to  drive  the  spindle.  Head 
positioning  is  done  either  with  a  stepper  motor  or  a  voice  coil 
actuator.  In  the  case  of  tape  drives,  the  Servo  is  only  responsible 
for  capstan  motor  control  because  the  position  of  the  read/write 
heads  is  fixed. 

The  Read/Write  is  responsiblefor  storing  and  retrieving  datafrom 
the  media.  The  data  encoding  method  must  balance  data  density 
and  ease  of  data  recovery.  Several  Run-Length-Limited  (RLL) 
coding  techniques  are  in  use  today.  Winchester  disk  drives  have 
used  frequency  modulation  (FM),  modified  FM  (MFM),  modified 
MFM  (MMFM),  and  are  currently  using  2,7  to  obtain  present  day 
densities.  Tape  drives  employ  different  RLL  codes  than  disk 
drives.  The  two  common  codes  in  use  for  tape  storage  are  phase 
encoding  (PE)  and  group-coded  recording  (GCR).  The  applica- 
tion note  on  page  2-541  shows  an  example  of  a  GCR  encoder  im- 
plemented with  PAL  devices. 

When  the  storage  device  is  configured  with  an  embedded  control- 
ler (Figure  4)  the  Dewce  /n/erfacesection  of  the  controller  and  the 
Interface  &  Control  section  of  the  storage  device  are  not  required. 
The  Kernel  will  directly  control  the  storage  device. 
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Figure  4. 


PLDs  have  been  used  in  designs  throughout  mass  storage 
subsystems.  Applications  range  from  consolidation  of  random 
logic  to  address  decoding,  bus  arbitration  logic  to  protocol  state 
machines,  and  complex  encoding  and  decoding.  Application 
notes  and  technical  articles  are  included  here  to  aid  designers  in 
understanding  the  benefits  of  implementing  different  designs  with 
programmable  logic. 


The  Interface  &  Control  is  the  brain  of  the  storage  device. 
It  interprets  the  commands  from  the  controller.  It  supervises  the 
servo  and  the  read/write  and  maintains  the  device  status. 
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The  ESDI  Translator 

ESDI  is  a  low-cost,  high-performance  interface  standard  suita- 
ble for  the  smaller,  high-performance  Winchester  disk  drives 
currently  being  produced.  The  ESDI  interface  consists  of  a 
control  cable  and  a  data  cable.  The  control  cable  allows  for  a 
daisy  chain  connection  of  up  to  seven  devices  (disk  or  optical 
drives)  with  only  the  last  device  being  terminated.  In  our  de- 
sign, we  assumed  that  the  device  is  a  disk  drive.  The  data 
cable  is  attached  in  a  radial  configuration  (See  Figure  1). 


HOST 


CONTROLLER 


Figure 


DC  VOLTAGES 
(RADIAL) 


FRAME  GROUND  - 


-CJ3 


J2    DRIVE  NO.  1 


7 


J2  DRIVE  NO.  2 
■a  „ 


7 


-f~_  J2    DRIVE  NO.  N 


TERMINATOR 
LAST  DRIVE 
ONLY 


1.  Connection  Between  the  Controller  and  Multiple 
Drives 


The  ESDI  Translator  handshakes  serial  commands  from  a  disk 
controller,  deserializes  the  commands  and  passes  the  com- 
mands to  a  microcontroller.  The  command  data  word  structure 
is  shown  in  Figure  2. 

The  Command  Function  bits  define  functions  to  be  executed  by 
the  disk  drive.  These  functions  are  seek,  recalibrate,  request 
status,  request  configuration,  select  head  group,  control,  data 
strobe  offset,  track  offset,  initiate  diagnostics,  set  bytes  per 
sector  and  set  configuration.  Some  of  these  functions,  such 
as  the  control  function,  have  modifiers  for  more  detailed  func- 
tional description.  Other  commands  have  parameters  that  con- 
tain numbers.  For  the  seek  command,  the  parameter  specifies 
the  cylinder  number  that  the  drive  will  seek  to.  The  request 
status  and  request  configuration  commands  require  data  from 
the  disk  drive  to  be  transferred  back  to  the  disk  controller.  In 
our  current  design,  internal  registers  A  and  B  in  the  LCA  device 
represent  the  upper  and  lower  bytes  of  the  command  respec- 
tively. 

Figure  3  illustrates  the  relationships  between  the  disk  control- 
ler, the  ESDI  Translator,  and  the  microcontroller.  The  PROM  is 
used  to  store  the  configuration  data  for  the  LCA  device.  The 
Done/Program  (D/P)  output  is  driven  LOW  when  the  device  is 
being  configured.  Configuration  data  is  read  from  the  PROM 
device  during  configuration.  After  configuration  is  complete, 
the  LCA  device  drives  the  D/P  pin  HIGH  to  deselect  the  PROM. 
Drive  selection,  write  protection,  command  completion  and 
fault  detection  are  also  handled  by  the  ESDI  Translator. 


ODD 
PARITY 


15 

14 

13 

12 

11 

10 

9 

8 

7 

6 

5 

4 

3 

2 

1 

0 

P 

CMD  FUNCTION 

CMD  MODIFIER 

00000000 

P 

CMD  FUNCTION 

CMD  PARAMETER 

P 

REGISTER  A  REGISTER  B 

Figure  2.  Command  Data  Word  Structure 
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Figure  3.  An  ESDI  Translator  Implemented  on  the  LCA  Device 


Why  Use  an  LCA  Device  in  an  ESDI 
Translator 

The  ESDI  interface  standard  requires  more  logic  functions  to 
be  built  into  the  disk  drive  than  some  other  interface  stan- 
dards, such  as  the  ST506  standard.  However,  the  external  di- 
mensions of  a  disk  drive  usually  have  to  conform  to  an  industri- 
al standard  form  factor.  Thus,  the  use  of  high-density  semicus- 
tom  chips  is  the  logical  solution  to  increase  functionality  with- 
out increasing  the  external  dimensions. 

The  LCA  device  is  a  high-density  CMOS  integrated  circuit 
available  from  Monolithic  Memories.  Its  high  gate  density  al- 
lows the  implementation  of  an  ESDI  Translator  in  a  single  chip. 
Fifteen  standard  SSI  and  MSI  chips  would  be  necessary  for  the 
same  application.  If  PLDs  are  used  to  implement  the  ESDI 
Translator,  more  than  one  would  be  necessary  because  a  large 
amount  of  logic  is  required,  thus  occupying  more  board  space. 

A  major  advantage  in  using  the  LCA  device  is  that  it  can  speed 
up  the  design  cycle,  enabling  the  manufacturer  to  have  a 
shorter  time-to-market.  Also,  many  peripheral  products  are 
produced  in  relatively  small  quantities  aiming  at  very  special- 
ized markets.  The  LCA  device,  which  has  no  NRE  cost,  makes 
the  production  of  small  quantities  more  economical  than  the 
gate  array.  Another  advantage  of  the  LCA  device  over  other 
semicustom  chips,  such  as  the  gate  array,  is  its  reprogramma- 
bility  feature.  The  LCA  device  is  RAM-based  which  can  easily 
be  reprogrammed  by  the  user  in  the  final  system.  This  feature 
is  especially  important  in  the  peripheral  products  market, 
where  many  products  have  short  life  spans. 


Design  Implementation 

The  ESDI  Translator  is  responsible  for  all  control  interfaces 
between  the  disk  controller  and  the  disk  drive.  An  internal 
block  diagram  of  the  ESDI  Translator  is  shown  in  Figure  4.  It 
consists  of  five  major  logic  building  blocks: 

•  Drive  selection 

•  Read  gate/write  gate 

•  Counter/controller 

•  Shift  register  and  parity  generator/checker 

•  Internal  register  address  decoder 

Drive  selection  on  ESDI-compatible  drives  involves  three 
signals  from  the  disk  controller,  Drive  Select  0-2.  These  three 
drive  select  lines  are  encoded  so  that  up  to  seven  drives  may 
be  connected  to  the  same  ESDI  port,  as  shown  in  Table  1 . 

On  the  LCA  device,  the  drive  number  is  selected  by  connecting 
the  drive  switch  pins  to  either  VCC  or  GND.  When  the  code  on 
the  drive  select  lines,  DSX,  equals  the  code  on  the  drive  switch 
pins,  DSWX,  where  X  may  be  0,  1,  or  2,  the  drive  is  selected 
and  the  Drive  Selected  signal,  DSELD,  is  asserted.  Once  the 
drive  has  been  selected,  serial  commands  output  by  the  disk 
controller  will  be  read  by  the  LCA  device.  The  actual  implemen- 
tation is  shown  in  Figure  5,  using  two  CLBs  and  seven  lOBs. 
BDSO  and  BDS1  are  the  names  of  the  CLBs  in  the  current  de- 
sign. In  our  design,  names  that  begin  with  the  letter  B  or  P  are 
used  to  designate  a  CLB  or  an  IOB  respectively.  SCLK  is  the 
system  clock. 


2-510 


£1  MonollthlcMMomorlBS  £1 


LSB 


MSB 


COMMAND  DATA  - 


ADDRESS 
LOGIC 


•  CONFIG  STATUS  DATA 


-  RD 
WR 

•  AO,  A1/CS 


ERROR 
REGISTER 

CMD  COMP 
REGISTER 

r 

i 

TRANSFER  REQUEST  - 
TRANSFER  ACKNOWLEDGE  - 
ATTENTION  ■ 
COMMAND  COMPLETE  - 


DRIVE  SELECT  (0-2) 
DRIVE  SWITCH  (0-2) 

DRIVE  SELECTED  - 
READ  GATE  IN  - 
WRITE  GATE  IN 
WRITE  PROTECT  - 


COUNTER/ 
CONTROLLER 


DRIVE 
SELECTION 


READ  GATE/ 
WRITE 
GATE 
LOGIC 


EXT.  CHANGE 
OF  STATUS 


-READ  GATE  OUT 
-WRITE  GATE  OUT 


Figure  4.  ESDI  Translator  Internal  Block  Diagram 
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Figure  5.  Configuration  of  the  BDSO  and  BDS1  CLBs  to 
Provide  the  Drive  Selected  Signal 
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Figure  6  shows  the  configurations  of  the  two  CLBs  as  dis- 
played on  the  computer  screen  by  the  XACT  development  soft- 
ware. In  Figure  6a,  the  CLB  is  configured  as  one  function  of 
four  variables.  The  D  flip-flop  is  not  used.  The  logic  representa- 
tion, truth  table,  Karnaugh  map,  signal  names,  block  name  and 
Boolean  equation  are  shown. 

In  Figure  6b,  the  CLB  is  configured  as  one  function  of  three 
variables,  the  output  of  which  is  connected  to  the  D  flip-flop. 
The  Q  output  of  the  D  flip-flop  becomes  the  output  of  this  CLB. 
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Figure  6a.  Drive  Selection  Logic  Implemented  in  a 
CLB  (BDSO) 
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Figure  7.  Read  Gate/Write  Gate  Logic  Implementation 


The  Counter/Controller  handshakes  commands  from  the  disk 
controller.  It  also  handshakes  status/configuration  data  to  the 
disk  controller.  It  is  also  responsible  for  generating  the  Inter- 
rupt, Attention  and  Command  Complete  signals.  Seventeen  bit 
commands  (one  bit  is  parity)  are  transferred  from  the  disk  con- 
troller to  the  LCA  device  via  the  Command  Data  line.  The  serial 
bit  transfer  is  performed  using  a  pair  of  handshaking  signals, 
Transfer  Request  (TREQ)  and  Transfer  Acknowledge  (TACK). 
TREQ  is  asserted  by  the  disk  controller  when  a  bit  is  valid  on 
the  Command  Data  line,  and  TACK  is  asserted  by  the  LCA  de- 
vice when  the  command  bit  has  been  read.  The  handshaking 
action  is  shown  in  Figure  8. 
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Figure  6b.  Drive  Selection  Logic  Implemented  in  a 
CLB  (BDS1) 


TACK 


\ 


X 


The  LCA  device  also  performs  logic  functions  for  the  Read 
Gate  and  Write  Gate  logic  blocks.  The  Read  Gate  signal  allows 
data  to  be  read  from  the  disk,  and  the  Write  Gate  signal  allows 
data  to  be  written  on  the  disk.  These  signals  from  the  disk 
controller  are  input  to  the  LCA  device  as  Read  Gate  In  and 
Write  Gate  In.  Under  normal  operating  conditions,  Read  Gate 
Out  is  asserted  when  Read  Gate  In  is  asserted  and  Write  Gate 
Out  is  asserted  when  Write  Gate  In  is  asserted.  When  both 
Read  Gate  In  and  Write  Gate  In  are  asserted,  a  write  error 
condition  results  and  the  Attention  line  is  asserted,  signalling 
the  disk  controller  that  an  error  has  occurred.  Also,  the  LCA 
device  may  be  used  to  provide  write  protection  to  a  disk  drive. 
When  the  Write  Protect  signal  is  asserted,  the  Write  Gate  Out 
signal  will  not  be  asserted  when  the  Write  Gate  In  signal  is 
asserted,  preventing  the  write  circuitry  from  being  activated. 
These  logic  functions  are  implemented  in  two  CLBs,  BRWGO 
and  BRWG1 ,  as  shown  in  Figure  7. 


Figure  8.  Command  Data  Transfer 


A  17-state  counter  counts  the  number  of  command  bits  shifted 
in  or  shifted  out  of  the  data  registers.  Its  implementation  is 
shown  in  Figure  9.  The  SHIFT  signal  is  asserted,  thus 
incrementing  the  counter,  whenever  there  is  a  transfer  request 
and  the  LCA  device  is  selected.  CNT_Q0  is  the  lowest  bit  of  the 
shift  register  counter.  This  bit  is  inverted  whenever  SHIFT  is 
asserted  unless  sixteen  bits  have  already  been  shifted  into  the 
LCA  device  (CNT16  asserted,  or  CKCMD  asserted  when  all 
seventeen  bits  have  been  shifted  in).  The  DATAOUT  signal  is 
asserted  after  a  Request  Status  command  or  a  Request 
Configuration  command  has  been  transferred  and  the  internal 
data  registers  have  been  loaded  with  data  to  be  serially  shifted 
out.  Hence,  when  DATAOUT  is  asserted,  CKCMD  is  negated 
and  the  counter  is  enabled. 
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Figure  9.  17-State  Counter  Implemented  In  six  CLBs 


The  logic  generation  of  the  Attention  signal,  ATTEN,  is  shown 
in  Figure  10.  Whenever  there  is  a  write  fault,  WRFLT,  a  parity 
error,  PARERR,  an  interface  fault,  INTFLT,  or  an  external  error, 
CSTS(Change  of  Status),  the  ATTEN  signal  is  asserted.  When 
both  WRGIN  and  RDGIN  signals  are  asserted  and  the  LCA 


device  is 
signal  is 


selected,  the  WRFLT  signal  is  asserted.  The  WRFLT 
negated  when  the  command  transferred  to  the  LCA 


device  Register  A  is  the  Reset  command  defined  by  the  ESDI 
standard  which  has  a  Command  Function  of  0101  (Control) 
and  a  Command  Modifier  of  0000  (Reset  Attention  and 
Standard  Status).  The  Command  Function  and  Command 
Modifier  formats  are  shown  in  Figure  2. 

Three  CLBs,  BPG0,  BPG1,  and  BPG2,  are  responsible  for 
generating  the  PARERR  signal.  BPG0  is  a  multiplexer  that 
selects  the  source  of  the  input  to  the  parity  generator/checker 
(BPG1).  If  data  is  being  shifted  into  the  LCA  device  (parity 
checker  mode),  the  CMDBITA  input  is  chosen.  If  data  is  being 
shifted  out  of  the  LCA  device  (parity  generator  mode),  the 
RA_Q7  output  is  chosen.  These  two  signals  are  also  shown  in 


Figure  11.  BPG1  is  an  odd  parity  generator/checker.  In  the 
parity  checker  mode,  whenever  an  odd  number  of  ones  are 
passed  through  this  CLB,  the  output  is  one.  This  output  signal 
is  connected  to  BPG2,  which  inverts  the  signal  and  asserts  or 
negates  the  PARERR  signal  accordingly.  If  parity  is  correct,  an 
interrupt  is  asserted  to  the  microcontroller  informing  the 
microcontroller  that  a  command  has  been  received  and  is 
ready  to  be  read.  In  the  parity  generator  mode,  the  output  of 
BPG1  is  the  parity  bit.  BPG1  is  clocked  by  the  SHIFT  input, 
which  is  asserted  whenever  there  is  a  transfer  request  and  the 
LCA  device  is  selected.  BPG1  is  reset  by  the  Reset  Parity 
Generator  input,  RSTPGEN.  This  signal  is  asserted  when  ei- 
ther INT  is  asserted  or  an  interface  fault  is  detected. 

The  Interface  Fault  signal  is  asserted  when  the  LCA  device  is 
selected  and  CNTR  is  negated  before  seventeen  bits  have 
been  transferred.  CNTR  is  asserted  after  the  first  command  bit 
is  shifted  in  and  is  negated  after  the  seventeenth  bit  is  shifted 
out.  BINTFLT0  is  clocked  by  the  system  clock,  SCLK,  and 
reset  by  the  RSTCOS  signal.  The  IOB  PCSTS  is  configured  as 
a  buffered  input  to  signal  external  error  conditions. 

The  microcontroller  is  able  to  address  four  register  locations  in 
the  LCA:  two  data  registers,  BRGA  and  BRGB,  one  error 
register,  BDMX,  and  one  command  complete  register,  BRC7. 
Only  three  bits  in  the  error  register  are  used.  They  are  bit  0  for 
parity  error,  bit  4  for  interface  fault  and  bit  7  for  write  fault.  Only 
one  bit  in  the  command  complete  register  is  used.  This  is  bit  0, 
which  has  a  value  of  zero  when  the  command  is  completed. 
The  addresses  and  contents  of  the  registers  are  shown  in 
Table  2. 


A1  AO 

REGISTER 

BIT  NUMBER 
7  6  5  4  3  2  1  0 

0  o 

REGISTER  A 

0  1 

REGISTER  B 

1  0 

ERROR 
REGISTER 

1  1 

COMMAND 
COMPLETE  REGISTER 

Table  2.  Register  Addresses  and  Contents 

The  LCA  device  implementation  of  the  registers  and  multiplexer 
is  shown  in  Figure  11.  When  the  interrupt  signal  is  asserted, 
the  microcontroller  reads  the  two  data  registers  by  setting  the 
A1  and  AO  address  lines  appropriately  and  asserting  the  RD 
signal  to  the  LCA  device.  These  data  registers  contain  the 
command  that  is  transferred  from  the  disk  controller  through 
the  CMDBITA  input.  If  the  command  is  a  request  data 
command,  configuration  or  status  data  is  written  to  these  two 
data  registers  by  the  microcontroller.  These  two  bytes,  plus  a 
parity  bit  that  is  generated  by  the  parity  generator  in  the  LCA 
device,  are  serially  transferred  to  the  disk  controller  over  the 
Config/Status  Data  line  through  RA_Q7.  After  all  seventeen 
bits  have  been  transferred,  the  Command  Complete  signal  is 
asserted.  If  the  command  is  not  a  request  data  command,  the 
microcontroller  executes  the  command  and  upon  completion, 
writes  a  byte  of  zeros  to  the  command  complete  register  of  the 
LCA.  When  the  command  complete  register  is  written  with  all 
zeros,  the  Command  Complete  signal  is  asserted  by  the  LCA 
device. The  command  complete  register  may  only  be  written, 
not  read.  The  status  register  contains  error  bits  that  are  set 
when  errors  are  detected.  This  register  may  only  be  read  and 
not  written. 
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Figure  10.  Generation  of  the  Attention  Signal  by  the  Four  Possible  Error  Conditions 
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Design  Considerations 

The  circuit  diagram  of  the  programmed  LCA  device  is  shown  in 
Figure  12.  This  design  implementation  uses  sixty-three  of  the 
sixty-four  CLBs  within  the  LCA.  This  translates  into  a  98% 
usage.  The  rightmost  column  of  CLBs  contains  the  multiplexer, 
which  selects  between  registers  A,  B,  or  the  error  register  to 
be  placed  on  the  data  bus.  Registers  A  and  B  are  placed  on  the 
third  and  second  column  from  the  right  end,  respectively.  The 
registers  and  multiplexer  are  placed  physically  close  to  each 
other  to  simplify  routing.  The  CLBs  which  execute  a  particular 
function  are  placed  physically  next  to  each  other.  These 
functions  include  the  parity  generator/checker,  17-state 
counter,  read  gate/write  gate  logic,  drive  selection  logic  and 
error  detection  logic.  The  read  gate/write  gate  logic  and  the 
drive  selection  logic,  which  require  much  I/O  activity,  are 
placed  in  CLBs  near  the  edges  of  the  device. 

Another  consideration  in  implementing  an  LCA  design  is 
routing.  Long  lines  are  available  for  signals  that  have  to  travel 
a  long  distance  within  the  LCA  device.  These  lines  can  also  be 
used  for  signals  that  must  have  minimal  skew  between  differ- 
ent destinations.  An  automatic  routing  program  is  available  in 
the  XACT  software  package. 

Routing  requires  careful  consideration  when  a  design  uses  a 
high  percentage  of  the  available  CLBs  (above  95%).  Although 
automatic  routing  is  available,  designs  with  high  CLB  usage 
may  require  point-to-point  routing  (EDITNET  command  in 
XACT).  Some  of  the  techniques  that  can  be  used  are  swapping 
input  pins,  swapping  CLBs,  and  implementing  buffers  in 
unused  sections  of  the  CLBs.  When  swapping  input  pins,  the 
designer  must  be  careful  because  certain  functions,  such  as 
the  clock  input  to  the  flip-flop,  may  only  be  input  on  certain 
pins.  Swapping  CLBs  is  very  simple  because  the  XACT 
software  provides  a  command  for  swapping  CLBs,  but 


sometimes  not  all  of  the  signals  can  be  successfully  rerouted. 
Passing  a  signal  through  a  CLB  presents  a  routing  channel  that 
is  not  otherwise  available.  However,  a  delay  is  added  to  the 
signal.  Usually,  a  combination  of  these  techniques  can  be 
used  to  successfully  route  a  high-density  design. 

The  high  number  of  lOBs  in  the  LCA  device  gives  the  designer 
much  flexibility  in  designing  the  pinout  of  the  device.  In  this 
design,  thirty-five  of  the  lOBs  are  used.  The  lOBs  that  are  not 
being  used  for  the  actual  design  are  not  left  unused,  but  are 
configured  as  outputs  and  are  connected  to  various  signals 
within  the  LCA  device  to  provide  test  points  for  the  LCA  device. 
This  greatly  increases  the  testability  of  the  design  once  it  is 
placed  on  the  board. 

XACT  is  used  by  the  designer  to  define  the  CLBs  and  lOBs, 
and  to  perform  EDITNET.  Alternately,  the  design  may  be  input 
using  the  schematic  capture  software  offered  by  Daisy  and 
Futurenet.  In  these  methods,  an  automatic  place  and  route 
software  package  divides  the  design  into  blocks  that  may  be 
implemented  in  CLBs  and  lOBs,  and  routes  the  CLBs  and 
lOBs  automatically.  Once  the  design  has  been  completed,  it  may 
be  simulated  using  the  software  package  P-SILOS. 


Conclusion 

The  LCA  device  provides  many  advantages  to  the  user.  Its 
high  gate  count  and  I/O  capability  could  potentially  replace 
several  PLDs  in  many  applications,  hence  reducing  board 
space.  The  LCA  device  is  preferred  by  many  customers  over 
gate  arrays  because  it  is  reprogrammable  'on-the-fly'  and  there 
are  no  long  design  cycles  and  initial  NRE  cost  of  a  gate  array. 
The  current  design  file,  XDES15.LCA,  is  available  upon 
request.  The  bit  pattern  and  the  .LCA  file  will  be  provided  for 
programming  the  LCA  device  in  an  EPROM. 
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Controller  in  PROSE 


Introduction 

The  increasing  complexity  of  today's  computers  is  requiring  in- 
creased use  of  distributed  control  design  techniques.  Such  a 
system  might  include  a  32-bit  microprocessor  as  the  Central 
Processing  Unit  (CPU),  a  1 6-  or  8-bit  microcontroller  as  a  periph- 
eral controller,  and  a  complex  sequencer  as  a  peripheral  drive 
controller.  The  new  PROSE  (PROgrammable  SEquencer)  de- 
vice from  Monolithic  Memories  offers  a  simple,  fast,  and  effective 
method  for  implementation  of  peripheral  drive  controllers. 

This  article  will  describe  the  design  of  a  QIC-02  compatible  tape 
drive  sequencer  in  the  PROSE  device.  The  design  requirements 
will  be  outlined,  and  then  the  design  process  will  be  described  in 
detail  for  a  section  of  the  design. 

QIC-02  Tape  Drive 


The  peripheral  to  be  controlled  by  our  circuit  is  a  1/4"  streaming 
tape  drive  that  operates  according  to  the  QIC-02  interface  stan- 
dard. This  standard  specifies  the  command  set  and  resulting  tape 
drive  control  signals,  as  shown  in  Figure  1. 


COMMAND 


Invalid  Command 


Read  Status 


Beginning  of  Tape 


Erase  Entire  Tape 


Initialize  Tape 


Write 


Read 


Read  File  Mark 


Write  Fi  e  Mark 


QIC-02  COMMAND* 


D7  D6  D5  D4  D3  D2  D1  DO 


0     0  1 


1     1  0 


1  1 


1      1  0 


1  1 


1 


1111 


1110 


1  1 


1      0  1 


1    1  1 


1    1  1 


1  1 


"The  8  bit  commands  shown  in  this  table  are  inverted  from  the 
QIC-02  specification.  This  is  because  the  PAL  device  has 
been  programmed  to  accept  active  low  signal  inputs  as 
defined  by  the  QIC-02  specification. 

Figure  1.  QIC-02  Commands 


Each  command  requires  a  certain  sequence  of  events  for  execu- 
tion. For  example,  the  Erase  Tape  command  requires  rewinding 
the  tape,  asserting  the  Erase  and  Initialize  signals,  moving  the 
tape  forward  to  the  end,  removing  Erase  and  Initialize,  and  then 
rewinding  the  tape  again.  This  type  of  function  requires  feedback 
of  current  state  information,  and  additional  inputs,  for  next  state 
selection.  This  is  the  basic  definition  of  a  sequencer,  or  state 
machine  (Figure  2). 

Sequencers  are  a  common  design  element;  they  include  any 
digital  device  which  traverses  through  a  sequence  of  states  in  an 
orderly  fashion.  A  state  is  a  set  of  values  (a  count)  measured  at 
different  parts  of  the  circuit.  State  machines  are  often  used  for 
timing  delays,  control  signal  generation,  arbitration,  event  moni- 
toring, and  multiple  condition  testing. 

Our  tape  drive  sequencer  must  provide  the  proper  control  signals 
for  the  tape  drive  and  then  hold  them  until  executed,  monitor  the 
status  of  the  tape  drive,  and  watch  for  new  commands  from  the 
microcontroller.  The  block  diagram  of  the  system  is  shown  in 
Figure  3. 

The  peripheral  microcontroller  generates  the  8-bit  QIC-02  com- 
mands in  response  to  the  central  32-bit  microprocessor.  The 
sequencer  decodes  the  command,  determining  which  of  the 
commands  has  been  sent,  and  then  begins  execution  of  the 
command. 

Handshaking  is  performed  by  a  Request  signal  from  the  micro- 
controller, signifying  that  a  valid  command  is  available,  and  a 
Ready  signal  from  the  sequencer,  signifying  that  the  command 
has  been  decoded  or  executed  and  a  new  command  can  be  sent 
(Figure  4).  Upon  reset,  the  PROSE  device  asserts  READY, 
informing  the  peripheral  controller  that  the  tape  drive  is  ready  to 
accept  a  command.  The  peripheral  controller  asserts  REQUEST 
after  issuing  a  command  on  the  eight-bit  data  bus.  When  the 
PROSE  device  detects  REQUEST  asserted,  READY  is  negated 
and  the  command  is  decoded.  After  the  command  is  decoded, 
READY  is  asserted  by  the  PROSE  device,  at  which  time  RE- 
QUEST is  negated  by  the  peripheral  controller.  When  REQUEST 
is  negated,  READY  is  negated  by  the  PROSE  device. 


COMBINATORIAL 
LOGIC 


Figure  2.  General  State  Machine. 
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Afterthe  PROSE  device  decodes  the  command,  the  command  is 
executed  by  outputting  the  proper  drive  codes  and  asserting  the 
propersignals.  Afterthe  command  has  been  completed,  READY 
is  asserted  by  the  PROSE  device,  informing  the  peripheral 
controller  that  the  next  command  may  be  issued. 

Command  Execution 

The  PROSE  device  must  be  reset  after  power  up  before  any 
command  execution  takes  place.  This  is  accomplished  by 
pulsing  the  Preset  pin  low.  This  places  the  PROSE  device  into  the 
Initialize  state.  The  first  command  that  is  issued  by  the  peripheral 
controller  must  be  a  Select  Drive  command.  If  the  correct  drive 
is  selected,  the  PROSE  device  will  enter  a  Drive  Selected  state. 
A  Select  Drive  command  that  selects  an  incorrect  drive  or  any 
other  command  will  be  ignored.  Once  in  the  Drive  Selected  state, 
any  command  issued  will  be  executed.  The  drive  is  unselected 
and  returned  to  the  initialize  state  by  issuing  a  Select  Drive 
command  for  a  different  drive. 

The  following  text  describes  the  execution  of  each  of  the  QIC-02 
commands  (see  QIC-02  specification  for  timing  diagrams): 

Beginning  of  Tape  Command 

The  Beginning  of  Tape  command  code  directs  the  PROSE  device 
to  output  the  Rewind  drive  code.  This  causes  the  tape  drive  to 
rewind  the  tape.  When  BOT  (Beginning  Of  Tape)  is  asserted  by 
the  tape  drive,  the  Rewind  drive  code  is  negated  and  READY  is 
asserted.  BOT  is  asserted  when  the  Beginning  of  Tape  hole  is 
detected  by  the  tape  drive. 


Erase  Command 

The  Erase  command  code  directs  the  PROSE  device  to  output 
the  Rewind  drive  code  until  BOT  is  asserted,  then  assert  ERASE 
and  INITIALIZE.  When  ERASE  is  asserted,  the  tape  drive 
activates  the  erase  head  and  moves  the  tape  forward  until  EOT 
(End  Of  Tape)  is  asserted  by  the  tape  drive.  EOTis  asserted  when 
the  End  of  Tape  hole  is  detected  by  the  tape  drive.  When  EOT  is 
asserted,  ERASE  and  INITIALIZE  are  negated  and  the  Rewind 
drive  code  is  output  until  BOT  is  asserted.  When  BOTis  asserted, 
the  Rewind  drive  code  is  negated  and  READY  is  asserted. 

Initialize  Command 

The  Initialize  command  code  directs  the  PROSE  device  to  assert 
INITIALIZE  and  to  output  the  Rewind  drive  code.  When  BOT  is 
asserted,  the  Rewind  drive  code  is  negated  and  the  Fast  Forward 
drive  code  is  output.  When  EOT  is  asserted,  the  Fast  Forward 
drive  code  is  negated  and  the  Rewind  drive  code  is  output. 
Finally,  when  BOT  is  asserted,  the  Rewind  drive  code  and 
INITIALIZE  are  negated  and  READY  is  asserted. 

Write  File  Mark  Command 

The  Write  File  Mark  command  directs  the  PROSE  device  to 
output  the  Write  File  Mark  drive  code.  When  FM  is  asserted,  the 
Write  File  Mark  drive  code  is  negated  and  READY  is  asserted. 
FM  is  asserted  after  the  file  mark  has  been  written. 


Figure  3.  Peripheral  Controller  System  Including  Tape  Drive  Sequencer. 
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Figure  4.  Command  Transfer  Protocol. 
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Read  Command 

Priorto  issuing  the  Read  command,  ONLINE  must  be  asserted  by 
the  peripheral  controller.  If  ONLINE  is  not  asserted  when  the 
Read  command  code  is  decoded,  the  Invalid  Command  drive 
code  will  be  output  and  the  PROSE  device  will  wait  for  the  next 
command.  When  the  Read  command  is  issued  and  ONLINE  is 
asserted,  the  PROSE  device  outputs  the  Read  drive  code.  Once 
the  Read  drive  code  has  been  output,  the  Read  command  may  be 
terminated  by  either  the  tape  drive  or  the  peripheral  controller. 
The  tape  drive  terminates  the  Read  command  by  asserting  FM. 
FM  is  asserted  when  a  file  mark  is  detected.  When  this  occurs, 
the  Read  command  code  is  negated  and  READY  is  asserted.  The 
peripheral  controller  may  terminate  the  Read  command  by  either 
negating  ONLINE  or  by  issuing  a  Read  File  Mark  command. 
When  ONLINE  is  negated,  the  Read  drive  code  is  negated  and 
the  Rewind  drive  code  is  output  until  BOT  is  detected,  at  which 
time  READY  is  asserted.  When  the  Read  File  Mark  command  is 
issued,  the  Read  drive  code  is  negated  and  the  Read  File  Mark 
command  is  executed. 

Read  File  Mark  Command 

The  Read  File  Mark  command  directs  the  PROSE  device  to 
output  the  Read  File  Mark  drive  code.  The  Read  File  Mark  code 
causes  the  tape  drive  to  move  to  the  next  file  mark.  When  FM  is 
asserted  by  the  tape  drive,  the  Read  File  Mark  drive  code  is 
negated  and  READY  is  asserted.  FM  is  asserted  when  afile  mark 
is  detected. 

Read  Status  Command 

The  Read  Status  command  directs  the  PROSE  device  to  transfer 
six  bytes  of  status  data  to  the  peripheral  controller.  After  the 
command  is  decoded,  the  PROSE  enables  the  first  byte  by 
outputting  the  Status  Byte  1  drive  code  and  asserts  READY.  The 
peripheral  controller  responds  by  reading  the  status  byte  and 
asserting  REQUEST.  When  the  PROSE  device  detects  RE- 
QUEST asserted,  READY  is  negated  and  the  next  status  byte  is 
enabled.  READY  is  not  asserted  until  after  the  peripheral 
controller  negates  REQUEST.  The  remaining  five  status  bytes 
are  transferred  in  a  similar  manner.  After  all  six  bytes  are 
transferred,  READY  is  asserted. 

Write  Command 

We  will  look  at  the  Write  command  in  detail  as  an  example  of  the 
design  process.  The  Write  command  allows  the  peripheral 
controller  to  write  data  onto  the  tape.  Data  is  transferred  via  the 
same  eight-bit  bus  that  provides  the  command  code. 

Prior  to  issuing  the  Write  command,  Online  must  be  asserted  by 
the  peripheral  controller.  If  Online  is  not  asserted  when  the  Write 
command  code  is  decoded,  the  sequencer  will  assert  the  Invalid 
Command  signal  and  wait  for  the  next  command. 

If  Online  is  asserted  when  the  Write  command  is  decoded,  the 
sequencer  will  assert  the  Write  signal  for  the  tape  drive.  Data  is 
sent  from  the  peripheral  controller  to  the  tape  drive  on  the  eight- 
bit  bus,  and  the  data  is  written  onto  the  tape. 


The  Write  command  may  be  terminated  by  either  the  tape  drive 
or  the  peripheral  controller.  The  tape  drive  may  cause  the  Write 
command  to  be  terminated  iftheendof  thetape  is  reached.  When 
the  early  warning  hole  is  detected  by  the  tape  drive,  it  asserts 
EWH.  When  this  happens,  the  sequencer  must  negate  the  Write 
signal  and  then  assert  the  End  of  Media  signal.  It  then  waits  to 
decode  the  next  command. 

The  peripheral  controller  may  terminate  the  Write  command 
when  all  of  the  data  has  been  written.  This  may  be  noted  by 
negating  the  Online  signal,  or  by  asserting  the  Request  signal. 
When  Online  is  negated,  the  sequencer  must  negate  the  Write 
signal.  It  then  outputs  the  Write  File  Mark  signal  until  the  file  mark 
is  activated  by  the  tape  drive.  Once  the  file  mark  is  activated,  the 
sequencer  must  rewind  the  tape. 

When  the  Request  signal  is  asserted  by  the  peripheral  controller, 
the  sequencer  negates  the  Write  signal.  It  then  decodes  the  new 
command,  which  usually  would  be  a  Write  File  Mark  command. 

State  Diagram  Representation 

All  of  this  complex  interaction  can  be  described  in  a  simple  state 
diagram.  State  diagrams  are  the  universal  language  for  describ- 
ing state  machines.  The  diagrams  depict  the  two  major  opera- 
tions in  a  state  machine:  control  sequencing  and  output  genera- 
tion. Control  sequencing  is  changing  from  present  state  to  next 
state.  It  can  be  direct,  based  only  upon  present  state,  or 
conditional,  based  upon  present  state  and  input  conditions. 
Similarly,  output  generation  can  be  direct,  based  upon  present 
state  only,  or  conditional,  based  upon  present  state  and  input 
conditions.  The  former  is  called  a  Moore  state  machine,  while  the 
latter  is  a  Mealy  state  machine. 

Transitions  for  our  tape  drive  sequencer  depend  on  the  state  and 
conditional  inputs,  as  with  the  Request  and  Online  inputs  during 
execution  of  the  Write  command.  Outputs,  however,  will  depend 
only  on  the  present  state;  thus,  our  sequencer  will  be  a  Moore 
state  machine. 

The  state  diagram  for  a  Moore  machine  is  simple  to  understand. 
Each  state  is  represented  by  a  bubble,  and  each  transition  is 
represented  by  an  arrow.  Conditions  required  for  transitions,  if 
any,  are  listed  next  to  the  arrows.  Outputs  associated  with  each 
state  are  listed  with  the  state  name  in  the  bubble. 

Write  Command  State  Diagram 

The  state  diagram  for  the  Write  command  is  shown  below. 
Thirteen  states  are  shown.  The  diagram  is  entered  from  the  top, 
from  the  part  of  the  state  diagram  that  performs  command 
decoding.  WRCMD  is  the  state  entered  when  the  Write  command 
is  decoded.  In  this  state  the  sequencer  asserts  the  Ready  signal, 
telling  the  peripheral  controllerthat  it  has  decoded  the  signal.  The 
sequencer  then  waits  for  the  peripheral  controller  to  remove  the 
Request  signal  before  continuing  to  state  WRCMDA,  or  A  for 
short. 

In  state  A  the  sequencer  immediately  checks  to  make  sure  that 
Online  is  asserted  before  beginning  the  Write  sequence.  If  it  is 
not,  the  Invalid  Command  signal  is  output  and  the  sequencer 
enters  another  part  of  the  state  diagram  (CMDERRA). 
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It  Online  is  asserted,  tne  sequencer  moves  10  siaie  d,  asserimy 
the  Write  signal  to  the  tape  drive  to  allow  writing  to  begin.  The 
sequencer  moves  from  states  B,  E,  and  F  and  back  to  B,  until  the 
Write  process  is  interrupted.  The  tape  drive  responds  by  moving 
forward  and  writing  the  information  provided  on  the  eight-bit 
data  bus. 


(COMMAND  DECODING) 


(DRVSELD) 


(BOTCMDA) 


416  05 

Figure  5.  Write  Command  State  Diagram. 
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tape  drive  or  the  peripheral  controller.  State  B  checks  for 
assertion  of  the  EWH  (Early  Warning  Hole)  signal  from  the  tape 
drive.  If  asserted,  the  sequencer  immediately  negates  the  Write 
signal  in  state  C,  and  then  asserts  the  EOM  (End  of  Media)  signal 
to  the  peripheral  drive  controller  in  state  D.  The  sequencer  then 
goes  to  the  Drive  Selected  state  (DRVSELD)  and  begins  the 
command  decoding  procedure  in  another  part  of  the  state 
diagram. 

State  E  checks  for  assertion  of  the  Request  signal,  signalling  the 
end  of  the  data  to  be  written.  In  this  case,  the  sequencer  would 
immediately  negate  the  Write  signal  and  go  to  the  Drive  Selected 
state. 

State  F  checks  for  removal  of  the  Online  signal.  If  Online  is  true, 
the  sequencer  moves  back  to  state  Band  continues  writing.  If  not, 
it  will  move  to  state  G,  removing  the  Write  signal,  and  then  state 
H,  outpufting  the  Write  File  Mark  signal.  It  then  waits  in  state  H 
until  the  File  Mark  signal  is  activated  by  the  tape  drive,  indicating 
that  the  file  mark  has  been  written.  When  File  Mark  is  activated, 
the  Write  File  Mark  signal  is  removed  in  state  I,  and  the  sequencer 
moves  to  the  Beginning  of  Tape  command  sequence  to  rewind 
the  tape. 

Device  Selection 

So  far,  the  design  has  been  described  and  defined  without  regard 
to  the  implementation  methodology.  The  complete  sequencer 
can  be  implemented  in  several  medium-size  discrete  logic  de- 
vices, or  one  large  custom  circuit. 

The  best  choice  is  often  a  programmable  logic  device.  Program- 
mable logic  provides  the  integration  of  custom  logic  with  the  quick 
time-to-market  of  standard  products.  Especially  important  in  this 
application  is  the  ability  to  adapt  to  design  changes.  TheQIC-02 
specification  can  be  modified  to  enhance  performance  or  add 
features.  Of  course,  any  design  errors  can  also  be  quickly  fixed, 
even  after  layout  of  the  PC  board. 

Programmable  logic  devices  offer  the  complexity  to  implement 
large  designs  in  few  devices.  At  the  same  time,  software  support 
tools  allow  very  simple  design  entry,  even  of  complex  designs. 
This  design  will  require  almost  no  additional  work  to  describe  for 
PLD  software.  Device  programmers  provide  an  instant  custom 
device,  and  can  even  test  the  device  immediately  after 
programming. 

Selection  of  PROSE  Device 

The  PROSE  device  is  selected  as  the  heart  of  this  application. 
The  primary  reason  is  the  number  of  states  required  for  implem- 
entation of  the  sequencer.  We  saw  that  the  Write  command  alone 
had  thirteen  states,  although  some  were  transitions  to  other  parts 
of  the  state  diagram.  There  are  eight  other  commands  required, 
totaling  over  100  states.  Only  the  PROSE  device  provides 
enough  states  in  a  single  programmable  chip. 

Another  requirement  in  many  applications  is  high  speed.  The 
PROSE  device  offers  a  maximum  frequency  of  30  MHz  internally. 
At  the  same  time,  the  PROSE  device  has  enough  logic  on  board 
to  require  very  few  surrounding  logic  chips. 


i  ne  crtuat  device  has  the  architecture~shown  in  Figure  6.  It  is 
based  on  a  128x21  PROM  array,  providing  128  states  of  21  bits 
each.  The  128  states  will  include  the  states  WRCMD  and  its 
following  states  from  A  through  I.  Eight  of  the  twenty-one  bits  in 
each  state  are  the  outputs  of  the  device,  providing  signals  such 
as  the  Write  and  Ready  signals. 

The  other  thirteen  bits  in  each  state  feed  back  internally  for  next 
state  selection.  Six  of  the  feedback  signals  feed  back  into  a  PAL 
array,  which  is  also  fed  by  eight  external  inputs.  These  external 
inputs  are  the  condition  inputs,  such  as  the  Request  and  Online 
signals,  which  determine  the  next  state.  The  feedback  signals 
determine  which  of  these  eight  inputs  will  determine  the  next 
branch. 

The  outputs  of  the  PAL  array  are  two  address  lines  to  the  PROM 
array.  Before  reaching  the  PROM  array,  however,  these  signals 
go  through  two  Exclusive-OR  (XOR)  gates  controlled  by  feed- 
back signals.  These  gates  are  used  by  the  software  to  optimize 
the  fit  of  the  design  within  the  device,  and  will  be  described  later. 
The  other  five  address  lines  to  the  PROM  array  are  fed  back 
directly  from  the  previous  state.  Thus,  two  of  the  lines  can  be 
changed  by  input  conditions,  providing  four-way  branching  capa- 
bility from  any  state. 

Expanding  Inputs  for  the  PROSE 
Device 

The  PROSE  device  has  a  balanced  architecture  of  eight  inputs 
and  eight  outputs.  However,  we  already  know  that  we  need  more 
than  eight  inputs  in  the  design.  The  eight  bits  of  the  command 
code  from  the  peripheral  controller  need  to  be  decoded,  along 


Figure  6.  PMS14R21  Architecture 


ana  hue  Mark  signals  already  mentioned.  In  addition  to  these 
signals,  the  sequencer  must  monitorthe  End  of  Tape  and  Begin- 
ning of  Tape  signals  from  the  tape  drive,  making  atotal  of  fourteen 
inputs. 

A  simple  solution  is  encoding  the  eight  different  eight-bit  com- 
mands into  a  three-bit  command  code  before  reaching  the 
PROSE  device.  This  can  be  easily  done  in  almost  any  20-pin  PAL 
device.  With  a  PALdevice  as  the  solution,  we  can  easily  add  even 
more  logic  surrounding  the  sequencer,  including  drive  selection. 
Drive  selection  logic  requires  that  more  commands  be  added  to 
the  original  eight,  forcing  the  use  of  four  bits  for  the  encoded 
commands.  Three  of  the  independent  status  signals  from  the 
tape  drive  can  also  be  combined  in  the  PALdevice  (Beginning  of 
Tape,  End  of  Tape,  and  File  Mark),  providing  one  Status  signal. 
(Early  Warning  Hole  remains  a  separate  input.) 

The  resulting  logic  block  fits  easily  into  a  simple  PAL20L8  combi- 
natorial 24-pin  PAL  device  (Figure  7).  Several  versions  of  this 
device  are  available  to  choose  from,  including  a  CMOS  device 
offering  zero  standby  power,  and  high-speed  devices  at  15 
nanoseconds  or  less  propagation  delay. 

The  encoding  of  the  command  codes,  including  drive  selection, 
is  shown  in  Figure  8.  Now  we  must  write  the  equations  for 
PALASM  2  software  for  implementation  in  a  PAL20L8.  Writing  the 
equationsfor  each  output  is  a  simple  matter;  active-low  equations 
are  written  where  each  product  term  is  one  of  the  commands  in 
which  that  particular  command  code  bit  is  zero.  For  example,  the 
equation  for  CC3  is  simply  the  sum  of  the  products  defining  the 
Write,  Read,  Read  File  Mark,  and  Write  File  Mark  commands. 
The  equation  would  be: 


/CC3      =     D7VD6*   D5*  D4*   D3*   D2*  Dl*   DO  (Write) 
+   /D7*   D6*   D5*   D4*   D3*   D2*   Dl*   DO  (Read) 
+   /D7*    D6*/D5*    D4*    D3*    D2*    Dl*    DO  (Read 
File  Mark) 

+  D7VD6VD5*  D4*  D3*  D2*  Dl*  DO  (Write 
File  Mark) 

where  /  means  inversion,  *  means  AND,  and  +  means  OR.  The 
equations  can  be  minimized  either  by  observation  or  by  running 
them  through  the  program  MINIMIZE  in  the  PALASM  2  software 
suite.  The  full  PALASM  2  file  can  be  simulated  for  design 
verification,  and  then  processed  to  create  a  JEDEC  file  for 
•  sdo  programming. 


CC0-CC3 


STATUS 


Figure  7.  PAL  Encoding  Logic  Block. 
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Expanding  Outputs  for  the  PROSE 
Device 

The  section  of  the  design  that  we  looked  at  in  detail  required  five 
output  signals:  Ready,  Invalid  Command,  Write,  End  of  Media, 
and  Write  File  Mark.  Other  required  outputs  include  Read,  Read 
File  Mark,  Fast  Forward,  Rewind,  Erase,  Initialize,  and  Drive 
Selected.  These  twelve  outputs  already  extend  us  beyond  the 
eight  outputs  available  in  the  PROSE  device.  In  addition,  six 
Enable  Status  bits  are  required,  making  a  total  of  eighteen 
outputs  required. 

A  simple  solution  can  again  be  provided  by  a  programmable  logic 
device,  to  decode  outputs  from  the  PROSE  device  into  the 
required  output  signals.  The  requirement  for  many  outputs  is  met 
by  two  PAL  devices,  the  PAL6L16  and  PAL8L14.  However,  even 
faster  solutions  are  provided  by  two  PLE  (PROM  as  Logic 
Element)  devices,  the  PLE5P16  and  PLE6P16.  These  devices 
are  specialized  PROMs,  both  with  sixteen  outputs.  The  PLE5P1 6 
offers  the  highest  speed  at  1 8  ns  propagation  delay,  and  provides 
an  output  enable,  which  can  be  useful  when  testing  the  tape  drive 
independent  of  the  sequencer  logic. 


In  the  PLE5P1 6  we  want  to  decode  four  outputs  from  the  PROSE 
device  into  fourteen  signals.  The  four  signals  to  be  left  as  direct 
outputs  from  the  PROSE  device  can  be  the  signals  used  more 
often,  or  those  that  require  higher  speed.  In  this  case,  Ready, 
Drive  Selected,  Erase,  and  Initialize  are  left  direct,  while  the  other 
signals  are  decoded  through  the  PLE  device.  The  equations  for 
the  decoding  are  arbitrary,  as  long  as  every  signal  in  asserted  by 
its  own  code.  (None  of  these  signals  are  to  be  asserted  at  the 
same  time.)  The  decoding  is  shown  below. 

The  generation  of  the  equations  is  similar  to  that  shown  previ- 
ously. For  example, 

STSO    -    /CD3VCD2VCD1*  CDO 

is  the  decoding  for  asserting  the  Enable  Status  byte  0  signal. 

However,  this  time  PLEASM  software  is  used  for  design  entry. 
The  design  method  is  equivalent  to  that  for  PAL  devices  in 
PALASM  software,  butthe  software  creates  a  programming  file  in 
the  PROM  format. 


COMMAND 

QIC-02  COMMAND 

PAL  COMMAND  CODE 

D7 

D6 

D5 

D4       D3  D2 

D1 

DO 

CC3 

CC2 

CC1 

CC0 

Invalid  Command" 

X 

X 

X 

XXX 

X 

X 

1 

1 

Select  Drive  1* 

1 

1 

0 

1 

0 

Select  Drive  2* 

1 

0 

1 

0 

Select  Drive  3* 

111110  11 

1 

0 

Select  Drive  4* 

11110  111 

1 

0 

Read  Status 

0          0  111111 

0 

1 

1 

Beginning  of  Tape 

1        ■)        0        1        1        1  10 

0 

1 

0 

Erase  Entire  Tape 

110          1110  1 

0 

0 

1 

Initialize  Tape 

110          110  11 

0 

0 

0 

Write 

10  111111 

0 

0 

1 

1 

Read 

0 

1 

0 

0 

1 

0 

Read  File  Mark 

0 

0 

0 

0 

0 

1 

Write  File  Mark 

1 

0 

0 

0 

0 

0 

0 

*  If  the  drive  is  the  incorrect  drive  number,  the  four  bit  command  code  is  1101 
*'  Any  command  code  not  shown  above  will  result  in  an  illegal  command  code. 


Figure  8.  PAL  Command  Codes  Table. 
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CD3 

CD2    CD1  CDO 

SIGNAL  ASSERTED 

0 

0        0  0 

No  Command 

0 

0        0  1 

STS0  (Enable  Status  byte  0) 

0 

0         1  0 

STS1  (Enable  Status  byte  1) 

0 

0         1  1 

STS2  (Enable  Status  byte  2) 

0 

1         0  0 

STS3  (Enable  Status  byte  3) 

0 

1         0  1 

STS4  (Enable  Status  byte  4) 

0 

1         1  0 

STS5  (Enable  Status  byte  5) 

0 

1      1  1 

EOM  (End  of  Media) 

1 

0        0  0 

RFM  (Read  File  Mark) 

1 

0        0  1 

WFM  (Write  File  Mark) 

1 

0         1  0 

READ  (Read  Enable) 

1 

0         1  1 

WRITE  (Write  Enable) 

1        0        0       FFWD  (Fast  Forward) 

1         0  1 

RWD  (Rewind) 

1         1  0 

ICMD  (Invalid  Command) 

1         1  1 

No  Command 

Figure  9.  PLE  Device  Decoding 

PROSE  Design  Specification 

We  are  now  ready  to  discuss  the  details  of  the  design  specifica- 
tion for  the  PROSE  device.  The  software  used  is  a  part  of  the 
PALASM  2  software  suite  called  PROASM  for  PROSE  Assem- 
bler. It  requires  almost  no  additional  work  on  the  part  of  the 
designer;  it  simply  requires  a  description  of  the  state  diagram. 
PROASM  has  the  following  file  format: 

•  Declaration  Section 

•  State  Section 

•  Conditions  Section 

Declaration  Section 

The  Declaration  Section  is  the  location  of  the  general  information 
about  the  design,  including  the  design  engineer's  name  and 
company.  It  provides  for  device  selection;  in  this  case,  we  are 
using  the  PROSE  device,  which  has  the  part  number  PMS1 4R21 . 
It  also  allows  the  definition  of  all  of  the  inputs  and  output  signal 
names.  Remember,  four  of  the  inputs  are  PAL  device  outputs 
(CC0-CC3),  and  four  of  the  outputs  feed  inputs  to  the  PLE  device 
(CD0-CD3). 


QIC-02  COMMAND 
1 
B 

WON 
MMI 

JANUARY  19,  1987 
QJC_02_CMD_DEC  PMS14R21 

12  3  4 
CLK  DCLK  /ONLINE  /REQUEST 

5        6        7  8        9  10 

/CC0  /CC1  /CC2  /CC3  STATUS  EWH 
11  12 
SDI  GND 

13  14     15  16     17  18 
PRESET  SDO  CD 3  CD2  CD1  CDO 

19  20         21  22 

INITIALIZE  ERASE  DRSELD  READY 
23  24 
MODE  VCC 


TITLE 
PATTERN 
REVISION 
AUTHOR  KEN 
COMPANY 
DATE 

CHIP 

; INPUT  PIN 

; INPUT  PIN 
; INPUT  PIN 

; OUTPUT  PIN 

; OUTPUT  PIN 
; OUTPUT  PIN 

State  Section 


The  next  part  of  the  design  file  is  the  State  section.  This  section 
begins  with  the  keyword  STATE.  The  first  item  required  is 
selection  of  either  a  Moore  or  a  Mealy  type  state  machine.  As 
discussed  previously,  a  Moore  type  state  machine  is  required. 

STATE 

MOORE_MACHINE 

Next,  several  general  or  default  values  are  specified.  First  is 
selection  of  the  programmable  Preset/Enable  function  pin.  Either 
can  be  used:  in  this  case,  Preset  is  desired,  so  the  keyword 
MASTER_RESET  is  entered. 

MASTER_RESET 

Next  are  global  default  output  values.  Global  default  output 
values  are  the  values  to  be  assigned  to  the  outputs  when  the 
design  specification  does  not  specify  a  particular  value.  This  is 
not  used,  so  no  default  output  function  is  entered.  However,  the 
design  file  can  be  simplified  by  specifying  a  default  value  for  the 
outputs  (either  High  or  Low),  and  then  only  specifying  the  output 
value  when  it  differs  from  the  default  value. 

Next  are  global  default  transitions  or  branches.  Global  default 
branches  are  the  branches  to  be  taken  when  no  branch  is 
specified  in  the  design  file.  This  is  a  method  of  not  only  simplifying 
the  design  file,  but  also  guaranteeing  that  there  is  atransition  from 
every  state  for  every  input  condition.  In  this  design,  the  default  is 
chosen  to  be  HOLD_STATE;  this  means  that  the  sequencer  will 
remain  in  the  same  state  if  no  other  transition  is  enabled. 

DEFAULT_BRANCH  HOLD_STATE 

Next  is  the  specification  of  the  state  to  enter  after  power-up.  The 
PROSE  device  has  power-up  preset,  whereby  all  twenty-one  flip- 
flops  power  up  into  the  logic  One  state.  This  state  is  called 
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POWERJJP.  The  transition  from  this  state  (on  the  first  clock 
edge)  shows  how  transition  equations  are  written.  The  format  is 

STATE_X   :-  CONDITIONJV  ->  STATE_Y 

meaning  that  when  in  State  X,  if  condition  A  is  true  the  machine 
will  transition  to  state  Y.  In  this  design,  we  want  to  transition  to 
state  INIT  on  any  condition,  so  the  equation  is  written 

POWER_UP  :=  VCC  ->  INIT 

where  VCC  means  unconditionally. 

The  first  transition  in  the  Write  command  state  diagram  requires 
two  branches.  The  local  default  branch  is  indicated  by  the  +  sign, 
which  is  the  "otherwise"  operator.  Thus,  from  state  WRCMD, 
we  have 

WRCMD   :-  REQ  ->  WRCMD 
+->  WRCMDA 

which  means  "if  REQ  (Request)  is  true,  go  to  state  WRCMD  (stay 
in  the  same  state),  otherwise  (if  REQ  is  not  true)  go  to  state 
WRCMDA." 

The  remaining  transition  equations  are  written  similarly.  In  effect, 
the  arrows  in  the  state  diagram  are  directly  transcribed  into 
arrows  in  the  equations. 

WRCMDA  :=  ONLN  ->  WRCMDB 
+->  CMDERRA 

WRCMDB  :=  EW       ->  WRCMDC 
+->  WRCMDE 

WRCMDC  :-  VCC     ->  WRCMDD 

WRCMDD   :-  VCC     ->  DRVSELD 

WRCMDE   :=  REQ     ->  DRVSELD 
+->  WRCMDF 

WRCMDF   :=  ONLN  ->  WRCMDB 
+->  WRCMDG 

WRCMDG  :-  VCC     ->  WRCMDH 

WRCMDH   :=  STS     ->  WRCMDI 
+->  WRCMDH 

WRCMDI   :=  VCC     ->  BOTCMDA 

Note  that  the  equation  for  WRCMDH  does  not  need  the  second 
line;  the  global  default  transition  is  to  stay  in  the  same  state,  so  it 
does  not  need  to  be  specified  here. 

Output  Definition 

The  next  task  is  to  define  the  eight  outputs  for  each  of  these 
states.  Outputs  are  defined  in  the  form  of  an  equation.  For 
example, 

STATE  X.OUTF  :-  OUT  1  *  /OUT  2 


Output  equations  for  the  Write  command  section  are  written 
according  to  the  state  diagram.  In  all  of  the  Write  states,  Drive 
Selected  (DRVSELD)  remains  active.  Thus,  in  state  WRCMD 
only  Drive  Selected  and  the  Ready  signal  are  active,  telling  the 
peripheral  controller  that  the  Write  command  has  been  decoded 
and  is  ready  to  be  executed.  The  resulting  output  equation  for 
state  WRCMD  is 

WRCMD. OUTF  :=      READYVCD3  VCD2  VCD1  VCDO 
*/ERASE*/INITIALIZE*  DRSELD 

The  other  states  have  similar  output  equations,  translating  the 
required  output  signal  to  the  appropriate  encoding  for  the  PLE 
device  to  decode. 

WRCMDA. OUTF   :-  /READYVCD3  VCD2  VCD1  VCDO 
VERASEVINITIALIZE*  DRSELD 

WRCMDB. OUTF   :-  /READY*  CD3*/CD2*  CD1*  CDO 
VERASEVINITIALIZE*  DRSELD 

WRCMDC. OUTF   :-  /READYVCD3VCD2VCD1 VCDO 
VERASEVINITIALIZE*  DRSELD 

WRCMDD. OUTF   :=  /READY* /CD3*  CD2*  CD1*  CDO 
VERASEVINITIALIZE*  DRSELD 

WRCMDE. OUTF         /READY*  CD3VCD2*  CD1*  CDO 
VERASEVINITIALIZE*  DRSELD 

WRCMDF . OUTF  :-  /READY*  CD3VCD2*  CD1*  CDO 
VERASEVINITIALIZE*  DRSELD 

WRCMDG  -  OUTF   :=  /READYVCD3VCD2VCD1  VCDO 
VERASEVINITIALIZE*  DRSELD 

WRCMDH. OUTF   :=  /READY*  CD3VCD2 VCD1*  CDO 
VERASEVINITIALIZE*  DRSELD 

WRCMDI. OUTF  :=  /READYVCD3VCD2VCD1  VCDO 
VERASEVINITIALIZE*  DRSELD 

Again,  if  default  output  values  were  used,  these  equations  could 
be  extremely  simplified. 

Conditions  Section 

The  final  section  is  the  Conditions  section,  designated  by  the 
keyword  Conditions.  This  section  defines  the  conditions  used  in 
the  state  equations.  In  this  design,  the  conditions  are  simply  the 
inputs  to  the  PROSE  device,  and  are  straightforward: 

CONDITIONS 

ONLN  =  ONLINE 
STS  =  STATUS 
EW  -  EWH 
REQ  =  REQUEST 

Other  conditions  used  in  the  design  file  are  also  placed  in  this 
Conditions  section. 


means  that  in  state  X,  output  1  will  be  High  and  output  2  will  be 
Low.  If  other  outputs  exist,  they  will  be  assigned  their  default 

values. 


25  26 


The  other 
manner, 
ware.  A 
verification, 
PROM  and 
to  the 
logic  withi 


sections  of  the  design  file  are  completed  in  a  similar 
the  whole  design  is  assembled  by  PROASM  soft- 
feature  of  PROASM  software,  in  addition  to  design 
is  automatic  state  assignment.  State  locations  in  the 
conditional  product  terms  in  the  PAL  array  are  used 
possible  efficiency,  to  provide  the  best  fit  of  the 
the  device. 


and 
key 


greatest 


The  automatic  fitting  is  easily  seen  in  Figure  10.  Each  individual 
state  can  branch  to  one  of  four  locations:  these  four  locations  form 
a  "primary  location."  For  example,  the  hex  addresses  1  A,  3A,  5A, 
and  7Aform  a  primary  location,  which  can  be  designated  by  the 
smallest  address,  1  A. 


The  software  optimizes  use  of  these  primary  locations  to  elimi- 
nate excessive  duplication  of  states  in  different  primary  locations. 
For  example,  the  state  WRCMD  is  duplicated  in  primary  location 


1A 

RFMCMD 

3A"  \ 

/  wrcmd\ 

5A 

WFMCMD 

7A 

yRDCMry 

19 

RDCMDA 

39 

RDCMD 

59 

79 

A 

WRCMD 

5 

CM D ERR A 

45 

65 

:  • 

\ 

23 

:0 

43 

CMDERRB 

63 

CMDERRC 

4 

24  

:  f 

DRVSELD 

44  , 

64 

:  g 

RDSTSA 

22^. 
7  D  j 

42 

62 

416  10 

Figure  10.  State  Assignments  for  the  Six  Primary  PROM 
Locations  Used  for  the  Write  Command  Sequence. 
Arrow  Indicate  Potential  State  Transitions,  to  a  Set  of 
Circled  Next  States.  All  Next  States  Must  be  In  the  Same 
Primary  Location. 


WHCMU  itself  or  WRCMDA  (or  A  in  the  shortened  form).  The 
branch  cannot  be  made  to  the  primary  location  where  WRCMD 
already  exists,  because  the  other  three  locations  have  already 
been  used.  Thus,  WRCMD  must  be  duplicated  in  anotherprimary 
location  which  has  an  open  location  for  state  A. 

The  software  then  puts  the  two-way  branch  from  state  A  in  primary 
location  5.  It  then  makes  use  of  the  other  half  of  the  primary 
location  for  the  two-way  branch  from  state  B.  However,  when  it 
then  assigns  the  two-way  branch  from  state  Fto  states  B  or  G,  it 
must  duplicate  B  in  primary  location  4  since  there  is  no  room  left 
in  its  primary  location  (5)  for  G. 

Note  that  the  software  avoids  duplication  of  state  DRVSELD, 
however.  The  two-way  branch  from  state  E  to  states  DRVSELD 
or  F  is  placed  in  the  other  half  of  primary  location  4.  When 
DRVSELD  is  branched  to  again  from  state  D,  the  same  location 
can  be  used. 

This  duplication  avoidance  is  where  the  XOR  gates  are  used.  To 
place  a  branch  in  the  lower  order  half  of  a  primary  location,  the 
branch  addresses  must  be  00  or  01.  This  is  accomplished  by 
making  the  XOR  feedback  00  and  allowing  a  PAL  product  term  to 
determine  whetherthe  second  address  bit  should  be  a  1 .  To  use 
the  higher  order  half  of  a  primary  location  (branch  addresses  10 
or  11),  the  XOR  feedback  would  be  made  10  and  again  a  PAL 
product  term  would  be  used  to  determine  the  other  bit.  XOR 
feedback  is  also  used  to  determine  "right  or  left  side"  of  a  primary 
location,  or  to  use  a  product  term  that  already  exists  on  one  side 
of  the  PAL  array. 

The  PROASM  software  provides  detailed  implementation  re- 
sults, showing  how  the  states  and  product  terms  were  assigned 
for  every  state  and  transition. 

PROSIM  Simulation 

The  entire  sequencer  can  be  simulated  by  PROSIM  software,  to 
verify  the  design.  This  can  be  done  before  any  parts  are 
programmed.  Simulation  results  can  be  added  to  the  program- 
ming file  to  provide  automatic  testing  after  programming. 

Completed  Design 

The  complete  design  required  less  than  a  weekto  implement  from 
scratch.  Two  days  were  required  to  define  the  application.  Only 
one  day  was  required  to  write  the  PROSE  design  specification. 
Simulation  and  design  checking  took  an  additional  day.  Only  half 
a  day  each  were  required  for  the  PAL  and  PLE  device  designs. 
This  included  revisions  to  the  PAL  and  PROSE  devices  to  fix 
minor  design  errors.  The  complete  design  is  shown  in  Figure  1 1 , 
and  the  design  files  are  shown  on  the  following  pages. 
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Conclusion 

Programmable  logic  devices  were  once  the  glue  logic  that  tied 
together  the  intelligence  of  a  digital  system.  Today,  advanced 
chips  such  as  the  PROSE  device  are  capable  of  implementing 
distributed  control  functions  and  make  up  the  heart  of  the  control 
of  the  system.  With  complementary  advanced  design  tools 
available  with  PALASM  2  software,  PLDs  are  an  effective  ap- 
proach to  even  the  most  complex  tasks. 


The  PROSE  device  has  been  designed  to  implement  a  complex 
tape  drive  sequencer.  Design  implementation  in  the  PROSE 
device  is  made  easy  for  several  reasons,  including: 

1 .  Design  file  requires  merely  transcription  of  the  state  diagram 

2.  Software  automatically  optimizes  fit 

3.  Programmability  of  the  device  allows  design  errors  to  be  fixed 
quickly 

As  a  result,  the  complete  tape  drive  sequencer  required  only  four 
days  to  design  and  implement. 
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•  DRVSELD 

■  ERASE 

■  INITIALIZE 

■  STS0 

■  STS1 

•  STS2 

•  ST  S3 

•  STS4 

•  STS5 

•  RFM 

■  WFM 

•  READ 

•  WRITE 

■  FFWD 

•  RWD 

■  INVD  CMD 

•  EOM 


ROT  EWH 


Figure  11.  QIC-02  Command  Sequencer 
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THIS  PAL  DEVICE  IS  PART  OF  THE  QIC-02  COMMAND  SEQUENCER 
DESIGN.   THE  PRIMARY  PURPOSE  OF  THIS  PAL  DEVICE  IS  TO  ENCODE 
8  BIT  COMMANDS  INTO  4  BIT  COMMAND  CODES .      IT  IS  ALSO  USED  TO 
ENCODE  TAPE  DRIVE  STATUS  SIGNALS  AND  SELECT  THE  DRIVE  NUMBER. 


TITLE  QIC-02  COMMAND  DECODER  PAL 
PATTERN 

REVISION  B 

AUTHOR  KEN  WON 

COMPANY  MONOLITHIC  MEMORIES 

DATE  JANUARY   26,  1987 


CHIP     QIC_02_CMD_PAL  PAL20L8 

D7   D6  D5  D4   D3  D2  Dl  DO  DSO  DS1   FM  GND 

BOT  EOT  NC  /DSELD  /CC2A  /STATUS  CC3  CC2  CC1  CCO  NC  VCC 


EQUATIONS 

STATUS     =  /EOT*/BOT*/FM 

/CCO         ■  D7*D6*/D5*D4*D3*D2*D1*/D0 

+  D7*D6»/D5*D4*D3*/D2*D1*D0 

+  D7*/D6*/D5*D4*D3*D2«D1*D0 

+  /D7*D6*D5*D4*D3*D2*D1*D0 

+  D7*D6*D5*D4*DSELD 


; ASSERT  STATUS  TO  PMS14R21 

BOT  COMMAND 
INITIALIZE 
WRITE  FILE  MARK 
READ 

SELECT  DRIVE 


/CC1         =  D7*D6*/D5*D4*D3*D2*/D1*D0 

+  D7*D6*/D5*D4*D3*/D2*D1*D0 

+  D7*/D6*/D5*D4*D3*D2*D1*D0 

+  /D7*D6*/D5*D4*D3*D2*D1*D0 

+  D7*D6*D5*D4*/DSELD 


ERASE 
INITIALIZE 
WRITE  FILE  MARK 
READ  FILE  MARK 
UNSELECT  DRIVE 


/CC2 


/CC3 


D7*D6*/D5*D4 

D7*D6*/D5*D4 

D7*D6*/D5*D4* 

D7*/D6*D5*D4* 

D7*/D6*/D5*D4 

/D7*D6*/D5*D4 

/D7*D6*D5*D4* 

/D7*/D6*D5*D4 


D3*D2* 
D3*D2« 
D3*/D2 
D3*D2* 
«D3*D2 
*D3*D2 
D3*D2* 
*D3*D2 


D1*/D0 
/D1*D0 
D1*D0 
D1*D0 
*D1*D0 
•D1*D0 
D1*D0 
*D1*D0 


=  D7*/D6*D5*D4*D3*D2*D1*D0 
+  D7*/D6*/D5*D4*D3*D2*D1*D0 
+  /D7*D6*D5*D4*D3*D2*D1*D0 
+  /D7*D6*/D5*D4*D3*D2*D1*D0 


BOT  COMMAND 
ERASE 

INITIALIZE 
WRITE 

WRITE  FILE  MARK 
READ  FILE  MARK 
READ 

READ  STATUS 
(MINIMIZATION  REDUCES 
PRODUCT  TERMS) 
WRITE 

WRITE  FILE  MARK 
READ 

READ  FILE  MARK 


DSELD       =  /DO*/DS1*/DSO*D1*D2*D3  ;  DRIVE  1  SELECT 

+  /D1*/DS1*DS0*D0*D2»D3  ;  DRIVE  2  SELECT 

+  /D2*DS1*/DS0*D0*D1*D3  ;  DRIVE  3  SELECT 

+  /D3*DS1*DS0*D0*D1*D2  ;  DRIVE  4  SELECT 

SIMULATION 


;  THIS  SIMULATION  FILE  TESTS  THE  OUTPUT 
;    COMMAND  INPUT 

TRACEON  D7  D6  D5  D4  D3  D2  Dl  DO  FM  EOT 

SETF  D7  D6  /DS  D4  D3  D2  Dl  /DO 

SETF  D7  D6  /D5  D4  D3  D2  /Dl  DO 

SETF  D7  D6   /D5  D4  D3  /D2  Dl  DO 

SETF  D7   /D6  D5  D4  D3  D2  Dl  DO 

SETF  D7   /D6  /D5  D4  D3  D2   Dl  DO 

SETF  /D7   D6  D5  D4  D3  D2  Dl  DO 

SETF  /D7   D6  /D5  D4   D3  D2  Dl  DO 

SETF  /D7  /D6  D5  D4   D3  D2  Dl  DO 

SETF  /D7   D6  D5  D4  /D3  /D2  Dl  DO 

SETF  EOT 

SETF  /EOT 

SETF  BOT 

SETF  /BOT 

SETF  FM 

SETF  /FM 

SETF  DS1  /DSO 

SETF  D7  D6  D5  D4  D3  /D2  Dl  DO 
SETF  D7  D6  D5  D4  D3  D2  /Dl  DO 
TRACE  OFF 


FOR  ALL  VALID  AND  ONE  INVALID 


BOT  CCO  CC1  CC2  CC3  /STATUS  /DSELD 

; BEGINNING  OF  TAPE 

; ERASE 

; INITIALIZE 

;WRITE 

;WRITE  FILE  MARK 
;  READ 

; READ  FILE  MARK 

; READ  STATUS 

; INVALID  COMMAND 

; ASSERT  EOT 

; NEGATE  EOT 

; ASSERT  BOT 

; NEGATE  BOT 

; ASSERT  FM 

; NEGATE  FM 

;SET  DRIVE  NUMBER  TO  3 
; SELECT  DRIVE  3 
; UNSELECT  DRIVE  3 


CI  Monolithic  KKfl] 
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;THIS   IS  A  SECOND  REVISION  OF  AN  APPLICATION  FOR  THE  PROSE  DEVICE. 
; THE  ORIGINAL  APPLICATION  IS  CALLED  14R21.APP.     THIS  APPLICATION 
; IMPLEMENTS  A  QIC-02  COMMAND  DECODER  USING  A  PROSE  DEVICE,   A  PAL 
.•DEVICE,   AND  A  PLE  DEVICE.     THE  PROSE  DEVICE  FILE  IS  CALLED 
;CMD_PAL.APP  AND  THE  PLE  DEVICE  FILE  IS  CALLED  CMD  PLE.APP. 

TITLE  QIC-02  COMMAND  DECODER 
PATTERN 

REVISION  B 

AUTHOR  KEN  WON 

COMPANY  MMI 

DATE  JANUARY   19,  1987 

CHIP         QIC02_CMD_DEC  PMS14R21 

CLK  DCLK  /ONLINE  /REQUEST  /CCO  /CC1  /CC2  /CC3  STATUS  EWH  SDI  GND 
PRESET  SDO  CD3  CD2  CD1  CDO   INITIALIZE  ERASE  DRSELD  READY  MODE  VCC 

STATE 

MOOREMACHINE 
MASTERRESET 

DEFAULTBRANCH  HOLDSTATE 
POWER_UP  :=VCC->INIT 

;@@ia@@g  equations  @eee@§ 


.******  DRIVE  SELECTION  ****** 

INIT  :=  REQ  ->  DRVSELDA  ;  IF  /REQUEST  IS  TRUE,  CHECK 

;    FOR  DRIVE  SELECTION 


+  -> 

INIT 

OTHERWISE  GOTO  INIT 

DRVSELDA 

:=  CONDO 

-> 

DRVSELDB 

+  -> 

INIT 

DRVSELDB 

:=  CONDI 

-> 

INIT 

+  -> 

DRVSELDC 

DRVSELDC 

:=  COND2 

-> 

INIT 

+  -> 

DRVSELDD 

DRVSELDD 

:=  COND3 

-> 

INIT 

+-> 

DRVSELDE 

; DRIVE   IS  SELECTED 

DRVSELDE 

:=  REQ 

-> 

DRVSELDE 

;WAIT  FOR  REQ  DEACTIVATED 

+  -> 

DRVSELDF 

; THEN  GOTO  DRVSELDF 

DRVSELDF 

:=  VCC 

-> 

DRVSELD 

; DRIVE  SELECTION  COMPLETE 

INIT. OUTF :=  /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*/DRSELD 
DRVSELDA . OUTF : =  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*/DRSELD 
DRVSELDB. OUTF :=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*/DRSELD 
DRVSELDC. OUTF :=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*/DRSELD 
DRVSELDD. OUTF :=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*/DRSELD 
DRVSELDE. OUTF : =  /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*/DRSELD 
DRVSELDF. OUTF :=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*/DRSELD 
DRVSELD. OUTF :=  /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
.******  COMMAND  DECODING  ****** 


DRVSELD 

REQ 

-> 

CMDDECA 

IF  /REQUEST  IS  TRUE, 
DECODE  COMMAND 

COND3 

+  -> 

DRVSELD 

OTHERWISE  GOTO  DRVSELD 

CMDDECA 

«= 

-> 
+  -> 

RDWRCMD 
CMDDECB 

IF  CC3   IS  TRUE,  GOTO 
READ/WRITE  DECODING 

OTHERWISE  CONTINUE 
COMMAND  DECODING 

RDWRCMD 

COND2 

-> 
+  -> 

RDWRCMDA 
CMDERR 

IF  CC2   IS  TRUE,  GOTO 
READ/WRITE  DECODING 
OTHERWISE  GOTO  CMDERR 

RDWRCMDA 

j  = 

CONDI 

-> 
+  -> 

FMCMD 
RDWRCMDB 

IF  CC1   IS  TRUE,  COMMAND 
IS  A  FILE  MARK  COMMAND 
OTHERWISE  READ  OR  WRITE 

RDWRCMDB 

CONDO 

-> 
+  -> 

RDCMD 
WRCMD 

IF  CCO  TRUE,   COMMAND  READ 
OTHERWISE  COMMAND  IS  WRITE 

FMCMD 

CONDO 

-> 
+  -> 

WFMCMD 
RFMCMD 

IF  CCO   IS  TRUE,   COMMAND  IS 

WRITE  FILE  MARK 
OTHERWISE  COMMAND  IS  READ 

FILE  MARK 

CMDDECB 

:  ■ 

COND2 

-> 
+-> 

CMDDECC 
DESELDA 

IF  CC2   IS  TRUE,  CONTINUE 

COMMAND  DECODING 
OTHERWISE  CHECK  FOR 

DESELECTING  DRIVE 

CMDDECC 

CONDI 

-> 
+  -> 

CMDDECD 
CMDDECE 

CONTINUE  DECODING  COMMAND 
CONTINUE  DECODING  COMMAND 

CMDDECD 

CONDO 

-> 

+  -> 

INITCMD 
ERASECMD 

IF  CCO  TRUE,   COMMAND  INIT 
OTHERWISE  COMMAND  IS  ERASE 

CMDDECE 

CONDO 

-> 
+  -> 

BOTCMO 
RDSTSCMD 

IF  CCO  TRUE,   COMMAND  BOT 
OTHERWISE  COMMAND  IS  READ 
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CMDDECA.OUTF:=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
CMDDECB.OUTF:=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
CMDDECC.OUTF:=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
CMDDECD.OUTF:=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
CMDDECE.OUTF:=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
RDWRCMD.OUTF:=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
RDWRCMDA . OUTF : =  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
RDWRCMDB . OUTF : =  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
FMCMD.OUTF:=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 


.******  DESELECT  DRIVE  ****** 
DESELDA  :=  CONDI 

DESELDB  :=  CONDO 

DESELDC  :=  CONDO 

REQ 
VCC 


DESELDD 

DESELDE 

DESELDA 

OUTF:= 

DESELDB 

OUTF:= 

DESELDC 

OUTF:= 

DESELDD 

OUTF : * 

DESELDE 

OUTF:= 

.****** 

INVALID 

CMDERR 

CMDERRA 

CMDERRB 

CHDERRC 

->  DESELDB  ;  CONTINUE  CHECKING  FOR 

;   DESELECT  COMMAND 
+->  DESELDC  OTHERWISE  GOTO  DESELDA 

->  CMDERR  ;IF  CCO   IS  TRUE,  COMMAND 

;    IS  INVALID 
+->  DESELDD  ; OTHERWISE  DESELECT  DRIVE 

->  DRVSELDE  ;IF  CCO   IS  TRUE,   GOTO  DRIVE 

;  SELECTION 

♦->  CMDERR  .-OTHERWISE  COMMAND  INVALID 

->  DESELDD  ;WAIT  FOR  REQ  DEACTIVATED 

+->  DESELDE  ; THEN  GOTO  DESELDE 

->   INIT  ;GOTO  DRIVE  SELECTION 


/READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
/READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
/READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 


****** 


REQ 


VCC 
VCC 
VCC 


CMDERR  ;WAIT  FOR  REQ  DEACTIVATED 

CMDERRA  ;THEN  OUTPUT  INVALID 

;   COMMAND  CODE 
CMDERRB  ; INVALID  COMMAND  CODE 

CMDERRC  ; NEGATE  CODE 

DRVSELD  ;GOTO  COMMAND  DECODING 


CMDERR. OUTF: 
CMDERRA. OUTF 
CMDERRB. OUTF 
CMDERRC. OUTF 


/READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
■  READY*CD3*CD2*CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
=  READY*CD3*CD2*CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 


.**«»**  BOT  COMMAND  ****** 
BOTCMD  :=  REQ 


BOTCMDA 
BOTCMP 


=  STS 
=  VCC 


->  BOTCMD  ,-WAIT  FOR  REQ  DEACTIVATED 

+->  BOTCMDA  ;THEN  REWIND  COMMAND  CODE 

->  BOTCMP  ; WHEN  STS   IS  TRUE,  NEGATE 

;   OUTPUT  CODE 
->  DRVSELD  ;GOTO  DRVSELD 


BOTCMD. OUTF :=  /READY*/CD3*/CD2 */CDl */CD0* /ERASE*/ INITIALIZE* DRSELD 
BOTCMDA. OUTF : =  READY *CD3*CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
BOTCMP . OUTF : =  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 


.****** 


INITIALIZE  COMMAND  ****** 


INITCMD 

REQ 

-> 

INITCMD 

+  -> 

INITCMDA 

INITCMDA 

STS 

-> 

INITCMDB 

INITCMDB 

*VCC 

-> 

INITCMDC 

INITCMDC 

STS 

-> 

INITCMDD 

INITCMDD 

VCC 

-> 

INITCMDE 

INITCMDE 

STS 

-> 

INITCMP 

INITCMP 

VCC 

-> 

DRVSELD 

;WAIT  FOR  REQ  DEACTIVATED 

; THEN  OUTPUT  REWIND  COMMAND 

;   CODE  &  ASSERT  INITIALIZE 

; WHEN  STS  IS  TRUE,  NEGATE 

!   COMMAND  CODE 

; ASSERT  FFWD  COMMAND  CODE 

;WHEN  STS  IS  TRUE,  NEGATE 

;   COMMAND  CODE 

; ASSERT  REWIND  COMMAND  CODE 

; WHEN  STS   IS  TRUE,  NEGATE 

;   COMMAND  CODE  i  INITIALIZE 


INITCMD. OUTF: 
INITCMDA. OUTF 
INITCMDB. OUTF 
INITCMDC. OUTF 
INITCMDD. OUTF 
INITCMDE. OUTF: 
INITCMP. OUTF:= 


/READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
=  READY*CD3*CD2*/CD1*CD0*/ERASE*INITIALIZE*DRSELD 
=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*INITIALIZE*DRSELD 

■  READY*CD3*CD2*/CD1*/CD0*/ERASE*INITIALIZE*DRSELD 

=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*INITIALIZE*DRSELD 

■  READY*CD3*CD2*/CD1*CD0*/ERASE*INITIALIZE*DRSELD 
READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
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ERASE  COMMAND  ****** 


ERASECMD 

!  - 

REQ 

-> 

ERASECMD 

WAIT 

FOR 

/REQ  DEACTIVATED 

+-> 

ERASECMDA 

THEN 

REWIND  COMMAND  CODE 

ERASECMDA 

STS 

-> 

ERASECMDB 

WHEN 

STS 

IS  TRUE, 

NEGATE 

COMMAND 

CODE 

ERASECMDB 

VCC 

-> 

ERASECMDC 

ASSERT  ERASE  &  INITIALIZE 

ERASECMDC 

STS 

-> 

ERASECMDD 

WHEN 

STS 

IS  TRUE, 

NEGATE 

ERASE  AND  INITIALIZE 

ERASECMDD 

VCC 

-> 

ERASECMDE 

OUTPUT  REWIND  COMMAND  CODE 

ERASECMDE 

STS 

-> 

ERASECMDF 

WHEN 

STS 

IS  TRUE, 

NEGATE 

COMMAND 

CODE 

ERASECMDF 

VCC 

-> 

DRVSELD 

GOTO 

DRVSELD 

ERASECMD. OUTF:=  /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

ERASECMDA . OUTF : =  READY*CD3*CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 

ERASECMDB. OUTF:=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

ERASECMDC . OUTF : =  READY* /CD3*/CD2*/CD1 */CD0 * ERASE* INITIAL I ZE*DRSELD 

ERASECMDD. OUTF :=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

ERASECMDE . OUTF : =  READY*CD3*CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 

ERASECMDF. OUTF :=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 


.«*«***  READ  STATUS  COMMAND  ****** 


RDSTSCMD 

REQ 

-> 

RDSTSCMD 

WAIT  FOR  REQ  DEACTIVATED 

+-> 

RDSTSAA 

THEN  NEGATE  READY 

RDSTSAA 

VCC 

-> 

RDSTSA 

ENABLE  STATUS  BYTE  0 

RDSTSA 

VCC 

-> 

RDSTSB 

ASSERT  READY 

RDSTSB 

REQ 

-> 

RDSTSC 

WHEN  REQUEST  IS  ACTIVATED, 

NEGATE  READY 

RDSTSC 

REQ 

-> 

RDSTSC 

WAIT  FOR  REQUEST  TO  BE 

DEACTIVATED 

+  -> 

RDSTSD 

THEN  ENABLE  STATUS  BYTE  1 

RDSTSD 

VCC 

-> 

RDSTSE 

ASSERT  READY 

RDSTSE 

REQ 

-> 

RDSTSF 

WHEN  REQUEST  IS  ACTIVATED, 

NEGATE  READY 

RDSTSF 

REQ 

-> 

RDSTSF 

WAIT  FOR  REQ  DEACTIVATED 

+-> 

RDSTSG 

THEN  ENABLE  STATUS  BYTE  2 

RDSTSG 

:  = 

VCC 

-> 

RDSTSH 

ASSERT  READY 

RDSTSH 

:=  REQ 

-> 

RDSTSI 

WHEN  REQUEST  IS  ACTIVATED, 

NEGATE  READY 

RDSTSI 

:  = 

REQ 

-> 

RDSTSI 

WAIT  FOR  REQ  DEACTIVATED 

+  -> 

RDSTSJ 

THEN  ENABLE  STATUS  BYTE  3 

RDSTSJ 

:  = 

-> 

RDSTSK 

ASSERT  READY 

RDSTSK 

:=  REQ 

-> 

RDSTSL 

WHEN  REQUEST  IS  ACTIVATED, 

NEGATE  READY 

RDSTSL 

REQ 

-> 

RDSTSL 

WAIT  FOR  REQ  DEACTIVATED 

+-> 

RDSTSM 

THEN  ENABLE  STATUS  BYTE  4 

RDSTSM 

VCC 

-> 

RDSTSN 

ASSERT  READY 

RDSTSN 

REQ 

-> 

RDSTSO 

WHEN  REQUEST  IS  ACTIVATED, 

NEGATE  READY 

RDSTSO 

REQ 

-> 

RDSTSO 

WAIT  FOR  REQ  DEACTIVATED 

+  -> 

RDSTSP 

THEN  ENABLE  STATUS  BYTE  5 

RDSTSP 

VCC 

-> 

RDSTSQ 

ASSERT  READY 

RDSTSQ 

REQ 

-> 

RDSTSR 

WHEN  REQUEST  ACTIVATED, 

NEGATE  READY   &  COMMAND 

RDSTSR 

REQ 

-> 

RDSTSR 

WAIT  FOR  REQ  DEACTIVATED 

+  -> 

DRVSELD 

THEN  GOTO  DRVSELD 

RDSTSCMD. OUTF : =  /READY* /CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
RDSTSAA. OUTF :=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

RDSTSA. OUTF : =  READY*/CD3*/CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 

RDSTSB. OUTF : =  /READY*/CD3*/CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 

RDSTSC . OUTF : =  READY */CD3*/CD2*/CDl*CD0*/ERASE*/INITIALIZE*DRSELD 

RDSTSD . OUTF : =  READY*/CD3*/CD2*CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

RDSTSE . OUTF : =  /READY*/CD3*/CD2*CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

RDSTSF . OUTF : =  READY*/CD3*/CD2*CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

RDSTSG. OUTF :=  READY*/CD3*/CD2*CD1*CD0*/ERASE*/INITIALIZE*DRSELD 

RDSTSH. OUTF :=  /READY*/CD3*/CD2*CD1*CD0*/ERASE*/INITIALIZE*DRSELD 

RDSTSI .OUTF: =  READY*/CD3*/CD2*CD1*CD0*/ERASE*/INITIALIZE*DRSELD 

RDSTSJ. OUTF :=  READY*/CD3*CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

RDSTSK. OUTF :=  /READY*/CD3*CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

RDSTSL. OUTF :=  READY*/CD3*CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

RDSTSM. OUTF :=  READY*/CD3*CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 

RDSTSN. OUTF :=  /READY*/CD3*CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 

RDSTSO. OUTF :=  READY */CD3*CD2*/CDl*CD0*/ERASE*/INITIALIZE*DRSELD 

RDSTSP. OUTF :=  READY*/CD3*CD2*CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

RDSTSQ. OUTF. ■=  /READY* /CD3*CD2*CD1*/CD0* /ERASE* /INITIALIZE*DRSELD 

RDSTSR . OUTF : =  READY*/CD3*/CD2  */CDl*/CD0  */ERASE* /INITIAL I ZE*DRSELD 
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.******  WRITE  FILE  MARK  COMMAND  ****** 
WFMCMD  :=  REQ  ->  WFMCMD 


WFMCMDA 
WFMCMDB 


VCC 
STS 


VCC 


+->  WFMCMDA 
->  WFMCMDB 
->  WFMCMDC 

~>  DRVSELD 


WHEN  REQUEST  DEACTIVATED, 

NEGATE  READY  f.  COMMAND 
OTHERWISE  GOTO  WFMCMDA 
OUTPUT  WFM  COMMAND  CODE 
WHEN  STS  IS  ACTIVATED, 

NEGATE  COMMAND  CODE 
GOTO  DRVSELD 


WFMCMD. OUTF: 
WFMCMDA. OUTF 
WFMCMDB . OUTF 
WFMCMDC . OUTF 


/READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
=  READY */CD3*/CD2*/CDl*/CD0*/ERASE*/INITIALIZE*DRSELD 
=  READY*CD3*/CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 


.******  READ  FILE  MARK  COMMAND  ****** 
RFMCMD  :=  REQ  ->  RFMCMD 


RFMCMDA 
RFMCMDB 


:=  VCC 
:=  STS 


VCC 


+->  RFMCMDA 
->  RFMCMDB 
->  RFMCMDC 

->  DRVSELD 


;WHEN  REQUEST  DEACTIVATED, 
;   NEGATE  READY  &  COMMAND 
.•OTHERWISE  GOTO  RFMCMDA 
; OUTPUT  RFM  COMMAND  CODE 
; WHEN  STS  IS  TRUE,  NEGATE 
;   COMMAND  CODE 
;GOTO  DRVSELD 


RFMCMD. OUTF :=  /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
RFMCMDA. OUTF :=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
RFMCMDB. OUTF :=  READY*CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
RFMCMDC. OUTF :=  READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

.»**.«.  READ  COMMAND  ****** 


RDCMD 

:=  REQ 

-> 

RDCMD 

WHEN  REQUEST  DEACTIVATED, 

NEGATE  READY 

+-> 

RDCMDA 

OTHERWISE  GOTO  RDCMD 

RDCMDA 

:=  ONLN 

-> 

RDCMDB 

IF  ONLINEB  IS  TRUE,  OUTPUT 
READ  COMMAND  CODE 

+-> 

CMDERRA 

IF  ONLINEB  FALSE,  OUTPUT 
INVALID  COMMAND  CODE 

RDCMDB 

:=  STS 

-> 

RDCMDC 

IF  FILE  MARK   IS  FOUND, 
NEGATE  COMMAND  CODE 

+  -> 

RDCMDD 

OTHERWISE  GOTO  RDCMDD 

RDCMDD 

:=  ONLN 

-> 

RDCMDE 

IF  ONLINE  IS  TRUE,  GOTO 
RDCMDE 

+-> 

RDCMDF 

OTHERWISE  NEGATE  COMMAND 
CODE 

RDCMDF 

:=  VCC 

-> 

RDCMDG 

OUTPUT  REWIND  COMMAND  CODE 

RDCMDG 

:=  STS 

-> 

RDCMDC 

NEGATE  COMMAND  COPF. 

RDCMDE 

:=  REQ 

-> 

DRVSELD 

IF  REQUEST  IS  TRUE,  GOTO 

DRVSELD 

+-> 

RDCMDB 

OTHERWISE  GOTO  RDCMDB 

RDCMDC 

:=  VCC 

-> 

DRVSELD 

GOTO  DRVSELD 

RDCMD. OUTF := 
RDCMDA. OUTF 
RDCMDB. OUTF 
RDCMDC. OUTF 
RDCMDD. OUTF 
RDCMDE. OUTF 
RDCMDF. OUTF 
RDCMDG. OUTF 


READY* /CD 3* /CD2* /CD1 */CD0* /ERASE* / INITIALIZE* DRSELD 
=  /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
=  /READY*CD3*/CD2*CD1*/CD0*/ERASE*/ INITIALIZE* DRSELD 
=  /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
=  /READY*CD3*/CD2*CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
=   /READY *CD3*/CD2*CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
=  /READY */CD3*/CD2*/CDl*/CD0*/ERASE*/INITIALIZE*DRSELD 
=  /READY*CD3*CD2*/CD1*CD0*/ERASE*/INITIALIZE* DRSELD 


;«**.**   WRITE  COMMAND  ****** 


WRCMD 

REQ 

-> 
+-> 

WRCMD 
WRCMDA 

WHEN  REQUEST  DEACTIVATED, 

NEGATE  READY 
OTHERWISE  GOTO  WRCMD 

WRCMDA 

ONLN 

-> 
+-> 

WRCMDB 
CMDERRA 

IF  ONLINE  IS  TRUE,  OUTPUT 
WRITE  COMMAND  CODE 

IF  ONLINE  FALSE,  OUTPUT 
INVALID  COMMAND  CODE 

WRCMDB 

EW 

-> 
+  -> 

WRCMDC 
WRCMDE 

IF  EWH  IS  DETECTED, 

NEGATE  COMMAND  CODE 
OTHERWISE  GOTO  WRCMDE 

WRCMDC 

VCC 

-> 

WRCMDD 

OUTPUT  END  OF  MEDIA 
COMMAND  CODE 

WRCMDD 

VCC 

-> 

DRVSELD 

GOTO  DRVSELD 

WRCMDE 

REQ 

-> 
+  -> 

DRVSELD 
WRCMDF 

IF  REQUEST  IS  TRUE ,  GOTO 

DRVSELD 
OTHERWISE  GOTO  WRCMDF 

WRCMDF 

ONLN 

-> 

WRCMDB 

IF  ONLINEB  IS  TRUE,  GOTO 

WRCMDB 
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WRCMDG 
WRCMDH 


WRCMDI 

WRCMD.OUTF:> 
WRCMDA.OUTF 
WRCMDB.OUTF 
WRCMDC.OUTF 
WRCMDD.OUTF 
WRCMDE.OUTF 
WRCMDF.OUTF 
WRCMDG. OUTF 
WRCMDH. OUTF 
WRCMDI . OUTF 


VCC 
STS 


VCC 


+->  WRCMDG  ; NEGATE  COMMAND  CODE 

->  WRCMDH  ;OUTPUT  WFM  COMMAND  CODE 

->  WRCMDI  ;IF  FM  IS  ACTIVATED,  GOTO 

;  WRCMDI 

+->  WRCMDH  ; OTHERWISE  GOTO  WRCMDH 

->  BOTCMDA  ; REWIND  TAPE 


READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 

=  /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
=  /READY*CD3*/CD2*CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
=  /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
=  /READY*/CD3*CD2*CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
=  /READY*CD3*/CD2*CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
=  /READY*CD3*/CD2*CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
=  /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
=  /READY*CD3*/CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
=  /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 


;  e@@(a@@  CONDITIONS 


CONDITIONS 
CONDO 
CONDI 
COND2 
COND3 
ONLN 
STS 
EW 
REQ 


CCO 
CC1 
CC2 
CC3 

ONLINE 
STATUS 
EWH 

REQUEST 


SIMULATION  INPUT 


SIMULATION 

TRACE_ON  /CCO  /CC1  /CC2  /CC3  /REQUEST  /ONLINE  CD3  CD2  CD1  CDO   STATUS  ERASE 

INITIALIZE  READY  DRSELD 
SETF  /CC3  /CC2  /CC1  /CCO  /REQUEST  /ONLINE  /STATUS  /EWH  /PRESET  /CLK 
SETF  PRESET  ; RESET  DEVICE 

CLOCKF  CLK  ; SELECT  DRIVE 

SETF  /CC3  /CC2  /CC1  CCO 
SETF  REQUEST 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 
SETF  /REQUEST 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 

SETF  /CC3  /CC2  /CC1  /CCO  ;  INVALID  COMMAND 

SETF  REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  /REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  /CC3  CC2  /CC1  CCO 
SETF  REQUEST 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 
SETF  /REQUEST 
CLOCKF  CLK 
CLOCKF  CLK 
SETF  STATUS 
CLOCKF  CLK 
SETF  /STATUS 
CLOCKF  CLK 
CLOCKF  CLK 


;BOT  COMMAND 


; ASSERT  STATUS,  BOT 
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SETtf  /LCJ  CC1  /CCU 

SETF  REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  /REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  STATUS 

CLOCKF  CLK 

SETF  /STATUS 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  STATUS 

CLOCKF  CLK 

SETF  /STATUS 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  STATUS 

CLOCKF  CLK 

SETF  /STATUS 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  /CC3  CC2  CC1  CCO 

SETF  REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  /REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  STATUS 

CLOCKF  CLK 

SETF  /STATUS 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  STATUS 

CLOCKF  CLK 

SETF  /STATUS 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  STATUS 

CLOCKF  CLK 

SETF  /STATUS 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  /CC3  CC2  /CC1  /CCO 

SETF  REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  /REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 


; ERASE  COMMAND 


ASSERT  STATUS,  BOT 


; ASSERT  STATUS,  EOT 


; ASSERT  STATUS,  BOT 


; INITIALIZE  COMMAND 


; ASSERT  STATUS,  BOT 


; ASSERT  STATUS ,  EOT 


; ASSERT  STATUS ,  BOT 


; READ  STATUS 


•les  I 
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SETF  /REQUEST  ; READ  STATUS  BYTE  1 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  /REQUEST  ; READ  STATUS  BYTE  1 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  /REQUEST  ; READ  STATUS  BYTE  2 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  /REQUEST  ; READ  STATUS  BYTE  3 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  /REQUEST  ; READ  STATUS  BYTE  4 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  /REQUEST  ; READ  STATUS  BYTE  5 

CLOCKF  CLK 
CLOCKF  CLK 

SETF  CC3  CC2  /CC1  CCO  ; READ  COMMAND 

SETF  REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 
CLOCKF  CLK 
SETF  /REQUEST 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 

SETF  /ONLINE  ; NEGATE  ONLINEB 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  STATUS 

CLOCKF  CLK 

SETF  /STATUS 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  CC3  CC2  /CC1  CCO  ; READ  COMMAND 

SETF  ONLINE 

SETF  REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  /REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 
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SETF  STATUS 


; FILE  MARK  FOUND 


CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 
SETF  /STATUS 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 

SETF  CC3  CC2  /CC1  CCO  ; READ  COMMAND 

SETF  REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  /REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  CC3  CC2  CC1  /CCO  ; READ  FILE  MARK 

SETF  REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  /REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  STATUS 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  /STATUS 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  CC3  CC2  /CC1  CCO  ; READ  COMMAND 

SETF  /ONLINE 

SETF  REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  /REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  CC3  CC2  /CC1   /CCO  ; WRITE  COMMAND 

SETF  ONLINE 

CLOCKF  CLK 

SETF  REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  /REQUEST 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

CLOCKF  CLK 

SETF  /ONLINE  ; NEGATE  ONLINEB 

CLOCKF  CLK 
CLOCKF  CLK 
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CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 
SETF  STATUS 
CLOCKF  CLK 
CLOCKF  CLK 
SETF  /STATUS 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 
CLOCKF  CLK 

SETF  STATUS  ; ASSERT  STATUS,  BOT 

CLOCKF  CLK 

SETF  /STATUS 

CLOCKF  CLK 

CLOCKF  CLK 

TRACE  OFF 
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PLE5P16 
12345 

COMMAND  CODE  DECODER 

MMI,    SANTA  CLARA,  CALIFORNIA 


PLE  DESIGN  SPECIFICATION 
KEN  WON  1/27/87 


STSO 

STS1 

STS2 

STS3 

STS4 

STS5 

EOM 

RFM 

WFM 

READ 

WRITE 

FFWD 

RWD 

ICMD 


/CD3*/CD2*/CD1*CD0 

/CD3*/CD2*CD1*/CD0 

/CD3*/CD2*CD1*CD0 

/CD3*CD2*/CD1*/CD0 

/CD3*CD2*/CD1*CD0 

/CD3*CD2*CD1*/CD0 

/CD3*CD2*CD1*CD0 

CD3*/CD2*/CD1*/CD0 

CD3*/CD2*/CD1*CD0 

CD3*/CD2*CD1*/CD0 

CD3*/CD2*CD1*CD0 

CD3*CD2*/CD1*/CD0 

CD3*CD2*/CD1*CD0 

CD3*CD2*CD1*/CD0 


ENABLE  STATUS  BYTE 
ENABLE  STATUS  BYTE 
ENABLE  STATUS  BYTE 
ENABLE  STATUS  BYTE 
ENABLE  STATUS  BYTE 
ENABLE  STATUS  BYTE 
END  OF  MEDIA 
READ  FILE  MARK 
WRITE  FILE  MARK 
READ 
WRITE 

FAST  FORWARD 
REWIND 

INVALID  COMMAND 


FUNCTION  TABLE 


NC  CD3  CD2  CD1  CDO  EOM  ICMD  RWD  FFWD  WRITE  READ  WFM  RFM 
STS5  STS4   STS3   STS2  STS1  STSO 


.  INPUT  /  OUTPUT. 


LLLLL 

L 

L 

L 

L 

L 

L 

L 

L 

L 

L 

L 

LLLLH 

L 

L 

L 

L 

L 

L 

L 

L 

L 

L 

L 

LLLHL 

L 

L 

L 

L 

L 

L 

L 

L 

L 

L 

L 

LLLHH 

L 

L 

L 

L 

L 

L 

L 

L 

L 

L 

L 

LLHLL 

L 

L 

L 

L 

L 

L 

L 

L 

L 

L 

H 

LLHLH 

L 

L 

L 

L 

L 

L 

L 

L 

L 

H 

L 

LLHHL 

L 

L 

L 

L 

L 

L 

L 

L 

H 

L 

L 

LLHHH 

H 

L 

L 

L 

L 

L 

L 

L 

L 

L 

L 

LHLLL 

L 

L 

L 

L 

L 

L 

L 

H 

L 

L 

L 

LHLLH 

L 

L 

L 

L 

L 

L 

H 

L 

L 

L 

L 

LHLHL 

L 

L 

L 

L 

L 

H 

L 

L 

L 

L 

L 

LHLHH 

L 

L 

L 

L 

H 

L 

L 

L 

L 

L 

L 

LHHLL 

L 

L 

L 

H 

L 

L 

L 

L 

L 

L 

L 

LHHLH 

1> 

L 

H 

L 

L 

L 

L 

L 

L 

L 

L 

LHHHL 

L 

H 

L 

L 

L 

L 

L 

L 

L 

L 

L 

L  L  L  ;  NO  COMMAND 

L  L  H  ;  ENABLE  STATUS  BYTE  0 

L  H  L  ;  ENABLE  STATUS  BYTE  1 

H  L  L  ;  ENABLE  STATUS  BYTE  2 

L  L  L  ;  ENABLE  STATUS  BYTE  3 

L  L  L  ;  ENABLE  STATUS  BYTE  4 

L  L  L  ;  ENABLE  STATUS  BYTE  5 

L  L  L  ;  END  OF  MEDIA 

L  L  L  ;  READ  FILE  MARK 

L  L  L  ;  WRITE  FILE  MARK 

L  L  L  ;  READ 

L  L  L  ;  WRITE 

L  L  L  ;  FAST  FORWARD 

L  L  L  ;  REWIND 

L  L  L  ;  INVALID  COMMAND 


DESCRIPTION 

THIS  PLE  DEVICE  IS  ONE  OF  THREE  DEVICES  USED  TO  IMPLEMENT  A  QIC-02 
COMMAND  SEQUENCER.      THIS  DEVICE  DECODES  THE  OUTPUT  CODE  FROM  THE 
PROSE  DEVICE  TO  GENERATE  CONTROL  SIGNALS.   THE  OTHER  TWO  DEVICES  ARE 
A  20L8  PAL  DEVICE  AND  A  14R21  PROSE  DEVICE. 


•Vl  Monolithic  KIM  M emorles  ^1 
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GCR  (4B-5B)  Encoder/ Decoder 


One  of  the  more  common  logic  functions  performed  on  serial 
data  is  the  data  encode/decode  function.  Usually  it  is  desir- 
able to  map  (encode)  the  logical  bit  stream  to  a  physical  bit 
stream,  adjusting  for  the  peculiarities  of  the  particular  trans- 
mission or  storage  media. 

Noise,  bandwidth,  and  reliability  considerations  may  mean 
that  a  different  data  format  would  be  desirable  when  data  is 
sent  along  to  or  stored  on  a  given  media.  For  example,  group- 
coded  recording  (GCR)  formats  take  a  given  number  of  data 
bits  and  encode  them  with  a  larger  number  of  bits.  A  4B-5B 


GCR  code  would  take  4  data  bits  and  encode  them  into  16 
states  with  5  new  bits.  A  particular  4B-5B  code  is  shown  in 
Figure  1 . 

This  mapping  allows  at  most  two  zeros  to  occur  in  succes- 
sion. Also  note  that  data  combinations  with  more  than  one 
zero  at  the  beginning  and  end  of  the  word  are  excluded.  This 
is  necessary  to  insure  that  when  data  words  are  serialized,  no 
more  than  two  zeros  occur  in  succession  at  any  point  in  the 
bit  stream.  Finally,  the  data  combination  11111  is  reserved  as 
a  synchronization  mark.  In  tape  systems,  this  results  in  in- 
creased bit  density  and  eases  clock  synchronization. 


4B-5B  Code 
4-Bit  Data  5-Bit  Data 


0 

0 

0 

0 

1 

1 

0 

0 

1 

0 

0 

0 

1 

1 

1 

0 

1 

1 

0 

0 

1 

0 

1 

0 

0 

1 

0 

0 

0 

1 

1 

1 

0 

0 

1 

1 

0 

1 

0 

0 

1 

1 

1 

0 

1 

0 

1 

0 

1 

1 

0 

1 

0 

1 

0 

1 

1 

0 

1 

0 

1 

1 

0 

0 

1 

1 

1 

1 

0 

1 

1 

1 

1 

0 

0 

0 

1 

1 

0 

1 

0 

1 

0 

0 

1 

0 

1 

0 

0 

1 

1 

0 

1 

0 

0 

1 

0 

1 

0 

1 

0 

1 

1 

0 

1 

0 

1 

1 

1 

1 

0 

0 

1 

1 

1 

1 

0 

1 

1 

0 

1 

0 

1 

1 

0 

1 

1 

1 

1 

0 

0 

1 

1 

1 

0 

1 

1 

1 

1 

0 

1 

1 

1 

1 
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Figure  1.  4B-5B  Code 


trailer.  Parallel  input  data  is  given  to  the  GCR  E/D,  converted  to 
the  5-bit  format,  serialized,  and  written  to  the  tape.  On  a  read, 
the  serial  data  from  the  tape  is  parallelized,  converted  back  to 
the  4-bit  format  and  output  to  the  output  data  bus.  Additionally, 
during  a  read,  two  status  signals  are  developed.  The  first  signal, 
INV,  indicates  the  presence  of  an  invalid  inputj.e.,  too  many 
zeros  in  succession.  The  second  status  signal,  H,  indicates  the 
detection  of  the  synchronization  mark  (11111). 

The  operation  modes  for  the  GCR  E/D  are  shown  in  the  Data- 
Flow  Diagrams  of  Figure  3.  The  control  signal  and  operation 
functions  are  indicated  for  each  operation  mode.  In  particular,  the 
data  flow  between  each  bit  of  the  output  register  is  indicated 
schematically. 

The  first  mode  of  operation  of  the  GCR  E/D  is  the  HOLD 
mode.  When  ENABLE  is  HIGH,  all  data  operations  on  the  out- 
put register  are  disabled,  independent  of  the  two  mode  con- 
trols, M,  and  M0.  The  output  data  is  simply  fed-back  to  the 
register  inputs.  Thus  the  register  content  is  retained  after  the 
clock  transition. 

When  the  ENABLE  input  is  LOW,  the  operations  indicated  by 
the  M ,  and  M0  mode  bits  are  executed  on  the  clock  transition. 
When  M ,  and  M0  are  both  LOW,  the  SERIAL  SHIFT  IN  mode  is 
selected.  In  this  mode  the  output  register  is  configured  as  a 
serial  shift  register.  The  serial  input  is  consecutively  shifted 
into  the  register  until  all  5  bits  from  the  tape  have  been  stored, 
MSB  at  Y3  and  LSB  at  SERIAL  OUT. 


After  the  5  bits  of  data  have  been  serialized  by  the  SERIAL 
SHIFT  IN  instruction,  the  5B  code  must  be  converted  to  a  4B 
code.  This  is  accomplished  by  taking  the  outputs  of  the  5 
register  bits  and  converting  them  to  4  bits  with  combinatorial 
logic.  On  the  clock  transition,  the  result  is  loaded  into  the  Y 
register.  On  the  same  clock  transition  that  loads  the  con- 
verted data  into  the  Y  register,  the  serial  input  is  loaded  into 
the  serial  output  register.  Because  the  serial  data  is  being 
read  continuously,  one  data  bit  per  clock  transition,  the  con- 
version must  be  done  without  missing  a  serial  data  bit. 

The  CONVERT  PARALLEL  INPUT  AND  LOAD  operation  is 
selected  when  ENABLE  is  LOW,  M,  is  HIGH  and  M0  is  HIGH. 
This  mode  takes  the  4  input  data  bits  and  converts  them  to 
the  5  bit  representation.  The  result  is  loaded  into  the  output 
register  on  the  clock  transition.  The  LSB  of  the  5B  representa- 
tion is  loaded  into  the  Y3  bit  of  the  output  register  and  the 
MSB  is  loaded  into  the  serial  output  bit.  This  configuration,  in 
conjunction  with  the  next  instruction,  allows  the  serial  data 
to  be  written  to  the  tape  drive  one  bit  per  clock  transition. 

The  final  operation,  SERIAL  SHIFT  OUT,  is  selected  when 
ENABLE  is  LOW,  M,  is  LOW  and  M0  is  HIGH.  After  the  CON- 
VERT PARALLEL  INPUT  AND  LOAD  operation  is  executed, 
the  SERIALSHIFT  OUT  operation  outputs  the  converted  data 
to  the  tape  drive.  A  series  of  one  convert  operation  followed 
by  4  shift  operations  will  transfer  a  sequence  of  5-bits  to  the 
tape  drive,  one  bit  per  clock  cycle. 
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Figure  2.  Typical  Tape  Storage  System 
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Design  Approach  (AmPAL16R6) 

The  PAL  device  implementation  of  the  GCR  Encoder/Decoder 
takes  advantage  of  the  multiplexer-like  structure  of  the  AND-OR 
array.  Each  valid  combination  of  ENABLE,  M-,  and  M0  selects  a 
different  set  of  AND  terms.  In  some  cases,  only  one  term  is 
selected  (in  data  steering  operations  for  example).  In  other 
cases,  multiple  AND  terms  are  selected  to  implement  a  com- 
binatorial logic  function  (the  5B-to-4B  conversion  for  example). 
This  concept,  using  the  control  inputs  to  enable  one  or  more 
AND  terms,  allows  the  direct  implementation  of  the  PAL  device 
design  from  the  mode  Data-Flow  Diagrams  (with  a  little  Kar- 
naugh map  help).  The  K-Maps  for  the  5B-to-4B  conversion  logic 
and  the  4B-to-5B  conversion  logic  for  the  Y3  output  are  shown  in 
Figures  4  and  5.  Given  these  maps  and  the  flow  diagrams  in 
Figure  3,  the  Boolean  equations  can  be  constructed  for  the  Y3 
output.  The  resulting  equation,  in  PALASM  software  format,  is 
shown  in  Figure  6. 

It  is  important  to  note  that  the  equation  in  Figure  6  is  written  for 
the  inverse  of  the  Y3  output  (Y3).  This  is  necessary  if  true  data  is 
desired  on  the  output  pin  because  of  the  inverting  nature  of  the 
output  buffer  on  the  PAL  device.  The  inverted  form  of  the  equa- 
tion is  easily  implemented  by  selecting  the  negative  version  of 
the  data  (Y3  in  the  hold  operation  for  example)  or  by  grouping 
zeros  in  a  combinatorial  logic  function  (see  Figures  4  and  5). 
Notice  that  the  multiplexer  strategy  works  equally  well  for  active- 
LOW  or  active-HIGH  logic  functions. 

Once  the  transformation  of  the  Data-Flow  Diagrams  and 
K-Maps  to  Boolean  equations  is  understood,  the  interested 


reader  shou  Id  be  able  to  construct  K-Maps  for  the  other  Y  out- 
puts and,  in  conjunction  with  the  Data-Flow  Diagrams  of 

Figure  3,  write  the  PALASM  software  equations  for  the  resulting 
logic  functions.  This  exercise  will  help  the  reader  to  fully  appreci- 
ate the  advantages  of  the  Data-Flow  Diagram/Multiplexer 
method  of  PAL  design.  Consult  the  full  PALASM  software  listing 
(Figure  10)  for  the  complete  solutions. 

It  is  important  to  note  that  the  diagrams  and  equations  in  Figures 
7,  8,  and  9  specify  the  true  output  for  invalid  signal  (INV),  rather 
than  INV  which  appears  in  the  system  diagram  of  Figure  2.  This 
is  necessary  if  the  correct  data  is  desired  on  the  output  pin  and 
is  due  to  the  inverting  nature  of  the  output  buffer  on  the  PAL 
device. 

The  TNV  signal  is  registered  and  held  until  the  clear  INV  flag 
input  (CIF)  is  brought  LOW,  deactivating  the  flag.  Only  during  a 
5B-4B  conversion  operation  {M-i  =  HIGH,  M0  =  LOW)  is  the 
INV  flag  activated.  Figures  7  and  8  show  the  INV  flag  mode 
definitions  and  the  intermediate  INVALID  logic  equation 
respectively. 


In  this  case,  an  active-LOW  output  is  desired  so  the  active-HIGH 
form  of  the  INV  signal  is  developed  internally.  Ones  are  grouped 
in  the  intermediate  combinatorial  logic  function  (INVALID)  and 
the  true  version  of  the  data  is  selected.  The  complete  PALASM 
software  equation  for  INV  is  given  in  Figure  9. 
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Figure  4.  5B-to-4B  Conversion  K-Map  for  Y3  Output  Figure  5.  4B-to-5B  Conversion  K-Map  for  Y3  Output 
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TITLE         4B-5B  ENCODER/DECODER 
PATTERN     PAT00  3 
REVISION  01 

AUTHOR       WARREN  MILLER 

COMPANY     ADVANCED  MICRO  DEVICES 

DATE  11/06/87 

CHIP  ENCO_DECO  PAL16R6 

CK  Ml  MO  D3  D2  Dl  DO  /EN  /CIFGND 
/E-   SIN  /INV  Y0  Yl  Y2  Y3     SOUT  /H  VCC 

EQUATIONS 

/SOUT   :=  EN*/SOUT 

+  /EN*/M1*/M0*/SIN 

+  /EN*/M1*  M0*/Y0 

+  /EN*  M1*/M0*/SIN 

+  /EN*  Ml*  MO*  D3*  Dl 

+  /EN*  Ml*  MO*  D3*  DO 


;  HOLD 

; SERIAL  SHIFT  IN 
; SERIAL  SHIFT  OUT 
.•CONVERT  SERIAL  INPUT  AND  LOAD 
; CONVERT  PARALLEL  INPUT  AND  LOAD 


/Yl 


/Y0        :=  EN*/Y0 

/EN*/Ml*/M0*/SOUT 
/EN*/M1*  M0*/Y1 
/EN*  Ml*/M0*/SOUT 
/EN*  M1*/M0*  Y3*  Y2*/Y0 
/EN*  Ml*  M0*/D3*  Dl 
/EN*  Ml*  M0*/D3*  D2*  DO 

EN*/Y1 
/EN*/M1*/M0*/Y0 
/EN*/M1*  M0*/Y2 
/EN*  M1*/M0*/Y0 
/EN*  M1*/M0*  Y3*  Y2 
/EN*  Ml*  M0*/D2 

/Y2        :=  EN*/Y2 

/EN*/M1*/M0*/Y1 
/EN*/M1*  M0*/Y3 
/EN*  M1*/M0*/Y1 
/EN*  Ml*  M0*/D3*/D1*/D0 
/EN*  Ml*  M0*/D3*  D2*/D1 
/EN*  Ml*  MO*  D3*/D1*  DO 

/Y3        :=  EN*/Y3 

+  /EN*/M1*/M0*/Y2 

+  /EN*/M1*  M0*/SOUT 

+  /EN*  M1*/M0*  Y3*  SOUT 

+  /EN*  M1*/M0*  Y3*/Y2 

+  /EN*  Ml*  MO*  D3*/D0 

+  /EN*  Ml*  MO*  D1*/D0 

INV      :=  /CIF*  INV 

+  /CIF*  Ml*/M0*/Y3*/Y2 

+  /CIF*  M1*/M0*/Y2*/Y1*/Y0 

+  /CIF*  Ml*/M0*/Y0*/SOUT 

+  /CIF*  M1*/M0*  Y3*  Y2*  Yl*   Y0*  SOUT 

H  =     Y3*   Y2*  Yl*  Y0*  SOUT 

;    SIMULATION  NOT  INCLUDED 

Figure  10.  PAL  ASM  Listing 
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Industrial  control  can  be  defined  as  simply  as  a  basic  state 
machine:  process  information  is  gathered  and  analyzed,  and 
outputs  feed  back  to  the  process  to  modify  it  based  upon  the 
analysis.  Thus,  PAL  device  applications  are  often  state 
machines. 

A  very  basic  industrial  control  system  is  a  heater  thermostat. 
Temperature  is  sensed,  compared  to  a  maximum  upper  or  lower 
limit,  and  the  heater  is  either  turned  on  or  off.  A  small  state 
machine  PLD  can  implement  this  function. 

Much  of  the  circuitry  in  industrial  control  applications  is  analog  or 
linear.  Process  information  is  gathered  by  sensors,  and  then  the 
information  is  conditioned  to  eliminate  noise.  The  conditioned 
signal  is  then  converted  to  a  digital  signal,  and  processed  by  a 
digital  processor.  The  control  outputs  are  then  converted  to 
appropriate  analog  signals  to  modify  the  process  (Figure  1 ). 

Data  Acquisition 

The  sensing  part  of  the  system  is  data  acquisition.  Data  acquisi- 
tion is  important  not  only  in  industrial  control  applications,  but  also 
in  scientific  monitoring  applications.  Data  acquisition  requiresthe 
capture  of  analog  signals  and  conversion  to  useful  information 
that  can  be  analyzed.  This  first  requires  conditioning  of  the  input, 
to  filter  out  noise.  The  signal  can  then  be  analyzed  with  analog 
circuitry,  but  the  power  of  digital  microprocessors  makes  conver- 
sion to  digital  logic  more  efficient. 

More  logic  is  being  incorporated  into  the  acquisition  circuitry.  The 
analog  circuitry  for  signal  conditioning  is  a  natural  addition  to  a 
sensor.  Analog-to-digital  conversion  (ADC)  can  even  be  per- 
formed in  the  same  unit.  Advanced  sensors  may  even  incorpo- 
rate a  microprocessor  to  provide  immediate  local  analysis  of  the 
data,  offloading  the  central  processor.  In  these  applications,  PAL 
devices  can  form  the  glue  logic  for  the  microprocessor. 

Data  Analysis 

Data  analysis  has  been  performed  in  the  past  by  a  central 
computer.  The  central  computerwould  have  thetask  of  polling  all 
of  the  sensors  and  analyzing  the  incoming  data.  However,  in  a 
system  where  thousands  or  millions  of  sensors  are  active,  or 
where  immediate  response  to  process  changes  is  required,  this 
system  s  inefficient. 


The  type  of  system  more  commonly  used  today  includes  distrib- 
uted control  and  processing  (Figure  2).  Microcomputers  or 
programmable  controllers  can  perform  low-level  processing  and 
react  with  control  signals  immediately.  Often  single-board  com- 
puters with  added  analog  conversion  circuitry  are  used. 

A  LAN  connection  between  a  number  of  intelligent  sensors  and 
processors  provides  a  very  efficient  system.  Simple  RS-232  links 
connect  sensors.  Buses  such  as  Multibus  or  VME  bus  are 
commonly  used  to  connect  the  boards  together.  Standards  such 
as  MAP  (Manufacturing  Automation  Protocol)  are  being  devel- 
oped for  future  LAN-based  systems,  allowing  dissimilar  devices 
to  be  connected  in  a  network.  Fiber-optic  links  will  provide  the 
data  reliability  over  long  distances  in  confined  areas  required  for 
industrial  control. 


uC 


CONTROLLED 
PROCESS 


LAN 


Figure  2. 


In  these  areas,  standard  computer  applications  for  PAL  devices 
are  numerous.  Bus  interface  is  another  key  application  area  for 
communication  within  distributed  systems. 

Control 

The  most  critical  part  of  the  system  is  control.  The  system  must 
respond  to  the  analyzed  information  with  signals  that  modify  the 
process  under  observation.  In  a  system  such  as  a  nuclear 
reactor,  immediate  and  accurate  response  is  the  most  important 
objective. 
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Figure  1. 
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PID  Algorithm 

For  smooth  control,  the  PID  algorithm  for  analysis  and  control  is 
often  used.  P  stands  for  Proportional:  the  incoming  data  is 
compared  to  a  reference  number,  and  then  the  process  is 
modified  in  proportion  to  the  difference  between  the  desired  and 
measured  values.  For  example,  a  heater  thermostat  that  meas- 
ures air  temperature  of  55  degrees,  when  its  reference  value  was 
set  at  65  degrees,  would  turn  on  the  heater,  and  reduce  the 
amount  of  heat  as  the  temperature  got  closer  to  the  reference 
value. 

Proportional  analysis  alone  often  causes  undershoot  or  over- 
shoot. If  the  heater  works  quickly,  it  can  heat  the  air  well  past  the 
reference  value.  And  if  it  works  slowly,  it  may  take  a  long  time  to 
heat  the  air  back  to  the  reference  value. 

For  these  reasons,  the  Integral  (I)  part  of  the  algorithm  is  added. 
The  integral  is  taken  of  the  difference  between  the  reference 
value  and  measured  value;  in  effect,  this  measures  how  long  the 
measured  value  has  been  off  from  the  reference  value.  Using  the 
integral  prevents  undershoot  and  overshoot. 

However,  the  result  can  be  a  control  output  that  causes  the 
measured  value  to  continue  oscillating  around  the  reference 
value.  For  example,  if  the  heater  is  turned  on  at  64  degrees  and 
an  air  conditioner  is  turned  on  at  66  degrees,  the  temperature  will 
oscillate  rapidly  around  the  reference  point  of  65  degrees. 

For  this  reason,  a  Derivative  (D)  input  is  used.  The  derivative 
looks  at  the  rate  of  change  of  the  measured  value,  and  will  slow 
down  the  response  if  the  measured  value  is  rapidly  approaching 
the  reference  value. 

The  complete  PID  algorithm  is 

OUT  -  Kp  X  ERR  +  Ki  X  JeRR  +  Kd  x  dERR/dt, 

combining  present,  past,  and  future  input  considerations.  Selec- 
tion of  the  proper  constants  allows  the  control  response  to  be  both 
rapid  and  smooth. 

The  PID  algorithm  requires  mathematical  processing  capabili- 
ties, similar  to  those  used  in  digital  signal  processing  (see  the 
section  on  Digital  Signal  Processing,  page  2-283). 

Control  circuitry  is  also  combining  analog  and  digital  logic.  Smart 
power  ICs  can  perform  logic  functions  while  receiving  and  send- 
ing high-power  signals.  These  ICs  can  perform  simple  functions 
like  overvoltage  protection,  cutting  off  a  circuit  if  the  powersurges. 

Applications 

Applications  for  industrial  control  systems  are  varied,  and  include 
the  following: 

•  Factory  Automation 

Automated  distribution  centers 
Power  grid  monitoring  and  control 

•  Motor  Control 

Robots 

Engine  testing 
Torque  control 


•  Transportation 

Air  traffic  control 
Subway  system  control 

•  Automotive  Electronics 

•  Satellite  Communication 

•  Simulators 

Performance  Requirements 

The  key  requirement  of  industrial  control  systems  is  the  need  for 
real-time  processing.  An  air-traffic  control  system  cannot  be 
allowed  to  slow  down  if  overloaded.  During  overloading  in  real 
systems,  the  information  provided  on  display  screens  is  reduced, 
instead  of  completely  losing  all  information  on  a  plane. 

An  interrupt  handling  system  is  more  efficient  than  a  standard 
polling  system.  Many  I/O  are  required,  with  high  DAC  and  ADC 
accuracy.  For  data  analysis,  a  large  memory  and  good  memory 
management  is  important.  Workstations  with  a  high  I/O  speed 
are  often  used  for  central  industrial  controllers. 

PAL  Device  Usage 

Most  of  the  circuitry  in  an  industrial  control  application  is  analog 
or  linear.  Only  the  analyzer  is  primarily  digital  logic.  The  analyzer, 
whether  local  or  central,  uses  a  standard  computer  architecture. 

Programmable  logic  devices,  especially  CMOS  PLDs,  are  used 
in  industrial  control  much  as  they  are  used  in  other  computer 
applications.  High  speed  is  important  in  real-time  control  applica- 
tions involving  a  large  amount  of  data. 

CMOS  PAL  Devices 

Because  of  the  harsh  environment  often  found  in  factories  or 
around  motors,  CMOS  technology  is  required.  Specifications  for 
the  industrial  range  CMOS  ZPAL  family  are  guaranteed  over  a 
wider  temperature  range,  and  a  wider  supply  voltage  range.  In 
addition,  CMOS  inputs  are  less  sensitive  to  noise  fluctuations. 
The  lower  power  consumed  by  all  CMOS  PAL  devices  allows  a 
smaller  power  supply,  important  in  the  small  areas  controllers 
often  must  be  placed.  Also,  it  is  easier  for  a  battery  to  back  up  the 
power  supply  for  critical  operations.  The  reduced  heat  is  good  for 
enclosed  areas,  and  allows  CMOS  devices  to  be  placed  closer 
together,  even  in  surface-mounted  devices,  to  save  room. 

The  erasability  of  CMOS  PAL  devices  allows  changes  to  refer- 
ence values,  PID  control  constants,  or  other  process  control 
changes.  Erasability  also  allows  easy  field  service  upgrades  or 
repair. 

PAL  Device  Applications 

PAL  devices  find  applications  in  all  levels  of  industrial  control, 
from  the  low-level  sensing  equipment  to  the  mainframe  analysis 
computers. 

Low-Level  Applications 

Low-level  data  acquisition  and  response  circuitry  can  be  based 
on  an  individual  state  machine  PLD.   For  example,  stepper 
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motors  are  often  used  for  mechanical  valves.  Asingle  PALdevice 
can  be  programmed  to  convert  enable,  direction,  and  stepping 
information  into  the  appropriate  signals  to  hold  or  step  the  motor 
(page  2-550). 

Another  example  is  shaft  encoding  (page  2-558).  Shaft  encoders 
convert  the  position  of  a  rotating  shaft  into  a  digital  signal.  The 
position  is  sensed  optically,  and  the  digital  information  is  encoded 
into  position,  speed,  and  direction  signals. 

More  complex  applications  may  be  handled  by  dedicated  state 
machine  PLDs,  including  the  PROSE  device  or  the  Am29PL1 41 . 

Medium-Range  Applications 

Dedicated  microcontrollers  can  be  used  in  medium-range  appli- 
cations, such  as  small  controllers  linked  to  a  central  computer. 
These  applications  require  the  intelligence  of  a  microcontroller 
but  not  a  complete  microprocessor-based  computer  system.  In 
these  applications,  PAL  devices  form  the  glue  logic  between  the 
microcontroller  and  its  surrounding  chips. 


Microprocessor  support  is  a  key  area  of  application  (see  the 
section  titled  Microprocessor-Based  Systems,  page  2-1 31 ).  This 
is  especially  true  of  distributed  control  systems,  where  several 
microprocessors  perform  local  processing.  A  standard  architec- 
ture can  be  modified  for  different  areas  of  the  process  by  program- 
ming the  PALdevices  differently.  PAL  devices  also  allow  the  local 
controllers  to  take  up  less  space. 

Another  key  application  area  is  bus  interface  for  distributed 
systems  (see  the  section  titled  Bus  Interface,  page  2-325).  Either 
synchronous  or  asynchronous  buses  may  be  used. 

Since  industrial  controllers  are  usually  interrupt-driven  for  effi- 
cient response,  interrupt  logicforms  another  potential  application 
area  for  PAL  devices. 

Digital  signal  processing  is  often  performed  in  the  analysis 
section  of  the  industrial  controller,  usually  in  a  central  processor. 
The  applications  would  not  require  the  speed  or  complexity  of 
most  DSP  applications,  but  would  use  similar  circuitry.  See  the 
section  titled  Digital  Signal  Processing. 


High-End  Applications 


Dedicated  local  computers  and  central  data  processing  comput- 
ers are  the  high-end  applications  for  PLDs.  Here,  microproces- 
sors are  the  brains  of  the  system  and  PLDs  perform  many  of  the 
support  functions. 
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Functional  Description 

Stepper  motors  and  linear  actuators  are  used  in  a  variety  of 
applications  requiring  precise  rotational  and/or  linear  move- 
ment. Examples  are  printers,  floppy  disk  drives,  mechanical 
valves,  etc.  Stepper  motors  are  two-phase  permanent  magnet 
motors  which  provide  discrete  angular  movement  every  time  the 
polarity  of  a  winding  is  changed.  In  linear  actuators  the  angular 
movement  is  converted  to  linear  movement  via  a  load  screw.  In 
essence,  they  are  dc  motors  without  brushes,  where  the  user 
provides  commutation  with  external  logic. 

In  these  examples,  the  control  and  drive  circuitry  for  such  a 
motor  is  implemented  digitally  using  PAL  devices. 

Circuit  Operation 

One  type  of  drive  circuit,  unipolar  drive,  is  shown  in  Figure!  Two 
drive  sequences  are  given  in  Tables  1a  and  1b.  Angular  rotation 
is  achieved  by  saturating  the  transistor  drivers  in  the  sequence 
shown  in  the  appropriate  table  (full  or  half  step).  Now,  assume 
the  circuit  of  Figure  1  is  connected  to  a  stepper  motor  designed 
for  7.5°  steps.  By  following  the  step  sequence  of  Table  1a  (full 
step),  the  shaft  will  rotate  7.5°  each  time  the  state  is  changed.  If 
the  sequence  of  Table  1  b  is  followed,  a  3.75°  (half-step)  rotation 
will  result  for  each  change  of  state.  For  both  step  sequences,  the 
direction  can  be  reversed  by  stepping  backwards  through  the 
table  (step  4-3-2-1-4-etc). 


Figure  1 


CLOCKWISE 
ROTATION 


CLOCKWISE 
ROTATION 


Table  1a 
FULL  STEP  SEQUENCE 


STEP 

QO 

Q1 

Q2 

Q3 

1 

1 

0 

1 

0 

2 

1 

0 

0 

1 

3 

0 

1 

0 

1 

4 

0 

1 

1 

0 

1 

1 

0 

1 

0 

Table  1b 

HALF  STEP  SEQUENCE 

STEP 

QO 

Q1 

Q2 

Q3 

1 

1 

0 

1 

0 

2 

1 

0 

0 

0 

3 

1 

0 

0 

1 

4 

0 

0 

0 

1 

5 

0 

1 

0 

1 

6 

0 

1 

0 

0 

7 

0 

1 

1 

0 

8 

0 

0 

1 

0 

1 

1 

0 

1 

0 

COUNTER- 
CLOCKWISE 
ROTATION 


COUNTER- 
CLOCKWISE 
ROTATION 
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In  this  application,  one  PAL16R4  can  be  used  to  provide  the  logic 
levels  required  to  drive  two  stepper  motors  in  the  full  step  mode. 
Due  to  the  high  current  drive  required  (100-400  mA/phase), 
external  inverting  high  current  buffers  must  be  used  (ULN  2001 
or  equivalent).  In  the  design,  the  following  features  are  provided 
within  the  PAL  device: 


(E  inputs). 

•  Select  clockwise  or  counter-clockwise  rotation. 

•  Set  the  motor  to  logic  state  step  1. 

A  block  diagram/pinout  is  shown  in  Figure  2. 


MOTOR B 
CONTROLS 


Figure  2 

A  function  table  for  each  motor  control  section  is  given  below. 


D  FUNCTION 

X 

1 

X 

X 

X 

Hold  motor  in  current  position 

X 

X 

1 

X 

X 

Hold  motor  in  current  position 

t 

0 

0 

1 

X 

Set  outputs  to  step  1  levels 

1 

0 

0 

0 

0 

Step  motor  clockwise 

t 

0 

0 

0 

1 

Step  motor  counter-clockwise 

The  full  step  sequence  (Tableja)  can  be  jsimplified  from  4 
outputs  to  2  outputs  since  Q1  =  Q0  and  Q3  =  Q2.  The  sequence 
can  then  be  expressed  as  follows: 


Q0, 


n+1 


1,Q1 


n+1 


1.Q3n+1=0 


=  0.  Q2n+1 
when  E1  =  1  or  E2  =  1: 

Q0n+1  =  Q0n,  Q1n+1  =  Q1n,  Q2n+1  =  Q2n,  Q3n+1  : 


Q3 


D  =  0 

D  =  1 

STEP 

Q0 

Q2 

Q0 

Q2 

1 

1 

1 

1 

1 

2 

1 

0 

0 

1 

3 

0 

0 

0 

0 

4 

0 

1 

1 

0 

1 

1 

1 

1 

1 
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The  step  sequences  can  be  converted  to  equations  by  use  of  a 
Karnaugh  map. 

Q0n 


Q2n 


3 

0 

0 

(  1 

0 

C 1 

1  J 

0 

Q0n+1  =  Q2n  •  D  +  Q2n  •  D 


These  equations  make  up  the  first  design  file. 

Conclusion 

Although  this  example  could  be  used  "as  is"  in  a  stepper  motor 
application,  the  programmability  of  PAL  devices  could  allow  for 
any  desired  modifications.  Changes  to  the  circuit  might  include: 

1.  Drive  only  one  stepper  motor,  using  a  PAL16R6.  The  other 
flip-flops  could  be  used  as  a  programmable  counter,  allowing 
for  different  speed  settings. 

2.  Drive  only  one  stepper  motor,  using  the  extra  inputs  and 
outputs  to  handle  other  circuit  functions. 

3.  Drive  only  one  stepper  motor,  using  a  PAL16R6.  The  other 
flip-flops  could  be  used  as  a  4-bit  position  counter. 

4.  The  substitution  of  a  PAL16R8,  and  another  inverting  buffer 
would  allow  the  driving  and  control  of  four  stepper  motors. 

5.  Reprogram  for  half-step  operation.  This  is  done  in  the  second 
design  example. 


Q0n 


Q2n 


02, 


n+1 


Q2n+1  =  OOn  •  D  +  QOn  •  D 


Factor  in  E1  and  E2: 

QOn+1  =  El  ■  E2  ■  Q2n  •  D  +  El  •  E2  •  Q2n  •  D 
Q2n+1  =  El  ■  E2  •  QOn  •  D  +  El  •  E2  •  QOn  •  D 

Express  the  set  function  as  an  equation: 
QOn+1  =  El  •  E2~-  S  Q2n+i  =  El  ■  E2  •  S 

Express  the  hold  function  (when  E1  or  E2  =  1) 
QOn+i  =  QOn  •  E1  +  QOn  •  E2 
Q2n+1  =  Q2n  ■  E1  +  Q2n  •  E2 

Combining  all  the  above: 

QOn+1  :=EVE2-S  +  Q0n-E1  +  QOn  •  E2  +  El  ■  E2 •  Q2n  ■  D  +  El  ■  E2  •  Q2n • D 
01  n+1  :=Q_°n+l 

Q2n+1  :=  §3  ■  E2  •  S  +  Q2n  •  E1  +  Q22  •  E2  +  E1  •  E2  •  QOn  •  D  +  E1  •  E2  •  QOn  ■  D 
Q3n+1  :=  Q2n+1 
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TITLE 
PATTERN 
REVISION 
AUTHOR 
COMPANY 
DATE 

CHIP  SMC  PAL16R4 


STEPPER  MOTOR  CONTROLLER 
P7015 
1.00 

DAVE  SACKETT 

DEVOE  COMPANY ,  INDIANAPOLIS,  INDIANA 
02/23/81 


CLK  /E1A  /E2A  SA  DA  /E1B  /E2B  SB  DB  GND  /OC 
/Q3B  /Q1B  /Q2B  /QOB  /Q2A  /QOA  /Q3A  /Q1A  VCC 


EQUATIONS 

QOA  :=  Q0A*/E1A 

+     QOA*  /E2A 
+     SA  *   E1A*  E2A 
+  /Q2A*  E1A*  E2A*  DA 
+     Q2A*  E1A*  E2A*/DA 


HOLD  IF  NOT  El 
HOLD  IF  NOT  E2 
STEP  1   IF  SET 

LOAD  /Q2A  IF  COUNTER-CLOCKWISE 
LOAD     Q2A  IF  CLOCKWISE 


Q1A  =  /QOA 

Q1A.TRST  =  VCC 

Q2A   :=  Q2A*/E1A 

+     Q2A*  /E2A 

+      SA   *    E1A*  E2A 

+     QOA*   E1A*   E2A*  DA 

+   /QOA*   E1A*  E2A*/DA 

Q3A  =  /Q2A 

Q3A.TRST  =  VCC 

QOB    :=  Q0B*/E1B 

+  QOB*  /E2B 

+  SB  *   E1B*  E2B 

+  /Q2B*   E1B*   E2B*  DB 

+  Q2B*   E1B*  E2B*/DB 


Q1B   =  /QOB 
Q1B.TRST 


VCC 


Q2B 


:=  Q2B*/E1B 
+     Q2B*  /E2B 
+     SB  *  E1B*  E2B 
+     QOB*   E1B*   E2B*  DB 
+   /QOB*   E1B*  E2B*/DB 


Q3B  =  /Q2B 

Q3B.TRST  =  VCC 

SIMULATION 

SETF  OC  E1A  E2A  SA  E1B  E2B  SB 
CLOCKF  CLK 


; HOLD  IF  NOT  El 

; HOLD  IF  NOT  E2 

;STEP  1   IF  SET 

; LOAD  QOA   IF  COUNTER-CLOCKWISE 

; LOAD  /QOA  IF  CLOCKWISE 


HOLD  IF  NOT  El 
HOLD  IF  NOT  E2 
STEP  1   IF  SET 

LOAD  /Q2B  IF  COUNTER-CLOCKWISE 
LOAD     Q2B   IF  CLOCKWISE 


HOLD  IF  NOT  El 
HOLD  IF  NOT  E2 
STEP  1   IF  SET 

LOAD  QOB  IF  COUNTER-CLOCKWISE 
LOAD  /QOB  IF  CLOCKWISE 
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CHECK  QOA  /Q1A  Q2A  /Q3A  QOB  /Q1B  Q2B  /Q3B 

SETF  /E1A  /E2A  /E1B  /E2B 
CLOCKF  CLK 

CHECK  QOA  /Q1A  Q2A  /Q3A  QOB  /Q1B  Q2B  /Q3B 

SETF  E1A  E2A  /SA  /DA  E1B  E2B  /SB  DB 
CLOCKF  CLK 

CHECK  QOA  /Q1A  /Q2A  Q3A  /QOB  Q1B  Q2B  /Q3B 
CLOCKF  CLK 

CHECK  /QOA  Q1A  /Q2A  Q3A  /QOB  Q1B  /Q2B  Q3B 
CLOCKF  CLK 

CHECK  /QOA  Q1A  Q2A  /Q3A  QOB  /Q1B  /Q2B  Q3B 
CLOCKF  CLK 

CHECK  QOA  /Q1A  Q2A  /Q3A  QOB  /Q1B  Q2B  /Q3B 
CLOCKF  CLK 

CHECK  QOA  /Q1A  /Q2A  Q3A  /QOB  Q1B  Q2B  /Q3B 
CLOCKF  CLK 

CHECK  /QOA  Q1A  /Q2A  Q3A  /QOB  Q1B  /Q2B  Q3B 

SETF   DA  /DB 
CLOCKF  CLK 

CHECK  QOA  /Q1A  /Q2A  Q3A  /QOB  Q1B  Q2B  /Q3B 

SETF  /E1A  /E1B 
CLOCKF  CLK 

CHECK  QOA  /Q1A  /Q2A  Q3A  /QOB  Q1B  Q2B  /Q3B 

SETF   E1A  /E2A  E1B  /E2B 
CLOCKF  CLK 

CHECK  QOA  /Q1A  /Q2A  Q3A  /QOB  Q1B  Q2B  /Q3B 

SETF  SA  SB 
CLOCKF  CLK 

CHECK  QOA  /Q1A  /Q2A  Q3A  /QOB  Q1B  Q2B  /Q3B 
; DESCRIPTION 

;THIS  PAL16R4  PROVIDES  THE  LOGIC  LEVELS  REQUIRED  TO  DRIVE  TWO 
; STEPPER  MOTORS   IN  THE  FULL   STEP  MODE. 


THE  FOLLOWING  OPERATIONS  MAY  BE  PERFORMED  FOR  EACH  STEPPER 
MOTOR  CONTROLLER  INDIVIDUALLY: 


CLK 

/El 

/E2 

S 

D  • 

OPERATION 

X 

H 

X 

X 

X 

HOLD  MOTOR  IN  CURRENT  POSITION 

X 

X 

H 

X 

X 

HOLD  MOTOR   IN  CURRENT  POSITION 

c 

L 

L 

H 

X 

SET  OUTPUTS  TO  STEP  1  LEVELS 

c 

L 

L 

L 

L 

STEP  MOTOR  CLOCKWISE 

c 

L 

L 

L 

H 

STEP  MOTOR  COUNTER-CLOCKWISE 

LRU 

REVISION 
AUTHOR 
COMPANY 
DATE 


1.00 

COL I / SACKETTE 
MONOLITHIC  MEMORIES 
12/07/82 


CHIP  DUAL  SMC  PAL16R8 


CLK 
/OC 


/ENA  SETA  ROTA  MODEA  /ENB  SETB  ROTB  MODEB  GND 
/SW1B  / SW2B  /SW3B  /SW4B  /SW4A  /SW3A  /SW2A  /SW1A  VCC 


EQUATIONS 


SW1A 


=  SW1A 

+  /SW2A 
+  SW3A 
+  /SW2A*/SW3A 
+ 


*/ENA 

*/SW4A*  ENA*  ROTA*  MODEA 

*  ENA*   ROTA* /MODEA 

*  ENA* /ROTA*  MODEA 
SW4A*   ENA* /ROTA* /MODEA 

ENA  *  SETA 


HOLD   SW1A: DISABLE 
HALF-STEP  MTR  CW 
FULL-STEP  MTR  CW 
HALF-STEP  MTR  CCW 
FULL-STEP  MTR  CCW 
SET  A  TO  STEP  1 


SW2A 


SW3A 


+/SW1A 
+ 

+/SW1A 
+ 


+/SW1A 

+ 

+ 

+  SW1A 
+ 


SW2A  */ENA 

*/SW3A  *   ENA*   ROTA*   MODEA* /SETA 

SW4A*   ENA*  ROTA*/MODEA*/SETA 
*/SW4A*   ENA*/ROTA*  MODEA*/SETA 
SW3A  *   ENA* /ROTA* /MODEA* /SETA 


SW3A  */ENA 

*/SW4A*   ENA*   ROTA*  MODEA 
SW2A  *   ENA*   ROTA* /MODEA 

/SW2A  */SW4A*   ENA*/ROTA*  MODEA 

*    ENA* /ROTA* /MODEA 


ENA 


*  SETA 


SW4A:= 

+  /SW2A*/SW3A 
+  SW1A 

+/SW1A  */SW3A 
+  SW2A 


SW1B: 


SW4A*/ENA 

*  ENA*   ROTA*   MODEA* /SETA 

*  ENA*   ROTA* /MODEA* /SETA 

*  ENA* /ROTA*  MODEA*/SETA 

*  ENA* /ROTA* /MODEA* /SETA 


=   SW1B  */ENB 
+  /SW2B  */SW4B*   ENB*   ROTB*  MODEB 

+  SW3B  *   ENB*   ROTB* /MODEB 

+  /SW2B*/SW3B  *  ENB*/ROTB*  MODEB 

+  SW4B*   ENB* /ROTB* /MODEB 

+  ENB  * 


SETB 


SW2B : 


SW2B  */ENB 
+/SW1B  */SW3B  *   ENB*   ROTB*   MODEB* /SETB 

+  SW4B*   ENB*  ROTB*/MODEB*/SETB 

+/SW1B  */SW4B*   ENB*/ROTB*  MODEB*/SETB 

+  SW3B  *   ENB* /ROTB* /MODEB* /SETB 


; HOLD  SW2A: DISABLE 
; HALF-STEP  MTR  CW 
; FULL-STEP  MTR  CW 
:  HALF-STEP  MTR  CCW 
;  FULL-STEP  MTR  CCW 

HOLD  SW3A: DISABLE 
HALF-STEP  MTR  CW 
FULL-STEP  MTR  CW 
HALF-STEP  MTR  CCW 
FULL-STEP  MTR  CCW 
SET  A  TO  STEP  1 

HOLD  SW4A: DISABLE 
HALF-STEP  MTR  CW 
FULL-STEP  MTR  CW 
HALF-STEP  MTR  CCW 
FULL-STEP  MTR  CCW 

HOLD  SW1B: DISABLE 
HALF-STEP  MTR  CW 
FULL-STEP  MTR  CW 
HALF-STEP  MTR  CCW 
FULL-STEP  MTR  CCW 
SET  B  TO  STEP  1 

HOLD  SW2B: DISABLE 
HALF-STEP  MTR  CW 
FULL-STEP  MTR  CW 
HALF-STEP  MTR  CCW 
FULL-STEP  MTR  CCW 
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SW3B:=  SW3B  */ENB  ; HOLD  SW3B: DISABLE 

+/SW1B  */SW4B*   ENB*  ROTB*  MODEB  ; HALF-STEP  MTR  CW 

+  SW2B  *   ENB*   ROTB* /MODEB  ; FULL-STEP  MTR  CW 

+  /SW2B  */SW4B*  ENB*/ROTB*  MODEB  ; HALF-STEP  MTR  CCW 

+  SW1B  *  ENB* /ROTB* /MODEB  ; FULL-STEP  MTR  CCW 

+  ENB  *   SETB  ;SET  B  TO  STEP  1 

SW4B:=  SW4B*/ENB  ; HOLD  SW4B : DISABLE 

+  /SW2B*/SW3B  *  ENB*  ROTB*  MODEB*/SETB  ; HALF-STEP  MTR  CW 

+   SW1B  *  ENB*  ROTB* /MODEB* /SETB  ; FULL-STEP  MTR  CW 

+/SW1B  */SW3B  *   ENB*/ROTB*  MODEB* /SETB  ; HALF-STEP  MTR  CCW 

+  SW2B  *   ENB* /ROTB* /MODEB* /SETB  ; FULL-STEP  MTR  CCW 


SIMULATION 

SETF  OC  ENA  SETA  ENB  SETB 
CLOCKF  CLK 

CHECK  SW1A  /SW2A  SW3A  /SW4A  SW1B  /SW2B  SW3B  /SW4B 

SETF  /SETA  ROTA  MODEA  /SETB  /ROTB  MODEB 
CLOCKF  CLK 

CHECK  SW1A  /SW2A  /SW3A  / SW4A  /SW1B  /SW2B  SW3B  /SW4B 
CLOCKF  CLK 

CHECK  SW1A  /SW2A  /SW3A  SW4A  /SW1B  SW2B  SW3B  /SW4B 

SETF  /ENA  /ENB 
CLOCKF  CLK 

CHECK  SW1A  /SW2A  /SW3A  SW4A  /SW1B  SW2B  SW3B  /SW4B 

SETF  ENA  ENB 
CLOCKF  CLK 

CHECK  /SW1A  /SW2A  /SW3A  SW4A  /SW1B  SW2B  /SW3B  / SW4B 
CLOCKF  CLK 

CHECK  /SW1A  SW2A  /SW3A  SW4A  /SW1B  SW2B  /SW3B  SW4B 
CLOCKF  CLK 

CHECK  /SW1A  SW2A  / SW3A  /SW4A  /SW1B  /SW2B  /SW3B  SW4B 
CLOCKF  CLK 

CHECK  /SW1A  SW2A  SW3A  /SW4A  SW1B  /SW2B  /SW3B  SW4B 
CLOCKF  CLK 

CHECK  /SW1A  /SW2A  SW3A  /SW4A  SW1B  / SW2B  /SW3B  /SW4B 
CLOCKF  CLK 

CHECK  SW1A  /SW2A  SW3A  /SW4A  SW1B  /SW2B  SW3B  /SW4B 

SETF  /ENA  /ROTA  /MODEA  /ENB  ROTB  /MODEB 
CLOCKF  CLK 

CHECK  SW1A  /SW2A  SW3A  /SW4A  SW1B  /SW2B  SW3B  /SW4B 
SETF  ENA  ENB 
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CHECK 


CLOCKF  CLK 


/SW1A  SW2A  SW3A  /SW4A  SW1B  / SW2B  /SW3B  SW4B 


CLOCKF  CLK 
CHECK 


/SW1A  SW2A  /SW3A  SW4A  /SW1B  SW2B  / SW3B  SW4B 


CLOCKF  CLK 

CHECK  SW1A  /SW2A  /SW3A  SW4A  /SW1B  SW2B  SW3B  /SW4B 
CLOCKF  CLK 

CHECK  SW1A  /SW2A  SW3A  /SW4A  SW1B  /SW2B  SW3B  /SW4B 


DESCRIPTION 


CLK 

/OC 

/EN 

SET 

ROT 

MODE 

SW1-SW4 

COMMENTS 

X 

H 

X 

X 

X 

X 

z 

HI-Z 

C 

L 

H 

X 

X 

X 

HOLD 

HOLD  MOTOR  POSITION 

C 

L 

L 

H 

X 

X 

1 

SET  MOTOR  POSITION  TO  STEP  1 

C 

L 

L 

L 

H 

H 

SW  PLUS 

1 

HALF-STEP  MOTOR  CLOCKWISE 

C 

L 

L 

L 

H 

L 

SW  PLUS 

2 

FULL-STEP  MOTOR  CLOCKWISE 

C 

L 

L 

L 

L 

H 

SW  MINUS 

1 

HALF-STEP  MOTOR  COUNTERCLOCKWI 

C 

L 

L 

L 

L 

L 

SW  MINUS 

2 

FULL-STEP  MOTOR  COUNTERCLOCKWI 

ZXMonomMcMMemoriesa 
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The  trend  away  from  analog  systems  to  digital  numerical  control 
systems  focuses  new  attention  on  instruments  that  convert  the 
analog  output  of  position-sensing  devices  into  digital  format. 

The  recording  of  the  position  of  moveable  parts  of  a  machine 
requires  high  accuracy  and  noise  immunity  which  can  be 
attained  through  shaft  encoder  circuits  of  the  type  used  in  speed 
controllers  and  optical  position-sensing  devices. 


Principles  of  Shaft  Encoding 

Most  commonly  used  are  opto-electrical  encoders.  The  advan- 
tages of  this  type  of  circuit  are  direct  shaft-to-digital  encoding,  a 
minimum  number  of  power  supplies,  low  power  requirements, 
low  cost  and  high  speed. 


-ovCc 


CIRCUIT  WHICH  PRODUCES  MODULATED 
OUTPUT  SIGNALS  USING  LIGHT  SENSORS 
(PHOTO  DIODES) 


-  LIGHT 
-SENSORS 


Thus,  if  power  is  interrupted  an  incremental  encoder  must  have 
its  zero  position  reestablished. 

The  two  signals  from  the  light  sensors  have  to  be  amplified  and 
converted  into  digital  format  using  Schmitt  triggers. 


ANALOG- 


JT 

AMPLIFIER 

Optical  encoders  measure  shaft  rotation  by  detecting  the  light 
which  passes  through  a  rotating  code  disc  and  a  fixed  slit. 

It  consists  of  a  number  of  light  sources  and  sensors  whose  paths 
are  interrupted  by  a  disc  that  has  transparent  and  opaque  areas. 

By  using  various  concentric  patterns,  the  shaft  position  can  be 
defined.  Light  shines  through  the  disc  onto  sensors.  A  sensor 
turns  on  when  the  light  passes  through  its  corresponding  trans- 
parent part  of  the  disc  giving  an  output  that  depends  on  the 
opacity  of  the  segment.  The  combined  output  of  all  the  segments 
is  digital  information  representing  the  disc  and  the  shaft  position. 

—  An  absolute  encoder  has  a  number  of  concentric  tracks  on  the 
code  disc,  which  provide  parallel  readout  of  shaft  angle  with- 
out counting  pulses.  The  code  patterns  on  the  disc  are  a  kind 
of  storage. 

Thus,  the  readout  is  also  present  after  a  power  interruption. 

—  Incremental  encoders,  which  will  be  described  here,  have  a 
single  code  track  on  the  disc.  Angular  position  is  determined 
by  counting  pulses  produced  by  the  modulated  light  falling 
onto  the  photodetectors. 

Direction  sensing  is  obtained  by  the  use  of  quadrature  signals 
which  are  provided  by  appropriate  phasing  of  the  paths. 


The  two  signals  are  out  of  phase  by  a  quarter  of  a  period.  The 
direction  of  movement  can  be  determined  by  which  signal  leads 
or  lags  the  other. 

Out  of  this  phase  relationship  the  shaft  encoder  produces 
information  about  direction  and  speed  for  a  following  up/ 
down-counter. 

To  avoid  random  discrepancies  during  switching  operations 
shaft  encoders  should  be  built  using  synchronous  clocked 
circuitry. 

The  diagram  below  shows  a  typical  circuit  diagram  for  shaft 
encoding. 
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-DOWN 


2-558 


I  Monolithic 


Memories  I 


Shaft  Encoders 


The  phase  relationship  of  the  two  signals  PHIO  and  PHI90  is 
determined  by  the  four  registers,  and  their  outputs  are  encoded. 

The  underlying  principle  of  the  above  circuit  is  the  time  delay 
between  the  clock  pulses  for  the  two  signals  PHIO  and  PHI90. 

S1  =  PHIO  at  CLK  S2  =  PHIO  at  CLK  +  1 

S1  =  PHI90  at  CLK  S4  =  PHI90  at  CLK  +  1 

A  transition  in  PHIO  or  PHI90  from  L  to  H  or  vice  versa  causes  a 
change  in  the  data  stored  in  the  registers,  which  is  encoded  for 
direction  sensing. 

The  logic  equations  for  the  encoder  circuitry  are  given  below 
PHIO  leads  F 


PHIO 
PHI90 


i  PHI90: 

S1  *  S2 

•   S3  '  /S4 

+/S1  '  /S2 

•  /S3  *  S4 

+  S1  ■  /S2 

•  /S3  •  /S4 

+/S1  •  S2 

*   S3  •  S4 

PHI90: 

/S1  •  /S2 

■   S3  •  /S4 

+  S1  *  S2 

*  /S3  *  S4 

+  S1  *  /S2 

*   S3  *  S4 

+/S1  *  S2 

*  /S3  •  /S4 

 1  I  I  I  I  I 

I  I  I  I  I  I  I  I  I  I  I  I  I  I 

PHIO  LEADS  PHI90  BY  90° 

1             1             1            1             1  1 

DOWN 
=  HIGH 


J  L 


i  r 


TrijnjnjxnjiJ~Lnjnjn_nj- 


PHIO  LAGS  PHI90  BY  90° 


PHIO 
PHI90 
UP  =  HIGH 
DOWN 


j  1  r 


J  L 


1  I  L 


 mjijiJiJijiJijnji_rTJ-Lr 


When  the  shaft  rotates  in  a  clockwise  direction,  the  input  PHIO 
leads  the  input  PHI90  by  90°  and  the  logic  will  generate  pulses 
only  at  the  UP-output. 

On  the  other  hand,  when  the  shaft's  rotation  is  counterclockwise 


the  input  PHIO  lags  the  input  PHI90  by  90°.  In  this  case,  four 
pulses  per  square  wave  are  presented  at  the  DOWN-output.  Note 
that  both  the  UP  and  DOWN  outputs  of  the  counter  are  normally 
held  high. 

To  ensure  that  no  shaft  encoder  transition  is  missed,  the  clock 
frequency  should  be  at  least  8xNxS,  where  N  is  the  number  of 
pulses  produced  by  the  encoder  for  each  shaft  revolution  and  S 
is  the  maximum  speed  in  revolutions  per  second  to  be  expected. 
Most  commonly  CLK  frequencies  in  the  range  of  1  MHz  are  used 
for  optimum  circuit  operation. 

Synchronous  two-channel  shaft  encoders  as  described  above 
are  relatively  insensitive  to  encoder  phase  errors.  They  use  no 
temperamental  Mono-Flops  and  can  detect  any  illegal  transition 
states  generated  by  the  circuit. 

I  nterference  on  the  input  lines  is  ignored  if  it  occurs  between  two 
clock  cycles.  Random  noise  on  both  input  lines  results  in  one  UP 
count  for  each  DOWN  count  so  that  in  the  end  the  digital  infor- 
mation remains  unchanged.  Hence  synchronous  shaft  encoders 
are  extremely  useful  in  electrically  noisy  environments. 
The  PAL  Device  Applications  are  examples  of  such  circuits 
using  a  single  PAL  device  each. 
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PHI90- 

SET- 


SHAFT  ENCODER  #1 


PAL 
16R4 


74193 
UP/DOWN 
COUNTER 


Q0-O3 


SHAFT  ENCODER  #2 


PHIO- 
PHI90- 
X4- 
STf- 


PAL 
16R8 

COUNT 

74697 
UP/DOWN 
COUNTER 

U/D 

r 

Q0-Q3 


SHAFT  ENCODER  #3 
WITH  INTERNAL  4-BIT 
UP/DOWN  COUNTER 
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TITLE 
PATTERN 
REVISION 
AUTHOR 
COMPANY 
DATE 

CHIP  SE1  PAL16R4 


SHAFT  ENCODER  No.  1 
P7097 
1.00 

WILLY  VOLDAN 
MONOLITHIC  MEMORIES 
09/09/82 


CLK  PHIO  PHI90  X4  NC  NC  NC  NC  /SSET  GND 
/OC  DOWN  NC  S4   S3  S2  SI  NC  UP  VCC 


EQUATIONS 
/SI 

/S2 

/S3 


/S4 


/PHIO 
SSET 

/SI 
SSET 

/PHI90 
SSET 

/S3 
SSET 


; CHECK  FOR  PHIO 
; INITIALIZE  S1=L 

; CHECK  FOR  SI 
/INITIALIZE  S2=L 

; CHECK  FOR  PHI90 
; INITIALIZE  S3=L 

; CHECK  FOR  S3 
.•INITIALIZE  S4  =  L 


/DOWN  =     SI*  S2*  S3*/S4*  PHIO*  PHI90 
+  /Sl*/S2*/S3*  S4*/PHI0*/PHI90 
+     Sl*/S2*/S3*/S4*  PHI0*/PHI9O 
+  /SI*   S2*  S3*   S4*/PHI0*  PHI90 
DOWN . TRST  =  VCC 


PHIO  LEADS  PHI90 
PHIO  LEADS  PHI90 
PHIO  LEADS  PHI 90 
PHIO  LEADS  PHI90 


/UP       =  /Sl*/S2*  S3*/S4*/PHI0*  PHI90 
+     SI*   S2*/S3*   S4*  PHI0*/PHI90 
+     Sl*/S2*  S3*  S4*  PHIO*  PHI90 
+  /SI*  S2*/S3*/S4*/PHI0*/PHI90 
UP. TRST  =  VCC 


PHI90  LEADS  PHIO 
PHI90  LEADS  PHIO 
PHI 90  LEADS  PHIO 
PHI90   LEADS  PHIO 


SIMULATION 

SETF  OC  SSET 
CLOCKF  CLK 

CHECK  /S4   /S3   /S2   /SI   UP  DOWN 

SETF  /SSET  /PHIO  /PHI90 
CLOCKF  CLK 

CHECK  /S4  /S3  /S2  /SI  UP  DOWN 
CLOCKF  CLK 

CHECK  /S4  /S3  /S2  /SI  UP  DOWN 

SETF  PHI90 
CLOCKF  CLK 
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CHECK  /S4  S3  /S2  /SI  /UP  DOWN 
CLOCKF  CLK 

CHECK  S4   S3  /S2  /SI  UP  DOWN 

SETF  PHIO 
CLOCKF  CLK 

CHECK  S4   S3  /S2  SI  /UP  DOWN 
CLOCKF  CLK 

CHECK  S4   S3  S2  SI  UP  DOWN 

SETF  /PHI90 
CLOCKF  CLK 

CHECK  S4  /S3   S2  SI  /UP  DOWN 
CLOCKF  CLK 

CHECK  /S4  /S3  S2   SI  UP  DOWN 
SETF  /PHIO 


CLO 


CKF  CLK 


CHECK  /S4  /S3  S2  /SI  /UP  DOWN 
CLOCKF  CLK 

CHECK  /S4  /S3  /S2  /SI  UP  DOWN 

SETF  PHI90 
CLOCKF  CLK 

CHECK  /S4  S3  /S2  /SI  /UP  DOWN 
CLOCKF  CLK 

CHECK  S4   S3  /S2  /SI  UP  DOWN 

SETF  PHIO 
CLOCKF  CLK 

CHECK  S4   S3  /S2   SI  /UP  DOWN 
CLOCKF  CLK 

CHECK   S4   S3   S2   SI   UP  DOWN 

SETF  /PHI90 
CLOCKF  CLK 

CHECK  S4  /S3   S2   SI  /UP  DOWN 
CLOCKF  CLK 

CHECK  /S4  /S3   S2  SI  UP  DOWN 

SETF  /PHIO 
CLOCKF  CLK 

CHECK  /S4  /S3  S2  /SI  /UP  DOWN 
CLOCKF  CLK 

CHECK  /S4  /S3  /S2  /SI  UP  DOWN 
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SETF  PHI  9  0 
CLOCKF  CLK 

CHECK  /S4  S3  /S2  /SI  /UP  DOWN 
CLOCKF  CLK 

CHECK  S4   S3  /S2  /SI  UP  DOWN 

SETF  PHIO 
CLOCKF  CLK 

CHECK  S4  S3  /S2  SI  /UP  DOWN 
CLOCKF  CLK 

CHECK  S4   S3   S2  SI  UP  DOWN 

SETF  /PHI90 
CLOCKF  CLK 

CHECK  S4  /S3   S2  SI  /UP  DOWN 
CLOCKF  CLK 

CHECK  /S4  /S3   S2   SI  UP  DOWN 

SETF  /PHIO 
CLOCKF  CLK 

CHECK  /S4  /S3  S2  /SI  /UP  DOWN 
CLOCKF  CLK 

CHECK  /S4  /S3  /S2  /SI  UP  DOWN 

SETF  SSET 
CLOCKF  CLK 

CHECK  /S4  /S3  /S2  /SI  UP  DOWN 

SETF  /SSET 
CLOCKF  CLK 

CHECK  /S4  /S3  /S2  /SI  UP  DOWN 
CLOCKF  CLK 

CHECK  /S4  /S3  /S2  /SI  UP  DOWN 

SETF  PHIO 
CLOCKF  CLK 

CHECK  /S4  /S3  /S2  SI  UP  /DOWN 
CLOCKF  CLK 

CHECK  /S4  /S3  S2  SI  UP  DOWN 

SETF  PHI90 
CLOCKF  CLK 

CHECK  /S4  S3   S2  SI  UP  /DOWN 
CLOCKF  CLK 

CHECK  S4  S3  S2  SI  UP  DOWN 
SETF  /PHIO 
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CLOCKF  CLK 

CHECK  S4   S3   S2  /SI  UP  /DOWN 
CLOCKF  CLK 

CHECK  S4   S3  /S2  /SI  UP  DOWN 

SETF  /PHI90 
CLOCKF  CLK 

CHECK  S4  /S3  /S2  /SI  UP  /DOWN 
CLOCKF  CLK 

CHECK  /S4  /S3  /S2  /SI  UP  DOWN 

SETF  PHIO 
CLOCKF  CLK 

CHECK  /S4  /S3  /S2  SI  UP  /DOWN 
CLOCKF  CLK 

CHECK  /S4  /S3  S2  SI  UP  DOWN 

SETF  PHI90 
CLOCKF  CLK 

CHECK  /S4   S3  S2  SI  UP  /DOWN 
CLOCKF  CLK 

CHECK  S4   S3   S2  SI  UP  DOWN 

SETF  /PHIO 
CLOCKF  CLK 

CHECK  S4   S3   S2  /SI  UP  /DOWN 
CLOCKF  CLK 

CHECK  S4   S3  /S2  /SI  UP  DOWN 

SETF  /PHI90 
CLOCKF  CLK 

CHECK  S4  /S3  /S2  /SI  UP  /DOWN 
CLOCKF  CLK 

CHECK  /S4  /S3  /S2  /SI  UP  DOWN 

SETF  PHIO 
CLOCKF  CLK 

CHECK  /S4  /S3  /S2  SI  UP  /DOWN 
CLOCKF  CLK 

CHECK  /S4  /S3   S2   SI  UP  DOWN 

SETF  PHI90 
CLOCKF  CLK 

CHECK  /S4  S3   S2   SI  UP  /DOWN 
CLOCKF  CLK 

CHECK  S4   S3  S2  SI  UP  DOWN 
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SETP  /PHIO 
CLOCKF  CLK 

CHECK  S4   S3   S2  /SI  UP  /DOWN 
CLOCKF  CLK 

CHECK  S4  S3  /S2  /SI  UP  DOWN 

SETF  /PHI90 
CLOCKF  CLK 

CHECK  S4  /S3  /S2  /SI  UP  /DOWN 
CLOCKF  CLK 

CHECK  /S4  /S3  /S2  /SI  UP  DOWN 
; DESCRIPTION : 

;THIS  PAL16R4   IMPLEMENTS  A  TWO  CHANNEL  SHAFT  ENCODER  OF  THE 
;  TYPE  USED  IN  SPEED  CONTROLLERS  AND  OPTICAL  DEVICES. 

•BOTH  THE   "UP"   AND   "DOWN"   OUTPUTS  OF  THE  PAL  DEVICE  ARE 
; NORMALLY  HIGH. 

; WHEN  THE   SIGNAL  AT  THE   "PHIO"    INPUT  LEADS  THE  SIGNAL  AT  THE 
;"PHI90"    INPUT,    THE   "DOWN"   OUTPUT  ALTERNATES  BETWEEN  HIGH  AND 
;LOW  LEVELS  AT  HALF  THE   "CLK"   FREQUENCY  RATE.      ALSO,    WHEN  THE 
,-SIGNAL  AT  THE   "PHIO"    INPUT  LAGS  THE  SIGNAL  AT  THE  "PHI90" 
; INPUT,   THE   "UP"  OUTPUT  ALTERNATES  BETWEEN  HIGH  AND  LOW 
; LEVELS  AT  HALF  THE   "CLK"   FREQUENCY  RATE. 

;  THE  SHAFT  ENCODER  FEATURES  THE  CONFIGURATION  AND  OUTPUT 
,-POLARITY  TO  DRIVE  AN  74S193  TYPE  UP/ DOWN  COUNTER. 

;THIS  DESIGN  WITH  GLITCH-FREE  OUTPUTS  WILL  BE  EXTREMELY 
; USEFUL   IN  ELECTRICALLY   NOISY  ENVIRONMENTS.      THE  PINOUT  IS 
; GIVEN  AS  A  FIRST  PROPOSAL  AND  CAN  BE  CHANGED  ACCORDING  TO 
; THE  PC  BOARD  LAYOUT. 
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TITLE 
PATTERN 
REVISION 
AUTHOR 
COMPANY 
DATE 

CHIP  SE2  PAL16R8 


SHAFT  ENCODER  No.  2 
P7098 
1.00 

WILLY  VOLDAN 
MONOLITHIC  MEMORIES 
09/09/82 


CLK  PHIO  PHI90  X4  NC  NC  NC  NC  /SSET  GND 
/OC  UD  NC  S4  S3  S2  SI  NC  COUNT  VCC 


EQUATIONS 


/SI 


/S3 


:=  /PHIO 
+  SSET 

:=  /PHI90 
+  SSET 


; CHECK  FOR  PHIO 
; INITIALIZE  S1=L 

; CHECK  FOR  PHI90 
; INITIALIZE  S3=L 


/S2 


/S4 


:=  SI 

+  SSET 

:=  S3 

+  SSET 


; CHECK  FOR  /SI 
; INITIALIZE  S2=L 

; CHECK  FOR  /S3 
; INITIALIZE  S4=L 


/COUNT   :=     SI*   S2*/S3*  S4 
+  /Sl*/S2*  S3*/S4 
+  /SI*  S2*/S3*/S4*  X4 
+     Sl*/S2*   S3*   S4*  X4 
+     SI*   S2*  S3*/S4 
+  /Sl*/S2*/S3*  S4 
+  /SI*   S2*  S3*  S4*  X4 
+     Sl*/S2*/S3*/S4*  X4 

/UD  :=  /SI*   S2*/S3*  S4 

+  /SI*  S2*  S3*  S4 
+  /SI*  S2*  S3*/S4 
+  SI*  S2*  S3*/S4 
+  Sl*/S2*  S3*/S4 
+  Sl*/S2*/S3*/S4 
+  Sl*/S2*/S3*  S4 
+  /Sl*/S2*/S3*  S4 


SIMULATION 

SETF  OC  SSET 
CLOCKF  CLK 

CHECK  /SI   /S2  /S3  /S4 

SETF  /SSET  /PHIO  /PHI90  /X4 
CLOCKF  CLK 

CHECK  /SI   S2  /S3  S4  COUNT  UD 


;THIS  OUTPUT  ALTERNATES 
; BETWEEN  HIGH  AND  LOW  WITH 
; HALF  OR  QUARTER  THE 
;CLK  FREQUENCY 


THIS  OUTPUT  DETERMINES 
IF  SIGNAL  PHIO  LEADS 
OR  LAGS  SIGNAL  PHI90 
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:  PHIO 
CLOCKF  CLK 

CHECK  SI   S2  /S3  S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  SI  /S2  /S3   S4  / COUNT  UD 

SETF  PHI90 
CLOCKF  CLK 

CHECK  SI  /S2  S3  S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  SI  /S2  S3  /S4  COUNT  UD 

SETF  /PHIO 
CLOCKF  CLK 

CHECK  /SI  /S2  S3  /S4  COUNT  /UD 
CLOCKF  CLK 


CLOCKF  CLK 

CHECK  /cl  S2  /S3  /S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  /SI   S2  /S3   S4  COUNT  UD 

SETF  PHIO 
CLOCKF  CLK 

CHECK  SI   S2  /S3   S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  SI  /S2  /S3   S4  /COUNT  UD 

SETF  PHI90 
CLOCKF  CLK 

CHECK  SI  /S2  S3  S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  SI  /S2   S3  /S4  COUNT  UD 


SETF  /PHIO 
CLOCKF  CLK 

CHECK  /SI  /S2  S3  /S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  /SI  S2   S3  /S4  /COUNT  UD 

SETF  /PHI90  X4 
CLOCKF  CLK 

CHECK  /SI  S2  /S3  /S4  COUNT  /UD 


/COUNT  UD 


SETF  /PHI90 
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CLOCKF  CLK 

CHECK  /SI   S2  /S3   S4  /COUNT  UD 

SETF  PHIO 
CLOCKF  CLK 

CHECK  SI   S2  /S3   S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  SI  /S2  /S3   S4  /COUNT  UD 

SETF  PHI90 
CLOCKF  CLK 

CHECK  SI  /S2   S3   S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  SI  /S2  S3  /S4  / COUNT  UD 

SETF  /PHIO 
CLOCKF  CLK 

CHECK  /SI   /S2  S3  /S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  /SI   S2   S3  /S4  /COUNT  UD 

SETF  /PHI90 
CLOCKF  CLK 

CHECK  /SI   S2  /S3  /S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  /SI   S2  /S3  S4  /COUNT  UD 

SETF  PHIO 
CLOCKF  CLK 

CHECK  SI   S2  /S3  S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  SI  /S2  /S3   S4  /COUNT  UD 

SETF  PHI90 
CLOCKF  CLK 

CHECK  SI  /S2  S3  S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  SI  /S2  S3  /S4  /COUNT  UD 

SETF  SSET 
CLOCKF  CLK 

CHECK  /SI  /S2  /S3  /S4  COUNT  /UD 

SETF  /SSET  /PHIO  /PHI90  /X4 
CLOCKF  CLK 

CHECK  /SI   S2  /S3  S4  COUNT  UD 
SETF  PHI90 
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CLOCKF  CLK 

CHECK  /SI   S2   S3  S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  /SI  S2   S3  /S4  COUNT  /UD 

SETF  PHIO 
CLOCKF  CLK 

CHECK  SI   S2   S3  /S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  SI  /S2   S3  /S4  /COUNT  /UD 

SETF  /PHI90 
CLOCKF  CLK 

CHECK  SI  /S2  /S3  /S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  SI  /S2  /S3   S4  COUNT  /UD 

SETF  /PHI 0 
CLOCKF  CLK 

CHECK  /SI  /S2  /S3  S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  /SI   S2  /S3  S4  /COUNT  /UD 

SETF  PHI90 
CLOCKF  CLK 

CHECK   /SI   S2   S3   S4   COUNT  /UD 
CLOCKF  CLK 

CHECK  /SI   S2   S3  /S4  COUNT  /UD 

SETF  PHIO 
CLOCKF  CLK 

CHECK  SI   S2   S3  /S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  SI  /S2   S3  /S4  /COUNT  /UD 

SETF  /PHI90 
CLOCKF  CLK 

CHECK  SI  /S2  /S3  /S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  SI  /S2  /S3   S4  COUNT  /UD 

SETF  /PHIO  X4 
CLOCKF  CLK 

CHECK  /SI  /S2  /S3  S4  COUNT  /UD 
CLOCKF  CLK 

CHECK   /SI   S2   /S3   S4   /COUNT  /UD 
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SETF  PHI90 
CLOCKF  CLK 

CHECK  /SI   S2   S3   S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  /SI  S2  S3  /S4  /COUNT  /UD 

SETF  PHIO 
CLOCKF  CLK 

CHECK  SI   S2   S3  /S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  SI  /S2   S3  /S4  /COUNT  /UD 

SETF  /PHI  90 
CLOCKF  CLK 

CHECK  SI  /S2  /S3  /S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  SI  /S2  /S3   S4  /COUNT  /UD 

SETF  /PHIO 
CLOCKF  CLK 

CHECK  /SI  /S2  /S3  S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  /SI  S2  /S3   S4  /COUNT  /UD 

SETF  PHI 90 
CLOCKF  CLK 

CHECK  /SI  S2  S3  S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  /SI  S2  S3  /S4  /COUNT  /UD 

SETF  PHIO 
CLOCKF  CLK 

CHECK  SI   S2  S3  /S4  COUNT  /UD 
CLOCKF  CLK 

CHECK  SI  /S2  S3  /S4  /COUNT  /UD 
; DESCRIPTION 

;THIS  PAL16R8   IMPLEMENTS  A  TWO  CHANNEL  SHAFT  ENCODER  OF  THE 
; TYPE  USED  IN  SPEED  CONTROLLERS  AND  OPTICAL  DEVICES. 

; THE  "COUNT"  OUTPUT  OF  THE  PAL  DEVICE  IS  NORMALLY  HIGH.  DURING 
; SHAFT  ENCODING  THIS  OUTPUT  ALTERNATES  BETWEEN  HIGH  AND  LOW. 

; INPUT  "X4"   SELECTS  BETWEEN  HALF   (X4=H)   OR  QUARTER   (X4=L)  CLK 
; FREQUENCY  OF  THE  "COUNTER"  OUTPUT. 
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OUTPUT   "UD"    DETERMINES  WHETHER  SIGNAL  PHIO   LEADS    (UD=H)  OR 
LAGS    (UD=L)    SIGNAL  PHI90. 

THE  SHAFT  ENCODER  FEATURES  THE  CONFIGURATION  AND  OUTPUT 
POLARITY  TO  DRIVE  A  74S697  TYPE  UP/DOWN  COUNTER. 

THIS  DESIGN  WITH  GLITCH-FREE  OUTPUTS  WILL  BE  EXTREMELY  USEFUL 
IN  ELECTRICALLY  NOISY  ENVIRONMENTS.     THE  PINNING  IS  GIVEN  AS 
A  FIRST  PROPOSAL  AND  CAN  BE  CHANGED  ACCORDING  TO  THE  PC-BOARD 
LAYOUT. 


Shaft  Encoders 


TITLE   SHAFT  ENCODER  No 
P7099 
1.00 

WILLY  VOLDAN 
MONOLITHIC  MEMORIES 
09/09/82 


PATTERN 
REVISION 
AUTHOR 
COMPANY 
DATE 

CHIP  SE3  PAL20X10 


3    (WITH   INTERNAL   4-BIT  UP/DOWN  COUNTER) 


CLK  PHIO  PHI90  X4  /LD  NC  D3  D2  Dl  DO  /SSET 
GND  /OC  DOWN  S4   S3  S2  SI  Q3  Q2  Ql  QC  UP  VCC 

EQUATIONS 

/SI    :=  /PHIO 
+  SSET 


/S2 


/S3 


=  /SI 

+  SSET 

=  /PHI90 

+  SSET 


/S4    :=  /S3 

+  SSET 


/DOWN 


/UP 


/Q0 


/Ql 


/Q2 


/Q3 


:=  SI*  S2*  S3*/S4*  PHIO*  PHI90*  X4 
+     /Sl*/S2*/S3*   S4*/PHI0*/PHI90*  X4 

:+:  Sl*/S2*/S3*/S4*  PHI0*/PHI90 
+     /SI*  S2*   S3*   S4*/PHI0*  PHI90 

:=  /Sl*/S2*  S3*/S4*/PHI0*  PHI90 
.+       SI*   S2*/S3*   S4*  PHI0*/PHI90 

:+:  Sl*/S2*  S3*  S4*  PHIO*  PHI90*  X4 
+     /SI*   S2*/S3*/S4*/PHI0*/PHI90*  X4 

:=     /SSET*  LD*/D0 

+  /SSET*/LD*/Q0 
:+:   /SSET*/LD*  UP*/DOWN 

+     /SSET*/LD*/UP*  DOWN 

:=     /SSET*  LD*/D1 

+  /SSET*/LD*/Q1 
:+:    /SSET*/LD*  UP*/DOWN*/Q0 

+     /SSET*/LD*/UP*  DOWN*  Q0 

:=     /SSET*  LD*/D2 

+  /SSET*/LD*/Q2 
:+:    /SSET*/LD*  UP*/DOWN*/Q0 */Ql 

+     /SSET*/LD*/UP*  DOWN*  Q0*  Ql 

:=     /SSET*  LD*/D3 
+  /SSET*/LD*/Q3 
:+:    /SSET*/LD*  UP*/DOWN*/QO*/Q1*/Q2 


; CHECK  FOR  PHIO 
; INITIALIZE  S1=L 

; CHECK  FOR  SI 

; INITIALIZE  S2=L 

; CHECK  FO^  PHI  90 
; INITIALIZE  S3=L 

; CHECK  FOR  S3 

; INITIALIZE  S4=L 

PHIO  LEADS  PHI90 :COUNT=F/2 

PHIO  LEADS  PHI90 :COUNT=F/2 

PHIO  LEADS  PHI90  :COrJNT=F/4 

PHIO  LEADS  PHI90:COUNT=F/4 

PHI90  LEADS  PHI 0 : COUNT=F/4 

PHI90  LEADS  PHIO : COUNT=F/4 

PHI  90  LEADS  PHI0:COUNT=F/2 

PHI90  LEADS  PHIO :COUNT=F/2 

LOAD  DO  (LSB) 
HOLD  Q0 
DECREMENT 
INCREMENT 

LOAD  Dl 
HOLD  Ql 
DECREMENT 
INCREMENT 

; LOAD  D2 
; HOLD  Q2 
; DECREMENT 
; INCREMENT 

; LOAD  D3  (MSB) 
; HOLD  Q3 
; DECREMENT 
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+     /SSET*/LD*/UP*  DOWN*  QO*  01*  02        ; INCREMENT 


SIMULATION 

SETF  OC  SSET  /PHIO  /PHI90 
CLOCKF  CLK 

CHECK  /Si  /S2  /S3  /S4  Q3  Q2  Ql  QO 

SETF  /SSET  LD  D3  /D2  Dl  /DO 

CLOCKF  CLK 

CHECK  Q3  /Q2  Ql  /QO 

SETF  /LD  /X4 
CLOCKF  CLK 

CHECK  /SI  /S2  /S3  /S4  UP  DOWN  Q3  /Q2  Ql  /QO 

SETF  PHIO 
CLOCKF  CLK 

CHECK  SI  /S2  /S3  /S4  UP  DOWN  Q3  /Q2  Ql  /QO 
CLOCKF  CLK 

CHECK  SI   S2  /S3  /S4  UP  /DOWN  Q3  /Q2  Ql  /QO 

SETF  PHI90 
CLOCKF  CLK 

CHECK  SI  S2  S3  /S4  UP  DOWN  03  /Q2  /Ql  QO 


CLOCKF  CLK 

CHECK  SI   S2   S3   S4  UP  DOWN  Q3  /Q2  /Ql  QO 

SETF  /PHIO 
CLOCKF  CLK 

CHECK  /SI  S2  S3  S4  UP  DOWN  Q3  /Q2  /Ql  QO 
CLOCKF  CLK 

CHECK  /SI  /S2  S3  S4  UP  /DOWN  Q3  /Q2  /Ql  QO 

SETF  /PHI 90 
CLOCKF  CLK 

CHECK  /SI  /S2  /S3   S4  UP  DOWN  Q3  /Q2  /Ql  /QO 
CLOCKF  CLK 

CHECK  /SI  /S2  /S3  /S4  UP  DOWN  Q3  /Q2  /Ql  /QO 

SETF  PHIO 
CLOCKF  CLK 

CHECK  SI  /S2  /S3  /S4  UP  DOWN  Q3  /Q2  /Ql  /QO 
CLOCKF  CLK 

CHECK  SI  S2  /S3  /S4  UP  /DOWN  Q3  /Q2  /Ql  /QO 

SETF  PHI90 
CLOCKF  CLK 


PAL20X10 
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CHECK  SI   S2   S3  /S4  UP  DOWN  /Q3  Q2  Ql  QO 
CLOCKF  CLK 

CHECK  SI  S2   S3   S4  UP  DOWN  /Q3  Q2  Ql  QO 

SETF  /PHIO 
CLOCKF  CLK 

CHECK  /SI   S2   S3   S4  UP  DOWN  /Q3  Q2  Ql  QO 
CLOCKF  CLK 

CHECK  /SI  /S2  S3  S4  UP  /DOWN  /Q3  Q2  Ql  QO 

SETF  X4  /PHI90 
CLOCKF  CLK 

CHECK  /SI  /S2  /S3  S4  UP  DOWN  /Q3  Q2  Ql  /QO 
CLOCKF  CLK 

CHECK  /SI  /S2  /S3  /S4  UP  /DOWN  /Q3  Q2  Ql  /QO 

SETF  PHIO 
CLOCKF  CLK 

CHECK  SI  /S2  /S3  /S4  UP  DOWN  /Q3  Q2  /Ql  QO 
CLOCKF  CLK 

CHECK  SI   S2  /S3  /S4  UP  /DOWN  /Q3  Q2  /Ql  QO 

SETF  PHI90 
CLOCKF  CLK 

CHECK  SI   S2   S3  /S4  UP  DOWN  /Q3  Q2  /Ql  /QO 
CLOCKF  CLK 

CHECK   SI   S2   S3   S4   UP  /DOWN  /Q3  Q2  /Ql  /QO 

SETF  /PHIO 
CLOCKF  CLK 

CHECK   /SI   S2   S3   S4   UP  DOWN  /Q3   /Q2  Ql  QO 
CLOCKF  CLK 

CHECK  /SI  /S2   S3  S4  UP  /DOWN  /Q3  /Q2  Ql  QO 

SETF  /PHI90 
CLOCKF  CLK 

CHECK  /SI  /S2  /S3   S4  UP  DOWN  /Q3  /Q2  Ql  /QO 
CLOCKF  CLK 

CHECK  /SI  /S2  /S3  /S4  UP  /DOWN  /Q3  /Q2  Ql  /QO 

SETF  PHIO 
CLOCKF  CLK 

CHECK   SI   /S2   /S3   /S4   UP  DOWN  /Q3   /Q2   /Ql  QO 
CLOCKF  CLK 

CHECK  SI  S2  /S3  /S4  UP  /DOWN  /Q3  /Q2  /Ql  QO 
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SETF  PHI90 
CLOCKF  CLK 

CHECK  SI  S2   S3  /S4  UP  DOWN  /Q3  /Q2  /Ql  /QO 
CLOCKF  CLK 

CHECK  SI   S2   S3   S4  UP  /DOWN  /Q3  /Q2  /Ql  /QO 

SETF  /PHIO 
CLOCKF  CLK 

CHECK  /SI   S2   S3   S4  UP  DOWN  Q3  Q2  Ql  QO 
CLOCKF  CLK 

CHECK  /SI  /S2   S3   S4  UP  /DOWN  Q3  Q2  Ql  QO 

SETF  /PHI90 
CLOCKF  CLK 

CHECK  /SI  /S2  /S3   S4  UP  DOWN  Q3  Q2  Ql  /QO 

SETF  SSET  /PHIO  /PHI90 
CLOCKF  CLK 

SETF  /SSET  LD  /D3  D2  /Dl  DO 

CLOCKF  CLK 

CHECK   /Q3  Q2   /Ql  QO 

SETF  /LD  /X4 
CLOCKF  CLK 

CHECK  /SI   /S2  /S3  /S4  UP  DOWN  /Q3  Q2  /Ql  QO 

SETF  PHI90 
CLOCKF  CLK 

CHECK  /SI  /S2   S3  /S4  UP  DOWN  /Q3  Q2  /Ql  QO 
CLOCKF  CLK 

CHECK  /SI  /S2   S3   S4  /UP  DOWN  /Q3  Q2  /Ql  QO 

SETF  PHIO 
CLOCKF  CLK 

CHECK   SI  /S2  S3   S4  UP  DOWN  /Q3  Q2  Ql  /QO 
CLOCKF  CLK 

CHECK  SI  S2   S3   S4  UP  DOWN  /Q3  Q2  Ql  /QO 

SETF  /PHI  9  0 
CLOCKF  CLK 

CHECK  SI   S2  /S3   S4  UP  DOWN  /Q3  Q2  Ql  /QO 
CLOCKF  CLK 

CHECK  SI   S2  /S3  /S4  /UP  DOWN  /Q3  Q2  Ql  /QO 

SETF  /PHIO 
CLOCKF  CLK 

CHECK  /SI   S2  /S3  /S4  UP  DOWN  /Q3  Q2  Ql  QO 
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CLOCKF  CLK 

CHECK  /SI  /S2  /S3  /S4  UP  DOWN  /Q3  Q2  Ql  QO 

SETF  PHI90 
CLOCKF  CLK 

CHECK  /SI  /S2   S3  /S4  UP  DOWN  /Q3  Q2  Ql  QO 
CLOCKF  CLK 

CHECK  /SI  /S2   S3  S4  /UP  DOWN  /Q3  Q2  Ql  QO 

SETF  PHIO 
CLOCKF  CLK 

CHECK  SI  /S2  S3  S4  UP  DOWN  Q3  /Q2  /Ql  /QO 
CLOCKF  CLK 

CHECK  SI   S2   S3   S4  UP  DOWN  Q3  /Q2  /Ql  /QO 

SETF  /PHI90 
CLOCKF  CLK 

CHECK  SI  S2  /S3   S4  UP  DOWN  Q3  /Q2  /Ql  /QO 
CLOCKF  CLK 

CHECK  SI   S2  /S3  /S4  /UP  DOWN  Q3  /Q2  /Ql  /QO 

SETF  X4  /PHIO 
CLOCKF  CLK 

CHECK  /SI   S2  /S3  /S4  UP  DOWN  Q3  /Q2  /Ql  QO 
CLOCKF  CLK 

CHECK  /SI  /S2  /S3  /S4  /UP  DOWN  Q3  /Q2  /Ql  QO 

SETF  PHI90 
CLOCKF  CLK 

CHECK  /SI  /S2  S3  /S4  UP  DOWN  Q3  /Q2  Ql  /QO 
CLOCKF  CLK 

CHECK  /SI   /S2   S3   S4   /UP  DOWN  Q3   /Q2  Ql  /QO 

SETF  PHIO 
CLOCKF  CLK 

CHECK  SI  /S2  S3  S4  UP  DOWN  Q3  /Q2  Ql  QO 
CLOCKF  CLK 

CHECK  SI   S2  S3  S4  /UP  DOWN  Q3  /Q2  Ql  QO 

SETF  /PHI90 
CLOCKF  CLK 

CHECK  SI   S2  /S3  S4  UP  DOWN  Q3  Q2  /Ql  /QO 
CLOCKF  CLK 

CHECK  SI  S2  /S3  /S4  /UP  DOWN  Q3  Q2  /Ql  /QO 

SETF  /PHIO 
CLOCKF  CLK 
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CHECK  /SI   S2  /S3  /S4  UP  DOWN  Q3  Q2  /Ql  QO 
CLOCKF  CLK 

CHECK  /SI  /S2  /S3  /S4  /UP  DOWN  Q3  Q2  /Ql  QO 

SETF  PHI 90 
CLOCKF  CLK 

CHECK  /SI  /S2  S3  /S4  UP  DOWN  Q3  Q2  Ql  /QO 
CLOCKF  CLK 

CHECK   /SI   /S2   S3   S4   /UP  DOWN  Q3  Q2  Ql  /QO 

SETF  PHIO 
CLOCKF  CLK 

CHECK  SI  /S2  S3   S4  UP  DOWN  Q3  Q2  Ql  QO 
CLOCKF  CLK 

CHECK  SI   S2  S3  S4  /UP  DOWN  Q3  Q2  Ql  QO 

SETF  /PHI90 
CLOCKF  CLK 

CHECK  SI   S2  /S3   S4  UP  DOWN  /Q3  /Q2  /Ql  /QO 
CLOCKF  CLK 

CHECK  SI   S2  /S3  /S4  /UP  DOWN  /Q3  /Q2  /Ql  /QO 

SETF  /PHIO 
CLOCKF  CLK 

CHECK   /SI   S2   /S3   /S4   UP  DOWN  /Q3   /Q2   /Ql  QO 

SETF  LD  /D3  /D2  /Dl  DO 

CLOCKF  CLK 

CHECK  /Q3  /Q2  /Ql  QO 

SETF  /D3   /D2   Dl  DO 

CLOCKF  CLK 

CHECK  /Q3  /Q2  Ql  QO 

SETF  /D3  D2  /Dl  DO 

CLOCKF  CLK 

CHECK  /Q3  Q2  /Ql  QO 

SETF  /D3  D2  Dl  DO 

CLOCKF  CLK 

CHECK  /Q3  Q2  Ql  QO 

SETF  D3  /D2  /Dl  DO 

CLOCKF  CLK 

CHECK  Q3  /Q2  /Ql  QO 

SETF  D3  /D2  Dl  DO 

CLOCKF  CLK 

CHECK  Q3  /Q2  Ql  QO 
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SETF  D3   D2   /Dl  DO 

CLOCKF  CLK 

CHECK  Q3  Q2  /Ql  QO 

SETF  D3  D2  Dl  DO 

CLOCKF  CLK 

CHECK  Q3  Q2  Ql  QO 

DESCRIPTION 

THIS   PAL20X10    IMPLEMENTS  A  TWO  CHANNEL  SHAFT  ENCODER  WITH  AN 
INTERNAL  4-BIT  UP/DOWN  COUNTER. 

BOTH  THE  "UP"  AND  "DOWN"  OUTPUTS  OF  THE  PAL  DEVICE  ARE  NORMALLY 
HIGH. 

WHEN  THE  SIGNAL  AT  THE   "PHIO"    INPUT  LEADS  THE  SIGNAL  AT  THE 
"PHI90"    INPUT,    THE   "DOWN"   OUTPUT  ALTERNATES  BETWEEN  HIGH  AND 
LOW  LEVELS  AND  THE  COUNTER  WILL  COUNT  DOWN.     WHEN  THE  SIGNAL 
AT  THE   "PHIO"    INPUT  LEADS  THE  SIGNAL  AT  THE   "PHI90"  INPUT, 
THE   "UP"   OUTPUT  ALTERNATES  BETWEEN  HIGH  AND  LOW  LEVELS  AND 
THE  COUNTER  WILL  COUNT  UP. 

INPUT   "X4"   SELECTS  BETWEEN  HALF   (X4=H)   OR  QUARTER    (X4=L)  CLK 
FREQUENCY  OF  THE  COUNTER  OUTPUTS. 

THE   INTERNAL   4-BIT  SYNCHRONOUS  COUNTER  HAS  COUNT  UP,    COUNT  DOWN 
CAPABILITIES.      ALSO,    THE  COUNTER  CAN  PARALLEL  LOAD  AND  HOLD  DATA 
INDEPENDENTLY  OF  THE  SHAFT  ENCODER  SECTION.      THE  REGISTERS  ARE 
SYNCHRONOUSLY   INITIALIZED  WHEN  /SSET   IS  HELD  LOW. 

THE  CONTROL  INPUTS  PROVIDE  THESE  OPERATIONS  WHICH  OCCUR 
SYNCHRONOUSLY  AT  THE  RISING  EDGE  OF  THE  CLOCK. 


£1  Monolithic  RHIl  Memories  £1  2-577 


Notes 


Military  Applications 


Monolithic  Memories/Advanced  Micro  Devices  is  a  major  partici- 
pant in  the  Command,  Communication  and  Control  (C3)  military 
marketplace.  Applications  include  programmable  obsolescence 
solutions,  surface  mount  designs,  radar  systems,  missile  guid- 
ance, aircraft  graphic  processors,  microprocessor  support  logic, 
state-machine  designs,  military  computer  hardware,  redundant 
solutionsfor  backup  systems,  and  reconfigurable  hardware  using 
Logic  Cell  Arrays. 

Programmable  products  offer  an  excellent  solution  to  military  ob- 
solescence difficulties.  The  dynamics  of  the  semiconductor  in- 
dustry has  shortened  product  life  cycles,  creating  a  sunset  tech- 
nology market.  Programmable  products  offer  a  quick,  low-cost 
replacement  for  obsolete  parts,  through  direct  pin  compatibility  or 
board  emulation. 

MMI/AMD  has  long  been  a  leader  in  military  bipolar  PAL,  PROM 
and  FIFO  products.  We  will  soon  be  introducing  military  zero- 
power  CMOS  PAL  devices  and  military  CMOS  Logic  Cell  Arrays. 
With  pin  and  function  compatible  PAL  products  offered  in  both 
bipolar  and  CMOS  technologies,  Monolithic  Memories  offers 


backup  systems  (redundant  solutions)  in  case  of  afailure  with  the 
original  design.  So,  whether  the  customer  needs  high-speed 
bipolar  or  low-power  CMOS,  we  will  be  the  only  company  offering 
both  technologies  for  the  military  marketplace. 

Surface  mount  technology  is  an  assembly  technique  when  circuit 
boards  are  assembled  with  components  bonded  to  metal  pads  on 
the  board  surface,  instead  of  being  inserted  into  through-holes 
like  conventional  Dual  In-line  packages  (DIP).  Surface  mount 
components  can  be  mounted  on  both  sides  of  a  circuit  board, 
doubling  the  functional  density.  This  lowers  cost  by  reducing  the 
number  or  size  of  boards  required.  The  military  need  for  smaller 
or  denser-packed  systems  make  surface  mounting  a  must.  We 
meet  this  challenge  offering  afull  line  of  products  in  surface  mount 
packaging,  including  Cerpack  (W)  and  Leadless  Chip  Carriers 
(LCC). 

For  more  information  on  military  products  and  applications,  see 
the  military  datasheet  section  (page  5-415). 
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AmPAL22V10 


22  Inputs 
10  Outputs 
8-16PT/Output 
Async.  Reset/Sync. 

Preset 
Register  Bypass 
Polarity 
Preload 
Three-State 
Security  Fuse 


PAL32VX10 


32  Inputs 
10  Outputs 
8-16  PT/Output 
Async.  Reset/Sync. 

Preset 
Sync.  Reset/ Async. 

Preset 
Dual  Feedback 
Registered  Inputs 
Register  Bypass 
J-K,  S-R,  DorT 

Rip  Flops 
Polarity 
Preload 
Three-State 
Security  Fuse 


M2064 


58  Configurable  l/Os 
64  Configurable  Logic 

Blocks  (CLBs) 
1-16  PT/CLB  Output 
1200  Gate  Equivalent 
TTL  or  CMOS  Input 

Levels 
Registered/Direct 

Inputs 
122  Registers 
On  Chip  Clock  Gen. 
Async.  Set/Res.  Inputs 
Three -State 
Security  Features 


M2018 

74  Configurable  l/Os 
1 00  Configurable  Logic 

Blocks  (CLBs) 
1-16  PT/CLB  Output 
1 800  Gate  Equivalent 
TTL  or  CMOS  Input 

Levels 
Registered/Direct 

Inputs 
1 74  Registers 
On  Chip  Clock  Gen. 
Asynch.  Set/Res.  Inputs 
Three-State 
Security  Features 


FUNCTIONALITY 


PAL16L8 
PAL16R8 
PAL16R6 
PAL16R4 


16  Inputs 
8  Outputs 
8  Product  Terms 
per  Output 
Three-State 
Security  Fuse 


PAL20L10 
PAL20X10 
PAL20X8 
PAL20X4 


20  Inputs 
10  Outputs 
4  PT/Output 
XOR 
Preload 
Three -State 
Security  Fuse 


PAL20L8 
PAL20R8 
PAL20R6 
PAL20R4 


20  Inputs 
8  Outputs 
8  PT/Output 
Three-State 
Security  Fuse 


PAL20RA10 


20  Inputs 
10  Outputs 
4  PT/Output 
Async.  Clock 
Async.  Set/Reset 
Register  Bypass 
Polarity 
Preload 
Three-State 
Security  Fuse 


PAL20S10 
PAL20RS10 
PAL20RS8 
PAL20RS4 


20  Inputs 
10  Outputs 
1-16  PT/Output 
PT  Sharing 
Polarity 
Preload 
Three-State 
Security  Fuse 


Note:  PT  -  Product  Term 


Figure  1.  Military  Configurable  Architectures 


Radiation  Hardness 


Spurred  on  by  America's  Strategic  Defense  Initiative,  military 
equipment  manufacturers  are  looking  for  radiation  hardened 
parts  to  put  into  strategic  weapon  and  space  systems.  It  has  been 
stated  that  some  level  of  radiation  tolerance  will  be  required  in  up 
to  50%  of  all  military  applications  by  1990.  Due  to  this  increased 
need  for  radiation-resistant  integrated  circuits,  MMI  and  AMD 
have  embarked  on  a  program  to  determine  what  radiation  dose 
rates  its  products  will  withstand,  and  to  provide  increasingly 
radiation-tolerant  products. 

The  Galileo  probe  to  Jupiter  was  bombarded  with  100,000  rads 
(Si)  within  a  one  hour  period.  In  contrast,  the  earth's  surface 
absorbs  0.1  rads  in  a  year.  Military  aircraft  must  withstand  a 
minimum  of  2,500  rads  total  dose.  A  nuclear  explosion 
emits  transient-  ionizing  radiation  of  at  least  109  rads  (Si)  per 
second.  Radiation  dosage  and  type  varies  greatly  with  the  envi- 
ronment. Table  1  shows  radiation  dosages  and  semiconductor 
tolerances. 

Radiation  effects  can  be  divided  into  two  major  categories; 
natural  space  effects  and  man-made  nuclear  effects. 

Space  radiation  consists  mainly  of  cosmic  rays  and  charged 
particles  in  radiation  belts  (Figure  1).  Cosmic  rays  from  the  sun, 
made  up  of  protons  and  nuclei,  cause  single  event  upsets  (SEU). 
A  SEU  occurs  when  a  high-energy  particle  passes  through  an  in- 
tegrated circuit,  changing  an  internal  logic  state. 

REFERENCES 


MAX.  SPACE  REQUIREMENT  - 

JUPITER  SATELLITE  PROBE  - 

STRATEGIC  WEAPONS  REQUIREMENT  - 
AIRCRAFT  REQUIREMENT  - 
90%  FATAL  HUMAN  EXPOSURE  - 

RADIATION  SICKNESS  - 

CANCER  TREATMENT  - 
MEDICAL  X-RAYS  - 

AIRLINE  TRAVEL  - 
EARTH  SURFACE  - 
POWER  PLANT  RELEASE  - 


Radiation  belts  around  planets,  such  as  the  Van  Allen  Belt  circling 
the  earth,  carry  trapped  protons  and  electrons.  These  charged 
particles  accumulate  overtime  in  CMOS  devices,  causing  thresh- 
old voltage  shifts.  In  oxide-isolated  bipolar  circuits  this  ionizing 
radiation  (total  dose)  will  cause  an  increase  in  leakage  current. 

A  nuclear  explosion  irradiates  a  large  neutron  fluence  and  high 
energy  gamma  rays  (Figure  2).  Neutron  radiation  damages  the 
semiconductor  lattice  structure,  creating  recombination  sites  for 
minority  carriers,  which  decreases  bipolar  transistor  current  gain. 
Neutrons  induce  no  change  to  CMOS  devices,  which  are 
majority-carrier  structures. 

Nanoseconds  after  a  nuclear  blast,  short,  high-intensity  electro- 
magnetic pulses  lash  out.  Difficult  to  stop,  these  gamma  rays 
generate  huge  numbers  of  holes  and  electrons,  causing  large 
currents,  destroying  both  bipolar  and  CMOS  circuits.  CMOS  is 
especially  susceptible  due  to  its  inherently  low  latch-up  sensitivity 
of  10a  rads  per  second. 

Emerging  technologies  and  process  improvements  will  make  the 
basic  military  hardness  requirements  for  semiconductor  devices 
a  reality  (Table  2).  Gallium  Arsenide  (GaAs)  products  now  meet 
a  no-upset  dose  rate  of  5  X  1 0"  rads  per  second  and  a  total  dose 
of  10"  rads.  While  junction-isolated  bipolar  devices  also  meet  in- 
herently high  dose  rates  of  up  to  1 0'°  rads/second  and  total  doses 
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GALLIUM  ARSENIDE 

10* 

JUNCTION  ISOLATED  BIPOLAR 
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OXIDE  ISOLATED  CMOS 

OXIDE  ISOLATED  BIPOLAR 

10« 

SILICON  ON  SAPPHIRE  (SOS) 
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Table  1.  Radiation  Perspectives 
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Natural 

Cosmic  Rays 

•  Protons  and  Nuclei 

•  Causes  Single  Event  Upsets  (SEUs) 

Charged  Particles 

•  Electrons  and  Protons  Found  in  Radiation  Belts 

•  Accumulated  Ionizing  Radiation  Over  Time  (Total  Dose) 
Measured  in  rads  (Si) 

•  Causes  Accumulation  of  Trapped  Charges  -»  Shifting 
Threshold  Voltages  in  CMOS 

•  Increases  Leakage  Currents  in  Bipolar 
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Figure  1.  Space  Effects 


Nuclear  Effects 
Man-Made 

Neutrons 

•  Neutron  Fluence  Measured  in  Neutrons  Per  Square 
Centimeter 

•  Damages  Lattice  Structure  -»  Decreases  Gain 

•  Affects  Bipolar  More  Than  CMOS 

Gamma  Rays 

•  Transient-Ionizing  Radiation  (Dose  Rate)  Measured  in 
rads  (Si)  Per  Second 

•  High  Energy  Photons  of  10s  rads  Per  Second 

•  Causes  Large  Photocurrents  ->  Increases  Supply  Current 

•  Results  in  Latch-Up  and  Permanent  Damage  from  Burn-Out 

•  Difficult  to  Shield  Against 

•  Ruins  CMOS 


438  04 


Figure  2.  Nuclear  Effects 
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of  up  to  1 07  rads,  radiation-hardened  CMOS  circuits  are  not  far 
behind  with  a  typical  dose  rate  of  1 0'°  rads/second  and  a  total 
dose  of  10s  rads  (Table  3). 

Process  changes  to  improve  the  radiation  tolerance  of  CMOS 
devices  are  not  without  penalties.  Guard  rings  reduce  latch-up 
sensitivity  and  leakage  paths,  but  decrease  circuit  density.  Thin- 
ner gate  oxides  increase  radiation  resistance  but  cause  reliabil- 
ity problems  due  to  hot  electron  effects.  A  dry  oxide  process 
produces  less  defects  but  increases  manufacturing  time.  Done 
correctly,  these  sacrifices  will  produce  a  marketable 
radiation-hardened  CMOS  process. 


At  Monolithic  Memories,  radiation  data  has  been  obtained  for 
neutron  fluence  and  dose  rate  effects.  Products  representing 
bipolar  junction  and  oxide-isolated  processes  passed  neutron 
irradiation  levels  of  up  to  10"  neutrons  per  square  centimeter 
(Table  4).  Dose  rate  data  obtained  on  the  junction-isolated 
products  showed  recovery  in  fifty  to  seventy  microseconds  from 
a  one  microsecond  pulse  of  2  X  1 0"  rads  (Si)  per  second  (Table 
5).  MMI  and  AMD  will  move  from  radiation  testing  towards  new 
radiation-tolerant  designs  and  processes  as  the  need  for  radia- 
tion-hardened products  increases. 

Space  Effects 
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PRODUCT 

DESCRIPTION 

PROCESS 

PASSED 
DATASHEET 
LIMITS  UP  TO 

(neutrons/cm2) 

PASSED 
FUNCTIONAL  UP  TO 

(neutrons/cm2) 

PAL16R4AM 

20  Pin,  30  ns 
PAL  Device 

Junction  Isolated 
Diffused  Bipolar 

4.5X10" 

1  x  10" 

PAL16R4BM 

20  Pin,  20  ns 
PAL  Device 

Shallow-Junction  Isolated 
Implanted  Bipolar 

4.5  x  10" 

1  x10" 

PAL16R4DM 

90.  Pin   1  ^  nc 
£U  rill,  IO  lla 

PAL  Device 

Bipolar 

4.5  X  1013 

1  X  10" 

PAL20R4AM 

24  Pin,  30  ns 
PAL  Device 

Junction  Isolated 
Diffused  Bipolar 

8x  10" 

1  x  10" 

PAL20RA10M 

24  Pin, 
Asynchronous 
PAL  Device 

Junction  Isolated 
Diffused  Bipolar 

9.6  X  10" 

1  x  10" 

PAL32R16M 

1500  Gate,  40  Pin 
MegaPAL™  Device 

Junction  Isolated 
Diffused  Bipolar 

1  X  10" 

1  x  10" 

53S1681 

16KPROM 

Junction  Isolated 
Diffused  Bipolar 

4.5  X  10" 

4x  10" 

53RA1681A 

16K  Registered 
PROM 

Junction  Isolated 
Diffused  Bipolar 

8x10" 

1  x  10" 

53S3281 

32K  PROM 

Junction  Isolated 
Diffused  Bipolar 

1  X  10" 

1  x  10" 

Table  4.  Neutron  Bombardment  Data 
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PRODUCT 

DESCRIPTION 

PROCESS 

DOSE  RATE 

(rads/second) 

PHOTOCURRENT 

(amps) 

PAL12H6M 

20  Pin  Small 
PAL  Device 

Junction  Isolated 
Diffused  Bipolar 

2.6  x  10'° 

4.2 

PAL14H4M 

20  Pin  Small 
PAL  Device 

Junction  Isolated 
Diffused  Bipolar 

2.1  x  10" 

3.9 

PAL10L8M 

20  Pin  Small 
PAL  Device 

Junction  Isolated 
Diffused  Bipolar 

2.5  x  1 0" 

4.2 

PAL12L6M 

20  Pin  Small 
PAL  Device 

Junction  Isolated 
Diffused  Bipolar 

2.2  X  1 0" 

3.8 

PAL14L8M 

24  Pin  Small 
PAL  Device 

Junction  Isolated 
Diffused  Bipolar 

1.2  x  10" 

3.6 

PAL16L6M 

20  Pin  Small 
PAL  Device 

Junction  Isolated 
Diffused  Bipolar 

2.3x10" 

4.0 

PAL16L8M 

20  Pin  Standard 
PAL  Device 

Junction  Isolated 
Diffused  Bipolar 

2.5  x  10" 

5.3 

PAL16R8M 

20  Pin  Standard 
PAL  Device 

Junction  Isolated 

1.9  x  10" 

5.3 

PAL16R6M 

20  Pin  Standard 
PAL  Device 

Junction  Isolated 
Diffused  Bipolar 

2.4  x  10" 

5.7 

PAL16R4M 

20  Pin  Standard 
PAL  Device 

Junction  Isolated 
Diffused  Bipolar 

2.1  x  10" 

5.6 

PAL20L10M 

24  Pin  Exclusive-OR 
PAL  Device 

Junction  Isolated 
Diffused  Bipolar 

2.1  X  10" 

6.7 

PAL20X10M 

24  Pin  Exclusive-OR 
PAL  Device 

Junction  Isolated 
Diffused  Bipolar 

1.9  x  10" 

5.9 

PAL20X8M 

24  Pin  Exclusive-OR 
PAL  Device 

Junction  Isolated 
Diffused  Bipolar 

2.1  X  10" 

5.8 

Table  5.  Transient  Upset  Test  Data 
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 DESIGN  APPLICATIONS  

Programmable  Logic  Device 
Preserves  Pins,  Product  Terms 

Chris  Jay 

Monolithic  Memories  Inc.,  2175  Mission  College  Blvd..  Santa  Clara,  CA  95054;  (408)  970-9700. 


Programmable-logic  devices  continue  to  evolve,  re- 
sponding to  designers'  demands  for  improved  per- 
formance, functionality,  and  flexibility.  Among  the 
newest  is  the  PAL32VX10,  a  programmable-array 
logic  (PAL)  device  that  enhances  the  architecture 
of  the  earlier  PAL22V 1 0.  The  result  is  a  more  flexi- 
ble general-purpose  device. 

As  a  functional  and  architectural  superset  of  the 
22V10,  the  32VX10  can  be  programmed  to  carry 
out  any  function  that  its  predecessor  can.  It  also 
adds  two  features:  regis- 


Buried  registers  and 
exclusive-OR  gates 
make  this  PAL  device 
a  sound  foundation 
on  which  to  build 
state  machines  that 
conserve  I/O  pins. 


ters  that  can  be  buried 
for  designing  state  ma- 
chines, and  exclusive- 
OR  (XOR)  gates  for 
counter  and  JK  func- 
tions (Electronic 
Design,  Jan.  8,  p.  45). 

The  number  of  inputs 
to  the  fuse  array  of  the 
latest  programmable- 
array  logic  (PAL)  de- 
vice has  been  increased  to  32  by  the  addition  of  ten 
buried  feedback  paths.  A  macrocell  output  pro- 
grammed as  an  input  accesses  the  chip's  feedback 
path,  which  is  buried  within  the  fuse  array. 

As  a  result,  buried  state  machines  can  be  config- 
ured in  the  device.  Any  logic  design  calling  for 
many  input  pins  can  be  configured  with  dedicated 
input  pins  and  any  of  the  of  macrocell  I/O  pins 
(Fig.  1 ).  This  option  leaves  pins  free  for  configuring 
outputs  as  inputs,  and  other  Boolean  functions  may 
be  realized  in  buried  registers.  Thus,  functionality 
of  the  macrocell  need  not  be  lost. 

The  macrocell's  XOR  gate  (denoted  by  the  X  in 
the  part  number)  simplifies  state-machine  design 
because  it  promotes  gate  product-term  economy. 
This  economy  is  the  ability  to  create  complex  state- 
machine  designs  without  exhausting  the  limited 
number  of  product  terms  in  each  macrocell. 

For  example,  there  are  ten  D-type  registers  in  the 
32VXI0,  one  in  each  of  its  ten  macrocells.  The 
XOR  input  to  each  register  and  no  more  than  three 
Reprinted  with  permission  from  Electronic  Design, 

Vol.  35,  No.  13  (May  28,  1987).  Copyright  1 987  Hayden  Publishing  Co.,  Inc. 


product  terms  transform  a  D-type  register  to  a  JK 
register.  Once  the  JK  register  is  configured,  subset 
functions  like  the  T  (toggle)  and  an  SR  (set-reset) 
may  be  derived. 

The  macrocell's  product-term  economy  can  be  a 
result  of  the  hold  state  inherently  defined  in  the  JK 
truth  table — a  function  not  offered  by  the  D-type 
register  alone  (see  the  table).  For  a  D-type  register 
to  hold  its  contents  in  a  "set"  state  during  several 
states,  additional  product  terms  must  hold  that 
condition  until  it  is  relaxed.  In  many  tasks,  apply- 
ing product  terms  as  holding  functions  could  ex- 
haust the  limited  supply  in  a  given  cell  and  compro- 
mise a  state-machine  design. 

A  practical  task  for  the  hold  function  is  in  a  line- 
sync  generator.  The  blanking  and  line-sync  pulses 
are  programmed  as  JK  functions,  and  a  hold  condi- 
tion is  invoked  for  their  active  duration. 

The  XOR  gate  also  influences  the  output  polari- 
ty of  a  logic  signal.  Active-high  or  -low  logic  signals 
may  be  created  at  the  output  pins.  In  PAL  devices 
without  polarity  control,  the  output  can  be  inverted 
by  applying  DeMorgan's  Theorem  to  a  sum-of- 
products  architecture.  This  approach,  however, 
brings  with  it  the  added  baggage  of  more  product 
terms.  For  example,  one  term  containing  N  literals 
(Boolean  variables)  is  inverted  by  creating  the  sum 
of  N  negated  literals,  with  each  negated  literal  us- 
ing a  product  term. 

Each  of  the  32VX10's  ten  macrocells  is  config- 
ured around  a  sum-of-products  and  XOR  input  to  a 
D-type  register,  which  is  clocked  from  a  dedicated 
input  pin.  Each  macrocell  can  be  programmed  for  a 
combinational  or  registered  output.  One  product 
term,  associated  with  the  control  to  bypass  the  reg- 
ister, selects  the  path  dynamically,  if  needed. 

Typically,  the  registers  are  programmed  as  a  re- 
fresh counter  for  dynamic  RAM.  When  a  host  mi- 
croprocessor must  access  the  RAM  for  read  and 
write  cycles,  its  address-line  outputs  bypass  the  reg- 
isters, which  contain  the  current  refresh  count  for 
the  R  AMs.  Instead,  the  outputs  address  the  memo- 
ry inputs  directly.  Afterwards,  the  address  outputs 
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from  the  registers  are  enabled  again  to  refresh  the  RAM. 

In  fast  video  applications,  a  similar  philosophy  is  in  ef- 
fect. The  host  microprocessor  bypasses  the  contents  of 
the  video-memory  address  counter  to  update  the  video 
RAM.  It  is  important,  though,  that  any  counter-incre- 
ment signal  be  disabled  when  the  registers  are  bypassed; 
an  active  clock  input  would  overwrite  any  contents  of  the 
D-type  registers.  In  a  video  RAM  system,  this  might  not 
be  a  disadvantage,  but  for  dynamic-RAM  arrays,  over- 
writing a  refresh  counter  could  lead  to  loss  of  data. 

One  product  term  might  also  dynamically  influence 
the  logic  signal's  output  polarity.  With  a  registered  out- 
put, a  dedicated  register-output-polarity  multiplexer  se- 
lects either  a  Q  or  Q  output,  routing  the  selected  signal  to 
the  output  pin.  One  programmable  polarity  fuse  selects 
the  required  path.  This  fuse  does  not  affect  the  program- 
mable polarity  associated  with  combinational  outputs. 

The  global  product  terms  from  the  fuse  array  are  dedi- 
cated to  resetting  the  ten  registers  synchronously  and  also 
presetting  them  asynchronously.  The  action  of  these  two 
product  terms  can  be  selected  by  the  programmable-po- 
larity multiplexer  for  individual  registered  cells. 

For  instance,  for  a  registered  output,  the  asynchro- 
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1.  Each  of  the  ten  macrocells  in  Monolithic  Memories'  PAL32VX10  PAL  device 
is  driven  by  an  exclusive-OR  gate,  a  feature  that  promotes  frugal  use  of 
product  terms.  For  example,  the  gates  reduce  the  number  of  terms  needed 
to  implement  JK  registers.  The  macrocells  can  also  feed  output  and  input 
signals  back  to  buried  registers.  As  a  result,  designing  state  machines  with 
the  device  is  easier. 


nous-preset  gate  can  be  programmed  to  a  synchronous- 
reset  gate  or  vice  versa.  The  single  product-term  input  to 
the  XOR  gate  can  select  the  output  polarity  for  either 
combinational  or  registered  outputs. 

The  second  added  feature  of  the  new  PAL  device,  its 
conversion  of  an  internal  D-type  register  to  a  JK-register 
cell,  is  demonstrated  by  five  Boolean  equations  formed 
from  a  sum  of  products: 

Q:=Q*J+Q*K(1) 
Q:=Q*J+Q*K(2) 
Q:=Q:  +  :Q*J+Q*K(3) 
Q:=Q:  +  :Q' J+Q*K(4) 
Q:=Q:  +  :Q*J+Q*K(5) 

where  the  Boolean  operators  are: 

for  an  inversion; 
:+ :  for  an  XOR  operation; 
+  for  an  OR  operation; 
*  for  an  AND  operation;  and 

:=  is  the  state-machine  operator  "becomes  equal  to" 

Note  that  all  five  Boolean  equations  have  true  or  com- 
plement conditions  for  either  J  or  K;  and  a  mixture  of  J 
and  K  or  J  and  K  functions. 
Three  of  the  equations  require 
the  XOR  function.  Moreover,  J 
and  K  active-high  conditions 
can  be  used  in  equation  3  only 
with  an  XOR  operator. 

The  Boolean  operators  are 
compatible  with  the  Palasm  as- 
sembler-support software.  Pa- 
lasm is  a  computer-aided 
(CAD)  tool  that  generates  pro- 
gramming information  for 
PAL  devices.  The  Boolean  ex- 
pression information  describes 
the  operation  of  the  device  to  be 
programmed. 

A  good  design  specification 
done  by  a  CAD  tool  like  Pa- 
lasm should  come  with  a  com- 
plete comment  field  to  describe 
the  function  of  the  Boolean 
equations.  Palasm  accepts  a 
comment  field  in  the  descrip- 
tion of  a  logic-design  file.  When 
a  semicolon  precedes  a  state- 
ment, Palasm  ignores  that 
statement  during  the  assembly 
process. 

The  functional  description  of 
the  logic  circuits  is  in  terms  of 
Boolean  statements.  A  later 
version  of  Palasm  will  accept 


Register 
polarity 
 ^control 
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state-machine  logic  descriptions  on  a  higher  level.  Then, 
designers  will  be  able  to  deal  with  logic  statements  con- 
taining JK  functions  without  describing  them  as  a  series 
of  a  sum  of  products,  as  is  now  the  case. 

The  ability  to  create  subsets  of  a  JK  register  gives  the 
logic  designer  more  flexibility  to  design  sequential-logic 
circuits.  One  of  the  subset  functions,  for  example,  is  the 
toggle  (T)  flip-flop,  which  lends  itself  to  counter  applica- 
tions. Binary  counters  require  that  the  contents  of  any 
general  register,  QN,  remain  in  a  hold  state  until  the  prod- 
uct term  of  all  the  lesser  significant  registers  goes  to  a  log- 
ic high.  Then  the  register  must  change  state  or  toggle  after 
the  arrival  of  the  next  clock  pulse.  The  T  function  may  be 
created  by  tying  the  J  and  K  inputs  together  and  treating 
this  composite  as  the  T  input.  A  hold  state  is  maintained 
when  T  =  0,  and  a  change  of  state  results  when  T  =  1 . 

The  application  of  the  toggle  function  to  binary 
counters  can  be  developed  by  considering  a  general  regis- 
ter QN  in  a  binary  count  sequence,  where  the  lesser  signifi- 
cant registers  are  QN.,,  QN.2,.  •  Qi,  Qo-  Using  equation  3, 
the  JK  description  for  a  general  register  is  given  as 

QN:=QN:+:K*QN+J'QN  (6) 

A  hold  condition  must  apply  when  the  product  term  of 
all  lesser  significant  registers  is  a  logic  zero,  or: 

Qn  VQn  V  •Q.*Qo  =  o 

and  a  toggle  condition  must  apply  when 

QNVQN_2*...Q,*Q0=1 
This  develops  to 

J  =  K  =  QNVQN_2*...Q,*Q0(7) 

which  represents  the  T  input  to  a  toggle  register.  Substi- 
tuting equation  7  into  6  leads  to 

QN:=QN:  +  :QN*(QN_,*QN_2...Q,'Q0) 
+  Qn 

*(Qn-,*Qn-2*--Q,*Qo)(8) 
Substituting  equation  7  into  6  for  J  and  K  inputs  leads 

Qn:=<V+:(Qn  +  Qn) 
*Qn  ,*Qn  2*-  •Q,*Qo(9) 

Because  (QN  +  Q^)  =  1 ,  equation  9  reduces  to 
Qn  :=  Qn  (holdQN) 

■+■  Qn-,  '  Qn-2  *•  ■  Qi  *  Qo  (toggle  QN)  (10) 

Equation  10  is  a  general  equation  for  a  register  in  a  bi- 
nary counter.  Only  two  product  terms  are  used  because 
the  XOR  gate  makes  it  possible  to  invoke  the  toggle  func- 
tion as  a  JK  subset.  Without  the  XOR  gate,  the  general 
equation  for  register  N  would  be 


QN:=Qn 

*QN_1*QN_2*...Q,*Q„(toggleQN) 
+  QN*QN-T(holdQN) 
+  Qn  *  Qn-2  (hold  QN) 


+  QN*Q,(holdQN) 

+  QN*Q0(holdQN)(ll) 

Equation  1 1  requires  N  product  terms  as  hold  func- 
tions. This  means  that  for  a  10-bit  counter,  the  most  sig- 
nificant register,  Q„  would  need  nine  individual  product 
terms  as  hold  functions  and  one  additional  product  term 
for  a  toggle  function. 
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2.  Thanks  to  its  easy  implementation  of  the  JK  func- 
tion and  its  subsets,  the  32VX10  makes  a  compact 
line-address  counter  and  line-sync  generator  for 
video  tasks. 
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In  contrast,  equation  10  makes  use  of  the  XOR  func- 
tion, requiring  one  product  term  to  hold  and  one  product 
term  to  toggle.  No  matter  how  significant  the  register  in 
the  count  sequence,  the  inclusion  of  an  XOR  function  in 
the  equation  eliminates  large  numbers  of  product  terms. 
The  advantage  of  the  XOR  gate  increases  for  a  10-bit,  up- 
down  counter  because  the  required  number  of  product 
terms  doubles.  Q,  would  normally  need  20  product  terms 
but  with  an  XOR  gate,  only  three  do  the  job. 

A  practical  outcome  of  configurable  JK  and  T  registers 
is  the  design  of  a  line-sync  generator  circuit  for  control- 
ling a  video  RAM,  and  for  synchronizing  line-display  in- 
formation to  a  CRT  interface  (Fig.  2).  Memory-address 
outputs  MA0  through  MA„,  which  are  derived  from  an 
internal  counter,  drive  the  address  inputs  to  the  video 
RAM.  One  complete  line  of  video  information  is  dis- 
played and  surrounded  by  line-sync  and  blanking  pulses 
from  two  macrocells  programmed  as  JK  registers. 

The  internal  memory-address  counter  is  programmed 
to  address  an  80-character  screen  and  then  issue  an  ac- 
tive-high blanking  pulse.  The  line-sync  pulse  goes  active- 
high  when  the  counter  reaches  a  decimal  count  of  79  (4F 
hex)  and  is  turned  off  at  a  decimal  count  of  103  (67  hex). 
The  blanking  pulse  is  negated  and  the  counter  returns  to 
zero  after  reaching  a  decimal  count  of  107  (6B  hex). 

The  JK  functions  for  turning  on  line-sync  and  blanking 
pulses  come  from  decoding  the  count  equations.  The 
blanking  pulse  overlaps  the  line-sync  pulse  to  give  front 
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and  back  porches  to  the  video  line-scan  signal.  Also,  be- 
cause the  host  shares  access  to  the  video  memory,  it  can 
update  video  information.  In  that  case,  a  Write  (WR)  is 
driven  low  to  produce  three-state  video-address  outputs. 

When  at  work,  the  microprocessor  can  change  the  con- 
tents of  the  video  RAM  during  frame-  or  line-signal  fly- 
back periods  to  avoid  displaying  random  data  during  ac- 
tive display  periods.  To  make  such  changes  possible,  the 
WR  input  can  be  derived  from  a  smaller  combinational 
PAL  device  that  decodes  the  microprocessor's  address 
lines  and  maps  the  video  RAM  into  its  address  space. 

An  asynchronous  reset  input  (RST)  powers  up  the 
PAL  to  a  known  state.  With  its  Count  Enable  Input 
(CEI)  and  Output  (CEO)  signals,  the  device  can  cascade 
with  other  circuitry,  such  as  a  frame-sync  generator  and  a 
video-shift  register. 

The  memory-address  counters  are  programmed  as  T 
flip-flops;  line-sync,  blanking  and  CEO  functions  as  JK 
registers.  An  Enable-State  Input  (ENST)  enables  input 
states  0  through  2  as  count  enable  inputs  in  preference  to 
the  CEI  signal.  When  in  the  low  state,  ENST  qualifies  in- 
put states  0  through  2  as  active-high  count-enable  inputs. 

To  complement  the  line-sync  generator  design,  the 
32VX10  is  programmed  as  a  7-bit  video-shift  generator. 
Seven  registers  shift  video  data  and  three  are  pro- 
grammed as  a  3-bit  counter  to  control  loading  and  shift- 
ing of  video  data  in  the  shift  register.  When  all  are  high, 
counter  output  states  0  through  2  cause  the  shift  register 
to  load  and  the  line-sync  generator  to  increment. 

For  the  example  circuit,  another  32VX10  is  pro- 
grammed to  support  the  line-sync  generator.  As  with  the 
other  32VX10s,  the  frame-sync  output  is  programmed  as 
a  JK  function.  The  internal  counter  accesses  row-charac- 
ter information  as  well  as  the  contents  of  the  video  RAM. 
The  32VX 10  video-shift  register  accepts  dot  information 
from  the  character  generator  and  supplies  a  parallel-to- 
serial  output  for  the  video  display.  Any  parallel-to-serial 
output  format  can  be  handled  by  the  32VX 10. 

Readers  interested  in  the  PAL  design  specifications 
may  contact  the  author  for  further  details. □ 
Chris  Jay,  logic-cell-array  applications  manager  with 
Monolithic  Memories,  has  been  with  the  company  since 
1983.  Three  years  were  spent  as  a  field-application  engi- 
neer in  Northern  Europe.  Before  that,  he  worked  for  Texas 
Instruments  in  Bedford,  England  and  Fairchild  Camera 
and  Instrument  in  Bristol.  England,  performing  logic  de- 
sign and  applications  engineering,  respectively.  Jay  earned 
a  BSc  (honors)  degree  from  Essex  University  in  Coulches- 
ter,  England  in  1977. 
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PLD  Programmability 
Extends  Its  Sway  Over 
Complex  I/O 


Electrically  erasable  CMOS  parts,  the 
densest  24-pin  SLIM  PALs  yet,  reach 
new  heights  in  architectural  flexibility 


Om  Agrawal,  Product  Planning  Manager 
Fares  Mubarak,  Senior  Test  Engineer 
Field-Programmable  Logic  Division 
Advanced  Micro  Devices,  Inc.,  Sunnyvale,  CA 


Two  electrically  erasable  PAL 
devices  being  introduced  here 
by  Advanced  Micro  Devices 
offer  by  far  the  greatest  flexibility  of 
any  programmable  array  logic 
(PAL)  device  to  date.  Several  fea- 
tures contribute  to  the  adaptability 
of  these  CMOS  ICs— the  highest  yet 
number  of  product  terms  (188  and 
178)  of  any  24-pin  PAL;  the  opti- 
mization of  one  device,  the  AmPAL- 
29M16,  for  synchronous  designs  and 
the  other,  the  AmPAL29MA16,  for 
asynchronous  and  glue  logic  applica- 
tions; and  the  convenience  of  some 
11,000  electrically  programmable 
and  erasable  interconnecting  cells 
(see  Figs.  1  and  2) . 

But  the  most  distinctive  enhance- 
ment is  the  programmable  input  and 
output  bidirectionality  of  each  de- 
vice's 16  I/O  logic  macrocells  (I/O- 
LMs).  Until  now,  some  program- 
mable logic  devices  (PLDs)  have 


had  output  macrocells,  but  none  has 
had  input  macrocells,  let  alone  the 
I/O  type.  The  I/O  pin  on  each  I/O- 
LM  can  in  addition  be  configured  as 
combinatorial,  as  an  edge-triggered 
register,  or  as  a  transparent  latch. 

What's  more,  on  8  of  the  16  I/O 
pins,  the  associated  8  register/ 
latches  can  serve  independently  of 
the  pins  as  programmable  buried 
state  registers.  Current  40-pin 
CMOS  EPLDs  have  only  half  that 
number.  These  internal  registers 
offer  designers  an  extra  eight  flip- 
flops  for  such  jobs  as  keeping  track 
of  flags  or  generating  timing  and  con- 
trol signals. 

The  11,000  or  so  interconnecting 
elements  are  electrically  erasable 
(E2)  cells  that  define  what  each 
PAL  device  can  do.  They  connect  all 
array  inputs  to  the  programmable 
AND-OR  array,  and  they  configure 
the  architecture  of  the  I/OLMs.  The 
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user  programs  these  cells  by  charg- 
ing and  discharging  them,  so  that  de- 
sign changes  can  be  made  quickly — 
far  faster  than  for  ultraviolet-light 
erasable  PLDs,  or  EPLDs. 

The  record  number  of  product 
terms,  16  macrocells,  and  horde  of 
E2  cells  make  the  two  newcomers 
the  densest  24-pin  SUM  PAL  de- 
vices available,  whether  bipolar  or 
CMOS.  The  ICs  are  five  to  six  times 
denser  than  industry-standard,  first- 
generation  20-  and  24-pin  EPLDs 
and  E2PLDs  and  twice  as  dense  as 
second -generation  24-pin  parts.  In 
fact,  these  CMOS  PALs  are  denser 
even  than  40-pin  bipolar  PLDs. 

Their  35-ns  max  input-to-output 
propagation  delay  and  external  clock 
frequency  of  20  MHz  max  fixes  the 


new  CMOS  devices  in  the  medium 
speed  range  expected  of  bipolar 
units.  Maximum  active  power  dissi- 
pation is  120  mA.  Each  comes  in  two 
logic  levels— CMOS  HC  and  CMOS 
HCT  logic  for  TTL  systems.  (The 
trademark  PAL  belongs  to  Mono- 
lithic Memories.) 

From  what  follows,  it  will  become 
clear  that  the  basic  1/  O  logic  macro- 
cell  structure  is  the  same  for  both  the 
synchronous  29M16  and  asynchro- 
nous 29MA16.  Differences  arise, 
though,  in  the  I/OLM  control  mech- 
anisms— their  clocking,  resetting 
and  presetting,  and  output  enabling. 
Also,  the  distribution  of  the  product 
terms  between  logic  and  control 
functions  is  different  for  the  two 
PALs.  The  testing  and  debugging 


Fig.  1.  The  architecture  of  the  synchronous  AmPAL29M16  E1  programmable 
logic  device  derives  much  of  its  flexibility  from  its  16  bidirectional  input  and 
output  logic  macrocells.  The  macrocells  may  use  either  one  or  two  clocks, 
and  each  controls  the  output  of  on  average  11  logic  product  terms. 


features,  though,  are  identical.  In 
conclusion,  an  application  of  the 
29M16  will  be  outlined — the  use  of 
nearly  a  dozen  of  these  packages  to 
build  a  32-bit  direct  memory  access 
(DMA)  controller. 

I/OLM  with  a  difference 

The  flexibility  of  the  I/O  and  stor- 
age elements  of  these  parts  lends 
them  the  look  more  of  a  gate  array 
than  a  PAL  device  (see  Figs.  1  and 
2) .  Eight  of  the  I/OLMs  have  single 
feedback  to  the  AND/ OR  array,  and 
eight  have  dual  feedback.  Single 
feedback  comes  from  either  the  I/O 
pin  or  the  storage  element,  dual  feed- 
back comes  from  both. 

It's  when  an  I/O  pin  is  used  for 
input  that  its  associated  storage  ele- 
ment can  be  turned  to  use  as  a  buried 
register  or  latch.  As  many  as  all  eight 
of  the  dual -feedback  I/OLMs  can 
be  devoted  to  user-programmable 
buried-state  register/ latches.  These 
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latch? 


are  helpful  for  both  synchronous  and 
asynchronous  state-machine  designs. 

The  presence,  number,  and  pro- 
grammability of  the  E2  cells  within 
the  I/OLMs  is  the  most  significant 
architectural  variable  of  these  PAL 
devices.  ( Second -generation  PLDs 
had  three  to  four  times  fewer  archi- 
tectural cells.)  Is  the  macrocell  to  be 
an  input  or  an  output?  An  edge-trig- 
D  flip-flop  or  a  transparent 
What  are  the  macrocell's 
clocking,  feedback,  and  output- 
enable  schemes  and  output  polarity? 
What  if  it  frees  up  a  buried  state 
register? 

Programming  the  E2  cells 

All  these  decisions  are  made  by 
programming  the  E2  cells.  Single- 
feedback  macrocells  have  nine  E2 
cells,  dual-feedback  macrocells  have 
eight.  In  its  virgin,  erased  (charged 
or  disconnected)  state,  an  E2  cell's 
value  is  1.  In  its  programmed  (  dis- 
charged or  connected)  state,  it  is  a  0. 


Both  synchronous  and  asynchro- 
nous 1/  OLMs  contain  a  configurable 
storage-element  cell  and  five  multi- 
plexers— two  2 : 1  multiplexers  for  se- 
lecting I/O  and  feedback  path  and 
three  4:1  multiplexers  for  selecting 
output  path,  clock,  and  output  en- 
able (see  Fig.  3) . 

Whether  a  macrocell  is  to  be  an 
input  or  output  and  the  nature  of 
its  storage  element  (edge-triggered 
register  or  transparent  latch)  is  set 
by  E2  cells  S2  and  S3.  In  its  virgin 
state,  the  I/OLM  is  an  output  cell 
and  the  storage  element  is  an  edge- 
triggered  register.  It  can  be  changed 
by  the  user  to  an  input  latch  or  regis- 
ter or  output  latch  by  appropriate 
programming  of  S2  and  S3 . 

The  polarity  (active  high  or  active 
low)  of  the  cell's  output  and  its  se- 
quential (registered  or  latched)  or 
combinatorial  nature  are  determined 
by  S0  and  S^  With  both  of  these 
cells  in  the  erased  (charged)  state, 
for  example,  the  output  is  set  to  be 


Fig.  2.  The  16  I/O  macrocells  in  the  asynchronous  AmPAL29MA16  can  each 
be  allotted  its  own  product-term-driven  clock.  The  PAL's  need  tor  more 
contro  terms  leaves  an  average  ot  7  logic  product  terms  per  macrocell. 


active  low  and  combinatorial. 

By  programming,  or  discharging, 
cell  S0,  the  designer  makes  the  out- 
put go  active  high.  The  OR-gate  out- 
put bypasses  the  storage  element 
and  feeds  the  output-path  selection 
multiplexer  directly. 

On  the  other  hand,  by  program- 
ming S! ,  the  designer  makes  the  out- 
put sequential  ( registered/ latched ) . 
The  Q  output  of  the  storage  element 
passes  through  the  4:1  multiplexer 
to  an  inverting  output  buffer  and  is 
fed  back  to  the  AND  array  inter- 
nally. This  configuration  is  particu- 
larly useful  for  state  machine  de- 
signs. Programming  both  S0  and  Sj 
results  in  an  active-high,  sequential 
output. 

The  feedback  multiplexer  for  the 
I/OLMs  with  single  feedback  is  con- 
trolled by  yet  another  E2  cell  (S8) . 
This  separate  control  of  the  feedback 
multiplexer  makes  for  the  ultimate 
in  flexibility  in  state  machine  design. 
Each  I/OLM,  with  registered,  com- 
binatorial or  latched  outputs,  choice 
of  polarity,  and  dual  feedback,  has 
12  possible  output  configurations. 
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The  timing  needs  of  synchronous 
state  machines  are  much  simpler 
than  those  of  asynchronous  and  glue 
logic — hence  the  difference  in  the 
two  PALs'  clocking,  output-enable, 
and  preset/set  arrangements  and 
in  the  number  of  product  terms  as- 
signed to  logic  and  control. 

For  synchronous  state-machine 
design,  the  flexibility  of  one  or  two 
synchronous  clocks,  preferably  with 
polarity  control,  is  often  needed  for 
individual  macrocells.  It  is  impor- 
tant for  designing  single-,  dual-,  or 
quadruple-state  machines  with  a  var- 
iable number  of  state  registers.  Pro- 
grammable clock  polarity  offers  the 
ability  to  trigger  certain  groups,  or 
banks,  of  registers  on  one  edge  and 
other  groups  on  the  opposite  edge. 
The  result  is  faster  internal  state 
machines. 

Early  PAL  devices  had  a  single 


synchronous  clock.  Some  newer  syn- 
chronous CMOS  erasable  PAL  de- 
vices offer  dual  clocks,  but  the  two 
are  selectable  for  a  bank  of  8  regis- 
ters at  a  time,  rather  than  for  16 
macrocells  individually.  Thus,  they 
are  limited  to,  at  most,  two  different 
state  machines,  each  with  an  eight- 
register  maximum. 

In  addition,  these  PAL  devices  us- 
ually have  two  separate  clock  pins. 
So,  if  the  designer  needs  a  single- 
state  machine  with  up  to  16  registers, 
the  second  clock  pin  is  wasted. 

Better  clocking 

The  29M16s  alleviate  these  draw- 
backs. Each  I/OLM  has  a  4:1  pro- 
grammable clock-selection  mecha- 
nism selected  by  two  E2  cells,  S4  and 
S5.  For  addressing  synchronous 
logic,  the  29M16  has  one  dedicated 
CLK/LE  (latch  enable)  pin  and 


Fig.  3.  The  II O  logic  macrocell,  in  either  the  synchronous  AmPAL29M16  gate 
array  (with  input  gates  shaded  yellow)  or  the  asynchronous  AmPAL29MA16 
(inputs  on  yellow  background  at  tar  left),  depends  for  its  functionality  on  how 
the  erasable  S  cells  configure  the  multiplexers  and  storage  cell. 

  COMMON  OE 


one  CLK/LE/ 1  (input)  pin.  For 
macrocells  configured  as  registers, 
these  pins  are  clock  pins;  for  latches, 
they're  latch-enable  pins. 

Either  the  clock  or  latch-enable 
pin  may  be  selected  to  trigger  each 
storage  element  on  either  the  rising 
or  trailing  edge.  With  this,  system 
designers  can  implement  up  to  a 
quadruple-state  machine,  with  each 
meant  to  have  a  different  number  of 
registers. 

Also,  since  the  device  offers  16 
I/OLMs  with  input  register/ latches, 
the  PAL  is  applicable  to  pipelined 
systems  with  eight  input  latches  and 
eight  output  registers.  The  dual- 
clock  capability  allows  the  PAL  to 
run  with  synchronous  or  asynchro- 
nous inputs.  The  clocks'  program- 
mable polarity  allows  the  internal 
state  machine  to  run  at  twice  the 
speed  of  the  external  clock. 

Further,  since  only  one  pin  is  ded- 
icated as  a  clock/ latch  enable,  a  sin- 
gle state  machine  need  not  waste  the 
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second  clock  pin.  It  can  be  an  extra 
input. 

While  one  or  two  synchronous 
clocks  with  programmable  polarity 
are  sufficient  for  synchronous  state 
machines,  asynchronous  and  glue 
logic  applications  have  more  de- 
manding clock/ latch  requirements. 
They  call  for  more  flexible  PAL  de- 
vices with  combinatorial  outputs  and 
feedback  and  with  edge-triggered 
registers  and  transparent  latches 
having  individual  clock/ latch-enable 
capability  (preferably  with  pro- 
grammable polarity) . 

In  addition,  they  require  individ- 
ual control  of  reset  and  preset  and 
output-enable  product  terms,  as  well 
as  slightly  fewer  product  terms  per 
output.  Further,  for  asynchronous/ 
glue  logic  applications,  the  clock-se- 
lection and  output-enable  schemes 
should  be  independent. 

The  29MA16  addresses  these 
needs  with  up  to  16  product-term- 
driven  asynchronous  clock/ latch-en- 
able controls.  Each  of  these  controls 
is  a  product  term  driven  from  the 
AND-OR  array  as  a  function  of  cer- 
tain input  signals.  These,  in  turn, 
are  controlled  by  the  user  according 
to  the  application. 

Like  its  synchronous  counterpart, 
each  I/OLM  of  the  asynchronous 
29MA16  has  a  4:1  multiplexer  for 
clock  selection.  Unlike  the  other, 
however,  it  allows  the  user  to  select 
from  either  a  common  synchronous 
clock/latch  enable  or  an  individual 
clock  driven  by  a  product  term. 

Preset  and  reset 

The  synchronous  29M16  uses 
common  asynchronous  reset  and 
preset  controls  for  all  registered  or 
latched  inputs  and  outputs.  When 
the  product  term  for  asynchronous 
preset  is  asserted,  all  the  register/ 
latches  are  immediately  loaded  with 
a  high,  independent  of  the  clock. 
Conversely,  when  the  asynchronous 
reset  product  term  is  asserted,  all  the 
register/ latches  are  immediately 
loaded  with  a  low,  independent  of 
the  clock.  Actual  output  depends  on 
the  macrocell  polarity. 

The  asynchronous  29MA16,  on 
the  other  hand,  offers  individual 
asynchronous  reset  and  preset  con- 
trol for  each  of  its  storage  elements. 


This  feature,  plus  the  individual 
clock  capability,  makes  it  easy  for 
the  device  to  replace  a  large  amount 
of  glue  logic. 

As  in  all  PAL  devices,  the  size, 
organization,  and  distribution  of  the 
internal  product  terms  of  the  29M16 
and  29MA16's  AND-OR  array  de- 
termine its  logic  capability.  Both 
PAL  devices  can  have  29  inputs,  of 
which  24  issue  from  the  I/O  logic 


macrocells  (8  from  I/OLMs  with 
single  feedback  and  16  from  I/OLMs 
with  dual  feedback),  1  comes  from 
the  I/OE  pin,  and  3  are  dedicated. 
The  29MA16  adds  a  fourth  dedi- 
cated input,  but  the  29M16  instead 
has  the  CLK/LE/I  pin. 

Of  the  29Ml6's  188  product  terms, 
all  of  176  are  logic  terms  and  12  are 
control  terms.  The  176  logic  product 
terms  are  distributed  among  the  16 


PALs  move  to  ever-greater  flexibility 


Important  gauges  of  a  PAL  device's  flexi- 
bility are  the  number  and  complexity  of 
its  input/output  logic  macrocells.  With 
the  introduction  of  its  electrically  eras- 
able, CMOS-based  AmPAL29M16  and 
AmPAL29MA16,  Advanced  Micro  Devices 
has  brought  more  flexibility  than  ever  to 
programmable  logic  devices. 

First-generation,  bipolar  fuse-pro- 
grammable PAL  devices  had  at  most 
eight  outputs  fixed  at  the  factory  as 
either  registered  or  combinatorial  with 
active  high  or  low  polarity.  The  second 
generation  improved  on  this  somewhat 
by  adding  10  programmable  outputs  con- 
figurable as  registered  or  combinatorial 
and  active  high  or  low. 

Although  some  second-generation 
CMOS  EPROM-based  PLDs  offer  as  many 
as  16  logic  cells,  these  can  be  configured 
only  as  outputs.  Further,  these  outputs 
lack  a  transparent-latch  capability,  a  re- 
quirement for  both  asynchronous  and 
synchronous  designs. 

Architectural  flexibility  also  extends  to 
features  such  as  variable  product  terms, 
varying  the  number  of  product  terms  per 
output,  and  programmable  buried  states. 
Variable  product  terms,  first  introduced 
in  bipolar  PAL  devices  like  the  AmPAL- 
22V10,  help  the  device  match  the  appli- 
cation. Programmable  buried  states  use 
internal  storage  elements  for  state-ma- 
chine design;  until  now,  PAL  devices 
lacked  independent  buried  states.  Such 
elements  were  used  only  in  conjunction 
with  I/O  pins.  Consequently,  when  an 
I/O  pin  was  an  input,  its  associated  stor- 
age element  was  wasted. 

At  first,  PAL  devices  were  produced 


with  fuse-based  bipolar  technology;  they 
could  not  be  reprogrammed.  Next  genera- 
tion EPROM-based  PLDs  (or  EPLDs)  are 
reprogrammable,  as  are  the  still  newer 
electrically  erasable  PLDs  (E'PLDs). 

To  be  erased,  EPLDs  must  be  in  ex- 
pensive windowed  ceramic  packages 
through  which  they're  exposed  to  ultra- 
violet light  for  a  lengthy  time.  Easier  to 
reprogram,  E!PLDs  fit  in  inexpensive 
plastic  packages. 

Complete  testability  is  another  sig- 
nificant advantage  of  E!  technology.  Un- 
like bipolar  and  EPLDs,  which  require 
additional  testing  circuitry,  E!  devices 
use  only  the  array  and  its  operating  cir- 
cuitry to  make  DC,  AC,  and  functional 
tests. 

Finally,  a  PAL  device  must  meet  the 
specific  demands  of  combinatorial,  syn- 
chronous, and  asynchronous  applica- 
tions. While  some  CMOS  PLDs  have  at- 
tempted to  address  the  broad  range  of 
applications  with  a  "one-size-fits-all" 
approach,  this  has  limitations.  The  device 
tends  to  have  extra  features  that  show 
up  in  overhead  while  not  providing  others 
that  are  key.  The  result  is  expensive  and 
less  than  optimal. 
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Fig.  4.  This  multichannel  32-bit  DMA  controller  is  built 
from  11  AmPAL29M16s,  one  AmPALISLS,  a16x  16-bit 
RAM,  and  an  8-bit  command  register. 
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1/  OLMs,  as  was  shown  in  Fig.  1 .  The 
eight  I/OLMs  with  dual  feedback 
have  eight  product  terms  each,  while 
of  those  with  single  feedback,  four 
have  12  product  terms  and  four  have 
16.  The  average  is  11  product  terms 
per  output. 

This  variable  distribution  of  prod- 
uct terms,  as  well  as  the  number  of 
product  terms  per  output,  allows 
more  complex  functions — such  as 
specialized  counters  and  more  com- 
plex state  machines — to  be  imple- 
mented in  a  single  device. 

Of  the  29M16's  12  control  terms, 
two  are  used  as  common  asynchro- 
nous reset  and  preset,  and  eight  are 
output-enable  product  terms — one 
pair  of  common  AND-XOR  product 
terms  per  bank  of  four  registers.  The 
remaining  two  are  used  for  observa- 
bility in  testing  and  for  preload  of 


 ►  TO  BUS 

the  device's  storage  element. 

As  for  the  29MA16,  its  smaller 
total  of  178  product  terms  is  divided 
into  only  112  logic  terms  and  all  of 
66  control  terms.  The  logic  product 
terms  are  also  distributed  in  a  var- 
iable fashion  (see  Fig.  2  again); 
the  average  is  7  product  terms  per 
output.  Of  the  control  terms,  four 
are  distributed  to  each  of  the  16 
I/OLMs,  and  the  remaining  two  are 
for  observability  and  preload. 

Output  enable  control 

The  I/O  pin  for  both  new  devices 
may  be  configured  as  a  dedicated  (or 
permanently  disabled )  input,  a  dedi- 
cated output,  or  dynamically  con- 
trolled. This  flexibility  results  be- 
cause the  connection  from  the  I/O- 
LM  to  its  I/O  pin  is  controlled  by  a 
4:1  multiplexer. 


On  both  devices,  three  of  the  in- 
puts to  this  multiplexer  come  from 
ground,  Vrc,  and  the  1/  O-enable  pin. 
On  the  synchronous  29M16,  the 
fourth  input  to  the  multiplexer  is 
driven  by  the  two  AND-XOR  prod- 
uct terms  already  mentioned  as  be- 
ing common  for  a  bank  of  four 
I/OLMs.  In  synchronous  state  ma- 
chines, this  arrangement  allows  the 
I/O  pins  to  be  controlled  in  groups 
of  four.  The  AND-XOR  product 
term  control  also  allows  for  program- 
mable polarity,  a  useful  feature  for 
bus-control  applications. 

On  the  29MA16,  this  fourth  input 
is  driven  by  an  individual  product 
term  from  the  AND-OR  array.  Such 
individual  product-term  control 
gives  users  maximum  flexibility  in 
configuring  the  I/O  pins. 

Both  the  parts  have  built-in  fea- 
tures to  simplify  testability  and  sys- 
tem-level debugging.  These  include 
a  preload  and  a  power-up  reset  for 
testability  and  an  observability 
product  term  for  debugging  and  for 
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tracing  buried  state  registers. 

When  the  observability  product 
term  is  asserted,  it  suppresses  the 
combinatorial  output  data,  prevent- 
ing the  data  from  appearing  on  the 
I/O  pin.  It  then  presents  the  con- 
tents of  the  register/ latches  on  the 
output  pin  for  each  of  the  logic  mac- 
rocells. 

The  built-in  power-up  reset  ini- 
tializes the  register/ latches  to  a 
known  state,  a  feature  that  simpli- 
fies state  machine  designs.  A  secur- 
ity cell  in  both  devices  protects  the 
logic  against  unauthorized  copying. 

A  multichannel  DMA  controller 

The  AmPAL29M16/29MA16  are 
suitable  for  applications  ranging 
from  glue  and  general-purpose  logic 
replacement  to  synchronous  and 
asynchronous  state  machines.  Fairly 
complex  Mealy  and  Moore  state  ma- 


chines can  be  readily  designed,  and 
the  I/O  latches  support  double  pipe- 
lined systems. 

Figure  4  is  the  block  diagram  of  a 
direct  memory  access  (DMA)  con- 
troller built  for  the  32-bit  68020  mi- 
croprocessor. It  is  a  simple,  four- 
channel,  single-cycle,  32-bit  mem- 
ory-to-memory data-transfer  design. 

The  68020  processor  accommo- 
dates memory  blocks  and  peripheral 
widths  ranging  from  8  to  32  bits  on 
its  system  bus.  It  also  imposes  cer- 
tain connectivity  requirements  on 
slave  devices:  8-bit  devices  must  go 
to  the  uppermost  8  bits  of  the  data 
bus,  while  16-bit  devices  must  con- 
nect to  the  data  bus's  upper  word. 

Consequently,  issues  related  to  the 
data  width  for  the  DMA  operation 
are  the  data-transfer  size  (8,  16,  or 
32  bits),  the  data-funneling  scheme 
(for  packing  and  unpacking  data 


KEY 

BRST_ZER  =  BURST  COUNTER  «T  ZERO 
CNT-ZER  =  BLOCK  COUNTER  AT  ZERO 
WAIT.  ZER  =  WAIT  COUNTER  AT  ZERO 
DSACK      =  DATA  ACKNOWLEDGED 

=  BUS  GRANTED 
RS  -  READ  STATE 

WS  =  WRITE  STATE 


Fig.  5.  The  state  diagram  of  the  DMA  controller's 
operation  shows  that  it  will  cycle  continuously 
through  the  read  and  write  states  until  some  other 
bus  arbiter  is  granted  control  of  the  bus. 


to  support  a  flexible  data-transfer 
scheme) ,  and  block  size. 

The  DMA  logic  comprises  two  ad- 
dress pointers  (one  for  source  and 
one  for  destination) ;  a  32-bit  data 
latch;  burst,  wait,  and  block-length 
counters;  an  8-bit  command  register; 
a  16  x  16-bit  RAM;  and  the  DMA 
sequence  controller.  This  design  uses 
a  total  of  11  CMOS  AmPAL29M16s 
— 9  to  implement  pointers,  latch,  and 
counters  and  the  remaining  couple  in 
the  sequence  controller. 

The  two  pointers  have  a  width  of 
30  bits  divided  into  an  upper  16-bit 
segment  register  field  and  a  lower 
14-bit  offset  register  field.  The  16- 
segment  register  field  allows  data  to 
be  transferred  in  64-K  segments, 
each  segment  being  a  maximum  of 
16-K  blocks  (or  64-K  words,  with  4 
words  in  each  block) . 

The  task  of  segment  register  main- 
tenance comes  under  CPU  control. 
Thus,  if  the  data  to  be  transferred 
spans  a  segment  boundary,  the  CPU 
must  break  up  the  transfer  into  two 
separate  operations  and  set  up  seg- 
ment registers  for  it. 

The  14-bit  offset  register's  value 
is  loaded  into  the  RAM  locations  by 
the  CPU  before  the  DMA  operation 
In  this  example  the  DMA 
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transfers  only  long  words,  so  the 
least  significant  address  bits  A,  and 
A0  are  stuck-at-zeroes.  The  offset 
register  is  incremented  after  every 
DMA  read/write  cycle  unless  the 
command  register  says  not  to. 

A  32-bit  data  latch  temporarily 
holds  the  long-word  data  fetched 
from  the  source.  During  destination 
writes,  this  becomes  the  source  for 
the  data.  The  burst,  wait,  and  block 
count  registers  are  each  of  them  14- 
bit  decrementers. 

The  burst  register  counts  down  af- 
ter every  DMA  read/ write  cycle.  At 
zero,  the  data  and  address  buses  are 
relinquished  by  the  DMA  controller 
for  a  number  of  cycles  denned  in  the 
wait  register. 

The  14-bit  block  counter  decre- 
menter,  whose  value  is  loaded  into 
RAM  locations  by  the  CPU,  keeps 
track  of  the  number  of  long  words 
transferred  in  each  current  DMA  cy- 
cle. The  counter  is  loaded  with  a 
value  by  the  CPU  before  each  new 
DMA  operation  and  is  decremented 
by  the  DMA  control  logic  at  each 
transfer.  A  count  of  zero  indicates 
the  end  of  current  DMA  operation. 


The  control  sequencer 

The  DMA  control  sequencer  has 
three  jobs.  It  handles  loading  both 
the  pointers  and  the  registers  from 
the  RAM  locations.  It  controls  the 
DMA  read/ write  cycles.  And  it  con- 
trols the  interfaces  to  the  CPU  ( han- 
dling of  burst  and  wait  registers )  and 
the  memory. 

Because  of  I/O  limitations  in  the 
sequence,  the  buried  registers  asso- 
ciated with  inputs  implement  the 
state  sequence.  The  output-enable 
structure  enhances  the  individual 
I/O  control  of  every  macrocell,  and 
the  clock  scheme  is  used  by  trigger- 
ing on  both  edges  of  the  system  clock 
to  improve  the  DMA  read/ write  cy- 
cle throughput. 

According  to  the  state  diagram  for 
the  DMA  sequence  controller  (see 
Fig.  5) ,  the  device  stays  in  the  idle 
state  until  a  DMA  operation  is 
started  by  the  CPU's  writing  the 
GO  bit  in  the  command  register. 
The  controller  then  loads  the  point- 
ers and  the  counter  registers  from 
the  RAM  locations  and  requests 
the  bus.  After  the  bus  is  granted,  it 
initiates  memory-to-memory  data 


transfers — read  followed  by  write 
cycles.  (One  memory-to-memory 
transfer  cycle  is  a  single  operation.) 

DMA  preemption 

The  DMA  preemption  is  per- 
formed in  write  state  WS3.  When 
the  external  arbiter  signals  that  the 
bus  is  needed  by  another  potential 
bus  master  (by  negating  bus  grant) , 
the  DMA  controller  gives  up  the  bus 
and  enters  the  bus-request  state.  It 
stays  there  until  the  bus  control  re- 
turns to  the  DMA  controller. 

In  state  WS3,  if  the  block  counter 
reaches  value  0,  then  it  returns  to  the 
idle  state.  If  the  burst  length  has 
reached  zero,  but  the  counter  value 
is  not  zero,  then  it  goes  to  the  wait 
state  until  the  wait  register  value 
reaches  zero.  Then  it  transfers  to  the 
"load  burst  register"  state.  In  state 
WS3,  however,  if  both  the  counter 
and  burst-register  values  are  not 
zero,  and  if  no  other  bus  arbiter 
needs  the  bus,  the  controller  contin- 
ues memory-to-memory  data  trans- 
fers until  it  is  done.  □ 


PROSE  Devices  Simplify 
State  Machine  Design 


A  bipolar  device  combines  a  PAL  device  for  conditional  branching 
with  a  PROM  device  for  storing  system  control  functions  to  make 
the  design  of  small  state  machines  easier. 


Oontrol  logic  is  required  in  a  broad  range  of  ap- 
Wplications — from  low-level  control  of  custom 
bit-slice  microprocessors,  to  bus  arbitration  and 
timing  generation  in  conventional  microprocessors, 
to  special-purpose  needs  like  data  encryption  and 
decryption.  Even  though  these  applications  vary  in 
complexity,  they  all  require  a  sequential-type  device 
(a  state  machine,  for  example)  that  provides  for  se- 
quences of  output  signals  based  upon  state  tran- 
sitions and  that  includes  some  capability  for 
branching.  More  complex  sequencers  provide  capa- 
bility for  looping  and  subroutining. 

Possible  candidates  for  control  logic  range  from 
Programmable  Array  Logic  (PAL)  devices,  to  the 
more  complex  Field  Programmable  Logic  Se- 
quencer (FPLS)  devices,  to  instruction-based 
microprogram  controllers  that  rely  on  on-chip 
memory  for  expanded  subroutine  capabilities. 
While  FPLS  and  PAL  devices  have  limited  product 
terms  for  many  state-machine  applications,  the 
other  microprogram  controllers  are  sometimes 
more  complex  than  required.  For  applications  re- 
quiring control  functions  that  small  and  medium 
state  machines  can  represent,  the  Prose  device  from 
Monolithic  Memories  Inc  (MM1)  can  facilitate  con- 
trol logic. 

Combining  a  PAL  device  for  conditional  branch- 
ing and  a  PROM  for  storing  the  control  functions 
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of  the  system,  Prose  is  intended  to  function  as  a 
programmable  sequencer  (Prose  =  PROgrammable 
SEquencer).  It  offers  a  maximum  of  128  states  and 
up  to  four-way  branching.  A  bipolar  device  oper- 
ating at  up  to  25  MHz,  Prose  functions  as  a  Moore 
state  machine.  Under  certain  conditions,  a  Mealy 
state  machine  can  also  be  modeled  with  Prose. 

Working  faster  and  simpler 

Custom  microprocessors,  such  as  those  built  with 
bit-slice  components,  require  control  logic  in  the 
form  of  multiple  IC  microinstruction  memories 
and  sequencers  to  generate  addresses  for  those 
memories.  For  complicated  sets  of  instructions 
with  many  states  and  complex  branching  and 
subroutines,  a  microprogram  controller  makes 
sense.  But  in  situations  in  which  a  more  limited  set 
of  instructions  and  instruction  transitions  is  re- 
quired, the  single-chip  Prose  device  is  faster  as  well 
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Prose  in  a  classic  traffic-controller  application 


Considers  four-way  intersection.  For  each  direc- 
tion (N,  S,  E  and  W),  there  are  two  sets  of 
lights:  one  for  the  forward  direction  (F)  and  one  for 
left  turns  (L).  Each  set  has  three  lights:  red  (R),  yel- 
low (Y)  and  green  (G).  For  each  of  the  four  direc- 
tions, there  are  also  two  detectors  (D):  a  left  turn 
detector  and  a  forward  detector. 

In  the  master  diagram  for  traffic-flow  control  (a), 
each  box  represents  a  set  of  states  generated  by 
the  controller  in  response  to  the  input  conditions. 
For  instance,  DO  NL  is  executed  given  the  input 
conditions  NLD  */SLD  (NLD  and  not  SLD). 

Four  Prose  devices,  a  counter  and  an  encoder 
are  used  to  implement  this  controller  (b).  The  in- 
puts to  this  controller  are  the  six  outputs  of  the  en- 
coder and  the  carry  out  (CO)  signal  from  the 
counter.  The  outputs  are  the  control  signals  to  the 
lights  themselves.  The  encoder  simply  passes  the 
NLD,  SLD,  ELD  and  WLD  signals.  It  generates  two 
new  signals,  NSD  and  EWD,  as  follows: 

NSD=NLD+NFD+SLD+SFD 
EWD  =  ELD+  EFD  +  WLD  +  WFD 

Prose  devices  1  through  3  generate  the  24  out- 
puts that  control  the  traffic  lights,  while  Prose  4 
preloads  the  counter.  Note  that  all  four  Prose  de- 
vices receive  the  CO  signal  from  the  counter.  CO 
tells  the  Prose  devices  when  the  preloaded 
number  of  cycles  has  been  completed  so  they  can 
generate  the  next  state.  The  counter  is  used  to 
conserve  states  in  the  Prose  devices;  that  is,  the 
devices  simply  cycle  back  to  the  same  state  until 
the  desired  number  of  cycles  has  been  completed 
or  until  some  condition  detected  by  the  sense  de- 
tectors warrants  jumping  out  of  the  loop  early. 
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as  simpler.  For  instance,  when  an  ALU  is  used  only 
for  a  multiply  or  filtering  operation,  Prose  devices 
reduce  chip  count  but  can  also  generate  the  required 
instructions  faster  than  the  more  complex  micro- 
program controller.  Prose  devices  can  also  be  used 
for  addressing  the  data  memory  to  fetch  the 
operands  for  the  filtering  operation. 

Consider  also  the  case  of  a  video  controller.  For 
scanning  purposes,  counters  that  operate  in  various 
sequences  and  count  lengths  are  required.  But  in- 
stead of  implementing  these  as  actual  counters,  the 
sequences  involved  can  be  unlocked  and  imple- 
mented as  state-machine  transitions.  And  there's 
an  advantage  beyond  the  mere  economy  of  parts.  A 
count  can  be  set  or  initiated  and  then  left  to  poll 
various  external  hardware  conditions  while  the  mi- 
croprocessor performs  other  operations. 


Memory  and  bus  arbitration  applications  offer 
further  opportunities  for  using  Prose  devices.  If 
two  processors  or  other  devices  vie  for  memory  ac- 
cess, a  simple  Prose  state  machine  can  implement 
the  logic  for  deciding  which  device  gets  the  bus 
and/or  access  to  memory.  In  such  a  case,  the  heavy 
artillery  of  a  microprogram  controller  and  memory 
isn't  required. 

As  a  sequencer  for  signal-processing  applica- 
tions, Prose  state  machines  offer  speed  and  suffi- 
cient functionality  without  the  overkill  of  a  com- 
plex microsequencer  paired  with  dedicated  mem- 
ory. For  simple  algorithms,  such  as  those  involved 
in  performing  a  fast  Fourier  transform,  Prose  de- 
vices can  control  the  set  of  vectors  that  are  mul- 
tiplied and  added  in  the  process.  Via  the  instruction 
sets  to  the  devices,  Prose  can  control  the  operation 
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(PRELOAD  110) 


Consider  a  block  such  as  DO  NL.  As  we  enter 
this  block,  we  have  just  completed  flow  in  the  east- 
west  direction.  The  previous  state  has  generated 
the  preload  for  the  counter  so  that  as  we  enter  the 
block  DO  NL  with  NLD"  /SLD,  the  counter  begins 
the  count.  Suppose  that  we  have  preloaded  110 
and  count  down.  On  the  count  following  000,  CO 
goes  high,  and  this  tells  the  Prose  devices  to 
branch  to  another  state.  Figure  (c)  is  a  state  transi- 
tion diagram  for  DO  NL.  The  state  "NLG  NFG"  re- 
cycles until  CO  or  /NLD  occurs.  Figure  (d)  shows 
the  details  of  DO  NLSL;  Figure  (e)  shows  DO  NFSF. 
These  diagrams  reflect  all  basic  transition  types  in 
the  controller.  There  are  four  blocks  of  the  type  DO 
NL,  two  of  the  type  DO  NLSL,  and  two  of  the  type 
DO  NFSF.  In  addition,  there's  one  other  block,  ALL 
REDS,  that  contains  only  one  state:  all  outputs  red. 
So,  from  counting  the  possible  states  of  the  sys- 
tem according  to  branch  type,  we  have  15  one-way 
branches,  10  two-way  branches,  0  three-way 
branches,  and  4  four-way  branches,  resulting  in  31 
of  the  128  PROM  locations  used. 

This  application  uses  four  Prose  devices — three 
for  generating  signals  to  control  the  lights  and  one 
for  controlling  the  counter  for  state  minimization. 
Although  outputs  are  different,  all  fourdevices  use 
the  same  inputs  and  execute  the  same  transition 
table  and  thus  can  be  considered  to  be  in  the  same 
state,  or  to  be  generating  a  single  state  with  a 
wider  output  word.  This  usage  demonstrates  how 
the  Prose  device  can  be  expanded  in  width.  When 
programming  the  devices,  state  transition  and 
condition  equations  are  identical  and  can  be 
cop  ed.  Only  the  output  equations  are  unique  to 
each  and  must  be  programmed  separately. 
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of  the  ALUs  or  multipliers  and  can  control  memory 
access  when  separate  Prose  devices  are  used.  A 
Prose  device  is  well  suited  to  relatively  simple  oper- 
ations that  don't  involve  complicated  decision 
making.  For  complex  digital  signal  processing  op- 
erations, a  programmable  DSP  device  may  be  bet- 
ter, but  it's  not  likely  to  be  as  fast  as  the  dedicated 
hardware  approach. 

In  peripheral  control,  the  simple  state-machine 
approach  can  be  efficient.  Consider  the  case  of  run- 
length-limited  code.  Both  encoding  and  decoding 
can  be  translated  into  state  sequence  diagrams, 
which  examine  the  serial  data  stream  as  it's  read, 
and  can  then  be  readily  programmed  into  a  Prose 
device. 

Industrial  control  and  robotics  also  require  sim- 
ple control  functions.  Tasks  such  as  mechanical  po- 


sitioning of  a  robot  arm,  simple  decision  making, 
calculating  a  trigonometric  function  and  displaying 
several  digits  usually  don't  require  a  high-power 
microsequencer  with  stacks  and  pointers  and  mi- 
croprogram memory.  What's  required  instead  is  a 
device  that  can  store  a  limited  number  of  states  and 
allows  simple  branching  upon  conditions. 

Control  logic  alternatives 

A  registered  PAL  device  is  a  simple  state  machine 
and  can  be  effectively  used  for  simple  control  func- 
tions, such  as  counters,  dynamic  RAM  controllers, 
interrupt  controllers  and  certain  types  of  video  con- 
trollers. Combining  a  PAL  device  with  memory 
such  as  a  PROM,  as  MMI  has  done  in  Prose,  gives 
the  device  much  more  powerful  control  capability. 
Another  alternative  for  control  logic  is  the  FPLS, 
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Prose  as  a  bus  arbiter 


Prose  can  be  handy  in  applications  that  require 
bus  arbitration.  Consider,  for  example,  two 
microprocessors  and  refresh  circuitry  that  require 
access  to  a  block  of  dynamic  RAMs  (a). 


MEMORY 
ARRAY 


DYNAMIC 
RAMS 


PROSE  ARBITRATION 


Prose  device  can  precisely  control  the  access  to 
memory. 

Once  access  is  granted,  a  complete  memory  cy- 
cle requires  five  clock  cycles.  Three  clocks  are  re- 
quired for  access;  two  are  required  for  DRAM 
precharging.  /RAS  is  asserted  from  the  second  cy- 
cle to  the  end  of  the  fourth,  while  /CAS  is  asserted 
from  the  third  cycle  to  the  end  of  the  fourth.  (Row 
and  column  addresses  are  asserted,  as  shown, 
prior  to  the  assertion  of  /RAS  and  /CAS.)  Both  /RAS 


REFRESH  CONTROL 


CC0'/CC1'/CC2VIS 


CC(0-2)  /CCACK(0-2) 

The  arbiter,  which  is  a  Prose  device,  receives  re- 
quests for  access  via  condition  codes  CC(2-0).  The 
device  may  also  receive  a  synchronous  initializa- 
tion request, /IS,  which  is  intended  to  reset  the  sys- 
tem to  an  initial  state.  From  these  input  signals, 
the  arbiter  must  generate  signals  to  control  the  ac- 
cess to  the  DRAMs.  Thus  it  must  generate  ac- 
knowledge signals  for  the  requesting  devices- 
condition  code  acknowledge  signals— /CCACK(2- 
0)— and  /RAS  (row  address  strobe)  and  /CAS 
(column  address  strobe)  for  the  memories  them- 
selves and  MCS  for  row/column  multiplexer  con- 
trol. With  this  set  of  input  and  output  signals,  the 


CC0VCC1VCC2VIS 
CC1VCC2VIS 


CC2VIS 


which  has  programmable  AND  and  OR  arrays  as 
well  as  JK  or  clocked-RS  flip-flops  with  internal 
registered  feedback  that  allow  sequencing  and  con- 
figurable I/O  pins.  FPLS  devices  have  a  fixed 
number  of  product  terms  that  can  be  shared  be- 
tween input  logic,  state  transitions  and  output  func- 
tions, dictated  by  the  application  needs.  Because 
the  Prose  device  has  dedicated  resources — the  PAL 
and  PROM  arrays — these  applications  trade-offs 
aren't  required.  While  not  characteristic  of  the  ar- 
chitecture, some  FPLS  sequencers  operate  at  dif- 
ferent speeds  depending  on  the  logic  functions 
implemented;  for  example,  as  the  complexity  of  the 
logic  functions  increases,  speed  decreases.  With  the 
Prose  device,  however,  operation  speed  is  a  con- 
stant. For  many  applications,  FPLS  devices  and 
Prose  devices  are  complementary;  a  design  engineer 
should  be  able  to  use  both,  choosing  the  right  de- 
vice for  the  application. 

Finally,  some  recently  introduced  instruction- 


based  machines,  similar  to  a  PROM  under  control 
of  a  microsequencer,  offer  another  option  for  con- 
trol logic.  These  machines  are  sophisticated  devices 
with  ample  states  for  state  machines,  but  they're 
too  complex  for  simple  control  applications.  More- 
over, user  adaptability  and  customization  is  lim- 
ited. Because  they're  powerful  and  have  sufficient 
outputs,  these  machines  can  easily  implement 
counter-type  state  machines.  But  they  offer  a 
limited  choice  of  combinations  of  input  literals  for 
condition  testing.  While  such  devices  form  a  power- 
ful microprogram  controller/sequencer  for  gener- 
ating addresses,  they  don't  provide  a  single-chip 
solution,  and  in  many  cases  they're  far  more  power- 
ful than  the  application  requires. 

Prose  device  overview 

The  major  building  blocks  in  the  Prose  device  are 
the  PAL  array,  the  PROM  array,  and  the  shadow 
and  output  registers.  In  brief,  the  PAL  array  gener- 
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and  /CAS  are  removed  during  the  fifth  and  the  first 
cycle  for  DRAM  precharging. 

The  sequences  of  memory  clock  cycles  with 
their  associated  control  signals  can  be  viewed  as  a 
set  of  state  transitions  (b).  The  condition  codes, 
CC(2-0),  are  assigned  as  follows:  CC2  is  a  refresh 
required;  CC1  is  a  request  from  Processor  1;  and 
CCO  is  a  request  from  Processor  2.  A  refresh  cycle 
differs  from  a  regular  memory  cycle  only  in  that 
/CAS  isn't  asserted. 

As  shown  in  the  transition  diagram,  A1  cycles 
upon  itself  if  IS  is  H  (or  active  low  signal  /IS  is  L), 
or  if  there's  no  request.  If  there's  a  refresh  request 
and  IS  is  L,  the  transition  A1  to  B1  occurs,  initiating 
the  cycle  of  states  B1  through  B5.  This  acknow- 
ledges the  refresh  request  and  generates  the  tim- 
ing required  to  carry  out  the  refresh  operation. 
Note  that  any  time  /IS  occurs,  the  next  transition  is 
to  A1.  The  same  is  true  for  the  loops  C1  through  C5 
(processor  1)  and  D1  through  05  (processor  2). 
Note  also  that  at  B5  if  there's  no  request  (and  IS  is 
L),  the  next  transition  is  to  A1.  But  if  IS  is  L  and  C2 
is  still  true  (another  refresh  request),  the  next  tran- 
sition will  be  to  B1.  Furthermore,  if  CC±  is  L  (no 
refresh  request),  IS  is  L,  but  CC1  is  H  (processor  1 
request),  the  transition  will  be  to  C1  directly. 
Similarly,  if  IS,  CC2  and  CC1  are  all  L,  and  CCO  is  H, 
the  transition  will  be  directly  to  state  D1.  The  tran- 
sition diagram  shows  a  similar  relationship  of  tran- 
sitions in  the  loops  C1  through  C5  and  D1  through 
D5.  There  are  16  states,  including  1  three-way,  12 
two-way,  and  3  four-way  states,  which  the  Prose 
device  can  easily  accommodate. 

Prose  can  also  handle  more  complex  configura- 
tions. Suppose  there  are  four  devices  competing 


for  bus  and  memory  access.  A  problem  arises  in 
simply  expanding  the  diagram  (c)  to  accommodate 
the  extra  device.  Doing  this  directly  would  require 
that  A1  be  a  five-way  branch.  But  the  Prose  is 
limited  to  four-way  branching. 
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This  difficulty  is  easily  avoided,  however,  by  in- 
serting another  state,  A2,  between  A1  and  the  tran- 
sition sequences  for  two  of  the  requesting 
devices.  Thus  A2  serves  as  the  embarkation  point 
forthese  transitions.  Here,  the  condition  codes  are 
CC3  (refresh),  CC2  (processor  1),  CC1  (processor  2) 
and  the  CCO  (an  I/O  port).  CC3  has  the  highest 
priority;  CCO  has  the  lowest.  Note,  however,  that 
D1  through  D5arethefiveclockcyclesfortherefresh 
circuit.  A  refresh  request  always  gets  the  highest 
priority  but,  since  speed  of  execution  isn't  crucial  for 
the  refresh  request,  it's  reached  via  A2.  That  is,  the 
refresh  request  is  never  ignored,  but  it's  adequate  to 
service  it  through  A2,  allowing  fasterservicing  (by  one 
clock  cycle)  of  processor  1  or  processor  2  requests. 
A1goestoA2forCC3*/IS  +  CC0-/CC2-/CC1-/IS;A2 
goes  to  D1  forCC3  "/IS.  This  configuration  requires 
no  one-way,  16  two-way,  1  three-way,  and  5  four-way 
states— easily  handled  by  Prose. 


ates  the  branch  address  bits  for  the  memory  (128 
words  x  21  bits),  which  stores  the  system's  next 
state  addresses,  device  outputs  and  control  func- 
tions. The  shadow  register  is  used  for  programming 
and  diagnosis,  and  the  output  register  provides  syn- 
chronous operation  and  registered  feedback. 

The  PAL  array  is  a  14H2-type:  it  has  14  comple- 
mentary inputs  and  two  active  high  outputs.  Each 
of  the  two  outputs  is  a  sum  of  eight  product  terms. 
Eight  of  the  inputs,  from  an  external  source,  are 
used  for  encoding  of  conditions  for  branch  selec- 
tion. The  other  six  inputs,  from  internal  feedback 
from  the  PROM,  are  used  for  selection  of  the  con- 
ditions that  determine  branching  when  decisions 
are  involved  in  state  transitions. 

The  PROM  consists  of  128  21-bit  words  or  loca- 
tions, each  of  which  can  be  viewed  as  a  state,  or  as 
containing  state-related  information.  Eight  bits  at 
each  location  are  intended  as  outputs.  The  other  13 
bits  are  intended  as  feedback:  five  are  feedback  to 


the  PROM  array  itself  for  partially  determining  the 
next  location  to  be  addressed,  and  six  are  condition- 
select  bits  to  the  PAL  array.  They're  used  for  prod- 
uct term  selection/enabling.  The  remaining  two 
bits  are  inputs  to  the  exclusive  OR  gates.  They're 
used  to  determine  address  bits  A6  and  A5  when 
branching  isn't  involved. 

When  no  branching  is  involved,  CS(5-0)  select  no 
condition  in  the  PAL  array,  so  that  no  product 
term  is  wasted,  and  the  PAL  outputs  are 
X1  =  X2  =  0.  Thus  XF1  and  XFO  are  used  to  select 
the  higher-order  address  bits — A6  and  A5,  in  this 
case — and  product  terms  are  conserved  for  use 
when  real  conditional  selection  (branching)  is  in- 
volved. When  conditional  branching  is  involved, 
no  more  than  four  branches  can  be  from  one  loca- 
tion, and  all  branch  states  will  be  in  the  same  prima- 
ry location  but  in  different  quadrants.  That  is, 
address  bit  A(4-0)  will  be  the  same,  and  the  condi- 
tions will  toggle  A6  and  A5  to  determine  the  exact 


2-605 


PROSE  Devices  Simplify  State  Machine  Design 


I  (0-7) 


PAL 
ARRAY 
14H2 


/ 


CS  (0-5) 


2 

XF  (0-1) 


V 


1L> 
1L> 


MODE 
SDI 
DCLK 
/  CLK 


PROM 
ARRAY 

128  <  21 


SHADOW  & 
OUTPUT  REGISTERS 


Q  (0-7) 


In  the  Prose  device,  the  PAL  ar- 
ray processes  inputs  to  produce 
two  decision  variables  that  ac- 
cess the  PROM  array,  which 
generates  outputs  from  the  ma- 
chine and  the  next  data  address. 
Support  logic  is  provided  for 
serial  scan  access  and  diagnosis. 


quadrant  location  of  each  of  the  possible  next 
states. 

For  diagnostic  purposes,  a  shadow  register 
breaks  the  data  path.  Data  can  be  shifted  in  or  out 
serially.  The  shadow  register  and  the  output  register 
are  operated  by  different  clocks,  so  that  data  may 
be  shifted  without  affecting  device  operation  and 
spoiling  data.  This  lets  data  (in  this  case,  test  vec- 
tors) be  shifted  into  the  shadow  register  and  clocked 
onto  the  output  register.  A  normal  operation  can 
then  be  performed,  after  which  the  data  can  be 
loaded  onto  the  shadow  register  and  shifted  out  to 
be  analyzed.  In  addition,  a  multiplexer  to  the  shad- 
ow register  lets  the  register  contain  data  from  the 
output  register  or  from  the  previous  shadow  regis- 
ter contents. 

Internally,  programming  of  the  Prose  device 
takes  place  via  the  shadow  register,  is  completely 
transparent  to  the  user  and  is  done  individually  for 
each  half  of  the  device.  For  programming  the  PAL 
part  of  the  device,  a  certain  pattern  of  data  is  shift- 
ed into  the  shadow  register  with  Vcc  raised  to  the 
programming  supervoltage  and  the  preset/enable 
pin  also  raised  to  the  supervoltage  afterwards.  Indi- 
vidual fuses  in  the  PAL  device  are  blown  and  se- 
lected by  the  appropriate  pattern. 

Programming  of  the  PROM  is  accomplished  in 
similar  fashion.  V^c  is  raised  to  the  supervoltage 
value;  SDI  is  raised  to  the  supervoltage  afterwards. 
The  bit  pattern  in  the  shadow  register  determines 
which  fuse  is  to  be  blown. 

Software  support 

Prose  state  machines  are  fully  supported  by  new 
programs  in  the  Palasm  suite — Proasm  for  assem- 
bly and  Prosim  for  simulation.  To  program  the 
Prose  device,  the  user  inputs  a  list  of  state  names, 
transitions  and  conditions  that  cause  transitions  to 
subsequent  states.  When  progamming  a  Moore  ma- 
chine, the  user  also  inputs  a  list  of  expected  outputs 
for  each  state,  and  when  programming  a  Mealy  ma- 


chine, the  user  inputs  a  list  of  outputs  expected  for 
each  path  to  each  state.  (For  a  Moore  state  ma- 
chine, the  outputs  depend  only  on  the  state.  For  a 
Mealy  machine,  outputs  depend  on  the  inputs  also.) 

User  input  to  Palasm  2  (for  state-machine  entry) 
consists  of  four  sections:  declaration,  state,  equa- 
tions and  simulation.  The  declaration  section  speci- 
fies the  type  of  Prose  device  to  be  programmed. 
Currently,  there's  just  one  device  in  the  family. 

The  state  section  specifies  the  state  transitions 
and  the  output  details.  The  user  specifies  the  type  of 
state  machine — Moore  or  Mealy — and  the  various 
defaults  that  should  be  used  when  the  next  state  or 
the  outputs  can't  be  determined  from  the  equa- 
tions. Furthermore,  the  user  specifies  state  transi- 
tions and  outputs  via  equations.  The  transition 
equations  specify  transitions  from  each  current 
state.  The  output  equations  specify  the  outputs  for 
each  state,  in  the  case  of  a  Moore  state  machine,  or 
the  outputs  expected  upon  the  transition  from  one 
state  to  another,  in  the  case  of  a  Mealy  state  ma- 
chine. For  the  Moore  machine,  there's  only  one  set 
of  outputs  for  each  state,  but  for  the  Mealy  ma- 
chine, the  outputs  depend  on  the  path  to  the  state. 
The  last  section,  the  simulation  section,  contains  in- 
structions that  tell  the  software  how  to  simulate  ac- 
tual operation  of  the  state  machine  and,  therefore, 
how  to  test  the  device  once  it's  programmed.  CO 
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In  the  abstract,  a  computer,  or  at 
least  parts  of  a  computer,  can  be 
defined  by  a  state-machine  di- 
rected-transition  graph  (  see  Fig.  1 ) . 
A  state  machine  is  a  small,  locally 
controlled  machine — not  intended 
for  a  complete  computation  of  a  com- 
plex program. 

Experts  generally  divide  state  ma- 
chines into  two  basic  classifications : 
algorithmic  and  synchronous-se- 
quential machines.  Algorithmic  ma- 
chines include  the  subcategorie: 
called  Turing  machines,  Post  ma- 
chines, and  finite  machines,  which 
primarily  describe  the  specific  algo- 
rithm used  in  programming  the  com- 
puter. A  basic  difference  between  a 
synchronous  sequential  state  ma- 
chine and  an  algorithmic  machine  is 
the  presence  or  absence  of  stop 
states.  A  start  state,  of  course,  must 
be  present  in  all  machines  or  else 
they  will  not  be  initiated. 

A  stop  state  in  an  algorithmic  ma- 
chine (like  a  Turing)  occurs  when 
it  has  finished  executing  an  algo- 
rithm. In  a  synchronous  sequential 
machine,  however,  to  achieve  a  stop 
state,  the  machine  must  branch  to  it- 
self— a  type  of  wait  state — until  an- 
other operation  is  ready.  The  only 
time  a  synchronous  sequential  state 
machine  may  be  actually  halted  is 
for  an  error  or  illegal  command  that 
forces  it  into  looping  on  itself  uncon- 
ditionally. But  once  in  such  a  state, 
the  state  machine  theoretically  can't 
exit  from  it.  The  machine  must  be 
reset  and  started  over  again. 

Implementing  a  state  machine 

Although  devices  such  as  PAL, 
PLA,  and  FPLS  arrays  can  be  used 
to  implement  such  a  state  machine,  a 
programmable  sequencer  (PROSE) 
device,  designated  PMS14R21,  is  a 
convenient,  flexible,  and  economical 
way.  It  consists  of  a  combination 
of  a  special  PAL  and  a  registered 
PROM  that  are  optimized  for  use 
in  local,  intelligent-control  state- 
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Fig.  1.  Parts  of  a  computer  can  be  represented  by  a  graph  of  a  state  machine 
showing  its  directed  transitions  between  states.  An  algorithmic  machine  (a) 
has  stop  state(s),  whereas  a  synchronous  machine  (b)  does  not. 

good  many  other  state  machines. 
Also,  timing  restraints  on  external 
devices  are  greatly  relaxed.  Perhaps 
most  important  of  all,  the  PROSE 
device  provides  a  small,  single-pack- 
age solution  to  state-machine  design 
(for  example,  24-pin,  3-in.  Skinny- 
DIPs  and  28-pin  plastic  leaded  chip 
carriers) . 


EX-OR 
CONTROL 

XF(0-1) 


A(0-4) 


PROM  ARRAY 
(128x21) 


ADDRESS 
FEEDBACK 


SHADOW  AND 
OUTPUT  REGISTERS 


,'13 


FB(0-12) 


Fig.  2.  The  PMS14R21  programmable  se- 
quencer has  a  special  PAL  array  joined  to 
a  registered  PROM  by  two  OR  gates. 


►  SDO 


OUTPUTS 

1(0-7) 


machine  design  (see  Fig.  2) . 

The  introduction  of  the  PROSE 
device  is  a  significant  advance  in  the 
design  of  fuse-programmable  state 
machines  of  small  to  middling  size. 
The  PAL  array  serves  to  select  tran- 
sition conditions,  and  the  PROM 
stores  the  state  assignments.  Also, 
two  exclusive-OR  gates  help  reduce 
the  total  number  of  states  and  prod- 
uct terms  a  state  machine  would 
need.  One  terminal  (I/E)  can  be 


programmed  to  receive  an  asynchro- 
nous initialization  signal  (one  that 
resets  all  registers  to  the  high  state) 
or  an  output  enable  (via  an  architec- 
tural fuse) ;  others,  to  enable  fault 
diagnosis.  A  power-up  reset  feature 
is  also  provided. 

Each  PROSE  device  has  eight  in- 
puts, eight  outputs,  up  to  128  states, 
and  up  to  a  four-way  branching  ca- 
pability. Many  local  control  func- 
tions can  be  done  faster  than  on  a 


Moore  and  Mealy  models 

Two  categories  of  state-machine 
models  are  generally  implemented: 
the  Moore  and  the  Mealy  state  ma- 
chines. The  Moore  machine's  out- 
puts depend  only  on  the  state;  they 
are  independent  of  inputs.  A  Mealy 
machine's  outputs,  however,  depend 
on  both  the  state  and  the  machine's 
inputs.  The  Mealy,  therefore,  is  a 
superset  of  a  Moore  state  machine. 

A  true  Mealy  state  machine  is  too 
versatile  to  be  denned  in  a  general 
architecture.  A  Moore  machine  is 
much  easier  to  define  and  simpler 
to  implement.  The  simpler  the  state 
machine  is,  the  faster  it  will  be.  On 
the  other  hand,  for  most  applica- 
tions, the  outputs  of  a  state  machine 
depend  somehow  on  the  inputs  as 
well  as  on  the  state. 

The  PROSE  device  can  support 
the  Moore  model.  Fortunately,  the 
Mealy  model  can  also  be  supported, 
provided  that  the  PROSE  device's 
output  pattern  can  be  pipelined  by 
one  clock  cycle.  Also,  the  number  of 


2-608 


rUUFJ]  Ai 


Designing  a  State  Machine  with  a  Programmable  Sequencer 


combinations  of  patterns  in  the  sub- 
sequent states,  and  the  state  thus 
pipelined,  must  not  exceed  four. 

Controlling  a  state  machine 

Naturally,  a  state  machine  must 
include  a  control  system  in  addition 
to  the  operating  portion  outlined  so 
far.  The  control  system  determines 
the  next  state  on  the  basis  of  the  cur- 
rent state  plus  a  combination  of  in- 
puts to  the  control  system.  Of  course, 
1  on  its  state,  the  operating  part 
of  the  system  determines  the  outputs 
of  the  machine. 

The  number  of  inputs,  to  a  certain 
extent,  determines  the  number  of 
conditions  (as  inputs  or  combina- 
tions of  input  literals)  from  which 
the  state  machine  can  select.  It  can 
also  determine  whether  encoding  the 
conditions  is  necessary,  which  then 
becomes  part  of  any  external  delay. 
A  small,  local,  intelligent,  arbitration 
unit  may  have  as  few  as  three  to  four 
states  to  as  many  as  30  to  40  states. 
A  highly  intelligent  controller  can 
have  thousands  of  states.  However, 
a  state  machine  generally  has  a  rela- 
tively small  number  of  states  to 
which  branching  occurs,  because  the 
latter  states  will  eventually  loop  to 
each  other  (except  maybe  for  the  few 
starting  states) . 

Conditional  branching  needed 

Because  a  state  machine  switches 
its  states  on  the  basis  of  both  the 
current  condition  and  inputs,  fre- 
quently conditional  branching  is 
necessary,  sometimes  to  more  than 
two  next  states.  Four-way  branching 
is  common.  If  more  than  four-way 
branching  is  required,  up  to  16-way 
branching  can  be  carried  out  by  two 
cycles  of  four- way  branches  (see 
Fig.  3) . 

But  in  applications  that  operate 
on  single-cycle  instructions,  such  a 
branch-fanout  tree  may  not  be  de- 
sirable. Such  single-cycle  multiple 
branching  can  be  done  provided  the 
machine  allows  losing  a  cycle  for  the 
fanout  and  has  enough  product  terms 
to  perform  the  branching  on  all  the 
conditions.  (A  PAL  or  PLA  device 
has  a  limited  number  of  available 
product  terms.) 

Similarly,  the  number  of  outputs 


Fig.  3.  A  state  fanout  tree  of  up  to  a 
16-way  branch  can  be  carried  out  in 
two  cycles  of  four-way  branches. 
This  operation  requires  the  state  ma- 
chine to  lose  a  cycle  for  the  fanout 
and  to  have  enough  product  terms 
available  to  perform  the  branching 
for  all  the  conditions. 

determines  whether  encoding  of  op- 
erations control  is  necessary  and 
how  many  operations  are  possible. 
Again,  decoding  of  control  signals 
externally  will  mean  extra  delay.  In 
a  single-device  state  machine,  since 
a  limited  number  of  pins  is  available, 
the  ratio  of  the  number  of  outputs  to 
the  number  of  inputs  is  very  critical. 
This  ratio  determines  the  applica- 


tion range  of  the  device. 

The  PROSE  state  machine  uses  a 
PAL  array,  designated  14H2,  instead 
of  a  simple  condition  multiplexer,  to 
avoid  limiting  the  choice  of  possible 
transition  conditions  to  just  one  in- 
put at  a  time.  Rather,  input  literals 
at  any  time  can  be  combined  so  that 
transition-condition  encoding  is  not 
needed.  The  eight  inputs  (I0  to  I7) 
to  the  PAL  device  are  condition  sig- 
nals. Together  with  the  six  addi- 
tional inputs  (CS0  to  CS5),  which 
carry  condition-select  feedback  sig- 
nals, they  determine  the  next  state. 
These  feedback  signals  serve  only  to 
select  the  conditions  for  branching. 

The  six  condition-select  feedback 
signals  to  the  PAL  array  simplify  the 
selection  of  the  transition  conditions. 
Also,  combinations  of  conditions  to 
minimize  product  terms  can  be  em- 
ployed. Since  the  output  registers 
are  all  initialized  to  high  states,  and 
the  next  state  after  an  initialization 
has  to  be  known  (state  127  for  this 
device),  all  product  terms  in  the 
PAL  array  must  be  disabled  with 
this  pattern  (all  high)  to  perform 
unconditional  branching.  Therefore, 
this  pattern  is  reserved  for  disabling 
all  product  terms  for  unconditional 
branching. 

The  14  PAL  inputs  are  comple- 
mented internally  to  help  program 
two  eight-product-term  OR  gates  at 
each  of  two  active-high  outputs  (X0 
and  X! ) .  The  conditions  controlling 
the  next  states  must  fit  within  these 
limits.  The  outputs  of  the  PAL  feed 
exclusive-OR  (EXOR)  gates,  which 
help  conserve  the  need  for  state  and 
product  terms. 

To  perform  an  unconditional,  or 
one-way  branch,  to  save  a  product 
term,  X(,  and  X1  are  at  00.  For  ex- 
ample, the  new  PROM  address  has 
A5  and  A6  equal  to  10  (binary) .  Ac- 
cordingly, the  EXOR  feedback  sig- 
nals XF()  and  XF1  are  programmed 
as  102  to  force  the  EXOR  outputs  to 
10  unconditionally. 

For  initialization,  the  CS0  to  CS5 
feedback  pattern  for  an  uncondi- 
tional branch  is  111111.  Thus,  the 
operational  feedback  of  CS0  to  CS5 
to  the  PAL  must  contain  at  least 
one  0 — that  is,  patterns  like  111111, 
11X1X1,  or  XX1XXX  should  not 
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1  PRIMARY  LOCATION 
be  used  to  select  any  product  term. 

Organization  of  the  PROM  array 

The  PROM  array  is  a  fuse-pro- 
grammable array  of  128  locations  by 
21  outputs.  Eight  primary  outputs 
are  directly  available  (Qn  to  Q7). 
The  actual  functions  are  denned  by 
the  user.  In  addition,  13  more  out- 
puts serve  as  feedback  signals  to  help 
determine  the  next  state.  All  the  out- 
puts are  registered  at  the  rising  edge 


Fig.  4.  An  absolute  address  map  (a) 
and  a  primary  location  map  (b)  show 
how  a  location  within  each  of  lour 
PROM  quadrants  has  the  same  pri- 
mary location.  If  two  states  branch 
to  the  same  state  (c),  that  state 
would  have  to  be  duplicated  if  all  the 
resulting  next  states  exceed  four. 


of  the  CLK  input  signal. 

The  primary  locations  in  the 
PROM  are  addressed  by  five  inputs, 
A0  to  A4,  which  are  fed  back  directly 
from  the  PROM  registered  outputs 
to  the  PROM  array.  These  5  bits  de- 
termine the  next  primary  location. 

Therefore,  32  primary  locations 
define  the  states.  The  remaining  two 
address  inputs  to  the  PROM  f  bits 
A-  and  Ar,),  which  the  PAL  array 
and  the  exclusive-OR  gates  generate, 
address  four  PROM  quadrants,  in 
each  of  which  branching  can  be  per- 
formed to  any  of  its  32  primary  lo- 
cations (see  Fig.  4a). 


Potentially,  each  of  the  128  loca- 
tions thus  addressed  could  define  a 
different  state.  In  actuality,  the  num- 
ber implemented  is  usually  fewer. 
For  example,  some  states  may  need 
to  be  duplicated  in  different  primary 
locations.  Consider  a  state  A  whose 
next  states  will  be  B,  C,  and  D,  and 
a  state  E  whose  next  states  will  be 
D,  F,  and  G.  Although  one  next  state, 
D,  is  common  to  the  two  conditional 
transfers,  putting  B,  C,  D,  F,  G  with- 
in the  same  primary  location  I  within 
the  four  quadrants)  is  impossible, 
since  a  primary  location  can  accom- 
modate only  four  next  states  in  all 
'see  Fig.  4b).  Therefore,  two  pri- 
mary locations  must  be  occupied 
with  the  state  D  and  its  clone. 

Another  example  requiring  dupli- 
cation of  states  occurs  even  when  a 
state  A  branches  to  states  B  and  C, 
and  state  D  branches  to  states  C,  E, 
and  F — state  C  being  common  fsee 
Fig.  4c) .  Although  the  total  number 
of  next  states  is  four,  if  insufficient 
product  terms  are  available,  the 
states  of  A  and  D  may  still  need  to 
be  duplicated  to  achieve  that  result. 

In  the  worst  case,  if  every  next 
state  had  to  be  duplicated  because 
of  incompatible  combinations,  or  if 
all  states  had  four-way  branches,  the 
PROSE  state  machine  could  still  ac- 
commodate 32  states.  Each  state 
would  have  to  be  pointed  to  a  differ- 
ent primary  location. 

Registers  for  the  PROM 

To  conserve  terminal  pins,  the  13 
bits  of  feedback  signals  from  the  21- 
bit  output  register  are  buried  within 
the  PROSE  chip  (see  Fig.  5) .  What 
is  called  a  shadow  register,  together 
with  a  diagnostic-on-chip  (DOC) 
scheme,  enables  testing  various  parts 
of  the  chip,  including  the  buried-sig- 
nal  conditions. 

The  shadow  and  output  registers 
(see  Fig.  6a)  are  operated  by  differ- 
ent clocks— DCLK  and  CLK,  re- 
spectively— in  several  modes  with 
the  MODE  and  SDI  signals  (see 
Table  1 ) .  The  output  registers  can 
multiplex  data  from  the  shadow  reg- 
ister (to  receive  a  test  vector)  or 
from  the  PROM  array  ( to  receive  a 
test  result).  Similarly,  the  shadow 
register  can  multiplex  data  from  the 


Designing  a  State  Machine  with  a  Programmable  Sequencer 


output  register  (for  the  test  result) ; 
the  previous  shadow-register  bits  (to 
shift  in  a  test  vector  and/ or  shift 
out  ai  test  result) ;  and  from  the  cur- 
rent shadow-register  bits  (as  just  a 
hold  operation) . 

The  DOC-register  concept  also 
solves  the  programming  issues  for 
the  PROSE  device.  A  pattern  in  the 
shadow  register  identifies  the  fuse 
to  be  blown  (see  Fig.  6b) .  Bits  PT0 
to  PT2  select  one  product  term  out 
of  eight  from  each  of  the  PAL's  sum 
of  terms.  Of  these,  bits  CS0  to  CS5 
and  I0  to  I7  select  one  input  to  the 
PAL  array.  The  bit  labeled  O/P  se- 
lects a  sum  term,  and  the  bit  T/C 
selects  either  the  true  or  complemen- 
tary value  of  the  input  selected. 

Similarly,  the  fuse  to  be  blown  in 
the  PROM  is  identified  by  the  pat- 
tern in  the  shadow  register  (see  Fig. 
6c) .  A0  to  A6  is  the  absolute  address 
for  the  PROM  array,  and  OA0  to 
OA4  selects  1  bit  out  of  the  21  bits 
in  a  PROM  word. 

Applications  abound 

The  PROSE  device  is  designed  to 
be  used  as  a  general-purpose  state 
machine  for  controlling  equipment 
or  for  data  manipulation.  The  128 
states  and  four-way  branch  capabili- 
ties make  it  possible  to  be  used  as  a 
high-speed  memory  or  bus  arbitra- 
tor, a  video-RAM  arbitrator,  an  in- 
dustrial traffic  controller,  a  signal- 
processor  sequencer,  a  sequencer  for 
mass  storage  or  communications,  a 
data  encoder  and  decoder,  and  for 
many  other  applications.  For  exam- 
ple, the  PROSE  state  machine  can 
implement  the  run-length-limited 
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Fig.  5.  The  block  diagram  for  a  diagnostic  register  bit  includes  both  a  shadow 
register  and  an  output  register.  Thus  a  test  vector  and  the  test  result  can  be 
shifted  in  and  out  of  the  shadow  register  without  corrupting  the  output  flow. 


code,  designated  2,7  RLL  (see  Table 
2).  It  is  popular  in  mass-storage 
equipment  and  can  increase  the  ca- 
pacity of  such  a  device  by  approxi- 
mately 50%. 

In  any  mass-storage  media,  a  min- 
imum separation  of  at  least  one  zero 


Table  1.  Functions  of  the  Diagnostic  Registers 


MODE 

SDI 

CLK 

DCLK 

Shadow 
register 
(SREG) 

Output 
register 
(OREG) 

Operation 

L 

X 

C 

hold 

OREG,<PROM 

normal 

H 

X 

c 

hold. 

OREG;<SREGi 

load  from  SREG 

L 

X 

C 

SREGi<SREG;-1 
SREG0<SDI 

hold 

shift  SREG 

H 

L 

c 

SREGi^OREGi 

hold 

load  to  SREG 

H 

H 

c 

hold 

hold 

no  operation 

II  ii  ll 
-1  I  X 

zero 
=  one 
don't  care 

C  =  clock  pulse  (low-to-high  transition) 
*  =  stable  clock  or  high-to-low  transition 
i  =  bit  number 

between  adjacent  logic  ones  is  re- 
quired. In  the  2,7  RLL  coding/ de- 
coding system,  at  least  two  zeros 
must  exist  between  any  two  adjacent 
ones.  (Thus  encoded  patterns  like 
11  or  101  are  illegal  because  there 
are  fewer  than  two  zeros  between  ad- 
jacent ones. ) 

Since  the  distance  between  ones 
determines  the  flux  transition  fre- 
quency on  the  magnetic  media,  a 
minimum  of  two  zeros  between  ad- 
jacent ones  means  that  flux  transi- 
tions are  at  least  three  bit  intervals 
apart.  Therefore,  the  encoded  flux 
density  becomes  3/X  (a  binary  100 
in  the  space  X,  the  minimum  on  the 
medium  between  transitions) .  Other 
schemes  like  MFM,  however,  require 
a  density  of  2/X  (a  binary  10  in  the 
minimum  space  X).  Accordingly, 
since  three  encoded  bits  occupy  the 
space  on  the  media  that  carried  2 
bits  before  encoding,  the  encoded 
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density  is  3/2,  or  1.5-times  the  orig- 
inal density.  (Additionally,  to  syn- 
chronize data,  a  maximum  of  seven 
zeros  can  be  present  between  any  two 
adjacent  ones.) 

Starting  from  the  INIT,  or  reset 
state,  the  encoder/ decoder  first  car- 
ries out  a  three-way  branch — to  per- 
form encoding  if  the  signal  ENC  is 
received,  or  to  perform  decoding  if 
the  signal  DEC  is  received,  or  else 
to  itself.  Once  the  state  machine 
starts,  it  continues  on  its  own  and 
will  be  stopped  when  the  machine 
receives  a  reset  signal  at  the  initiali- 
zation input  pin. 

Thus,  in  its  initial  state,  the 
PROSE  machine  loops  on  itself  until 
a  control  signal,  CTj ,  goes  low.  Then 
a  signal  CT0  must  be  set  to  com- 
mand either  an  encode  or  decode 
operation.  An  instruction  (/CT0/ 
CTj),  therefore,  forces  a  jump  to 
A00,  which  starts  the  encoding  op- 
eration of  the  state  machine  (see 
Fig.  7a) .  Data  inputs  serially,  before 
encoding,  as  M,  and  outputs  encoded 
serially  via  Q6. 

The  Q7  port  provides  an  output 
for  decoding;  it  is  not  used  during 
encoding.  Therefore,  in  the  states  in 


Table  2 
An  IBM  2,7  RLL  Code  Table 


Input  pattern 

Output  pattern 

000 

000100 

10 

0100 

010 

100100 

0010 

00100100 

11 

1000 

011 

001000 

0011 

00001000 

which  encoding  is  performed,  the  la- 
bels OX  and  IX  in  Fig.  7b  refer  to  the 
output  bits  of  Q7  and  Qc,  respec- 
tively, where  X  (for  the  Q-  output) 
is  a  don't-care  condition. 

With  PROSE  running  at  twice  the 
uncoded  data  rate  and  employing  a 
four-stage  pipeline,  if  the  input  pat- 
tern for  M  is 

1111111100011000101 
0  10  11...., 

the  encoded  data  output  at  Q6  will  be 

1000100010001000000100100000010 
00100010001001000... 

Similarly,  the  instruction  (/CT0/ 
CTj)  forces  a  jump  to  C00,  which 
starts  the  decoding  operation.  Dur- 


ing decoding,  Q6  is  not  used,  but 
now  Q7  is  the  decoded  serial  data 
output.  Therefore,  the  labels  for  the 
output  pattern  of  Q  are  either  OX 
or  IX,  where  again  X  stands  for  a 
don't  care. 

For  example,  if  the  input  pattern 
before  decoding  is  the  serial  data  in- 
put N: 

1000100010001000000100100000010 
0010001000100..., 

the  output  pattern  at  Q7  will  be 

11111111000110001010.... 

with  each  bit  appearing  twice,  since 
all  legal  data  is  clocked  twice. 

In  a  practical  design,  if  erratic  pat- 
terns occur  in  the  decoded  data 
stream,  the  encoder/ decoder  should 
neither  correct  the  data  nor  stop,  but 
should  make  an  intelligent  guess  at 
the  error's  probable  nature  and  con- 
tinue without  holding  up  the  data 
stream.  However,  in  this  example, 
illegal  patterns  have  been  made  to 
enter  an  error  state,  because  this  sit- 
uation is  much  easier  to  follow  for 
illustration  purposes. 


BIT    0      1       2      3       4      5      6       7      8       9      10     11      12     13      14      15      16      17      18     19  20 


AO 

Al 

A2 

A3 

A4 

Qo 

ffi 

0.2 

03 

«4 

05 

06 

07 

cs0 

CSi 

CS2 

cs3 

CS4 

cs5 

XFo 

XFl 

I 

A 

PROM  ARRAY  OUTPUT  PATTERN  PAL  ARRAY  XOR 

ADDRESS  FEEDBACK  FEEDBACK 

FEEDBACK 


BIT  0 


10      11      12     13  14 


16     17  18 


"1 

PT2 

T/C 

cs0 

cs. 

CS2 

cs3 

cs4 

cs5 

0/P 

<b) 


PRODUCT-TERM  CONTROL 
CODE 


— V — 
UNUSED 


V 

PAL  ARRAY 
FEEDBACK 


BIT 


12     13     14      15      16     17  18 


19  20 


AO 

Al 

A2 

A3 

A4 

OAo 

OAl 

OA  2 

OA  3 

OA  4 

A5 

A6 

PROM  ARRAY 
ADDRESS 
FEEDBACK 


UNUSED 


PROM  DATA 
BIT  CODE 


MOST- 
SIGNIFICANT 
PROM  ADDRESS  BITS 


Fig.  6.  Bit  assignments  for  the  shadow  and  output  registers  during  normal  operation  (a)  are 
quite  different  from  the  shadow  and  the  output  registers  during  PAL  array  programming  (b) 
end  from  the  shadow  and  the  output  registers  during  PROM  array  programming  (c). 
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One  three-way  branch  (INIT) ,  53 
two-way  branches,  and  15  uncondi- 
tional branches  occur  in  the  2,7 
RLL.  The  three-way  branch  will  oc- 
cupy three  quadrants  of  one  primary 
location.  The  53  two-way  branches 
can  be  merged  into  40  two-way 
branches  and  placed  into  20  primary 
locations  merely  by  combining  with 
two  identical  next  states.  The  15  un- 
l  branches  can  be  merged 
into  8,  which  will  occupy  two  pri- 
'  locations.  Thus,  the  total  num- 
ber of  occupied  primary  locations  is 
)  +  2  =  23,  which  is  well  under 
the  maximum  of  32.  This  state-mini- 
mization approach  is  primitive  and 
merges  out  only  the  states  whose  sets 
of  next  states  are  the  same  or  subsets 
of  other  remaining  states. 

The  PMS14R21  PROSE  state  ma- 
chine will  be  available  in  April  for 
$25.95  each  in  quantities  of  100.  □ 


INIT  (BESET  STARTUP  STATE) 
/CT^/CTo     4  /CTrCTo 


FROM  INIT 
C51 


ERR  (FOR  ILLEGAL 
PATTERN! 


Fig.  7.  The  state-machine  diagrams  of  a 
2,7  RLL  encoder  (a)  and  its  decoder  (b) 
are  a  typical  example  of  the  possible  ap- 
plication of  the  PMS14R21  device. 
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;  THE  DESIGN  FILE 

;  The  following  specification  was  written  in  the  PALASM®  2  for- 
;  mat,  and  can  be  compiled  using  the  PALASM  2  and  PROASM1* 
;  software  packages  developed  by  Monolithic  Memories. 

;  There  are  two  corrections  to  the  said  article: 
;  1)  Table  2,  the  output  pattern  for  input  010  is  100100. 
;  2)  Table  2  (same  table),  the  output  pattern  for  input  0010  is 
00100100. 

;  This  specification  was  written  by  Frank  Lee  with  help  from 
;  Jo-NingTa 


;  Header 

Title 
Pattern 
Revision 
Author 
Company 
Date 

;  Chip 

CHIP  PROSE-27RLL  PMS14R21 

;  Pin  Definition 

CLK  DCLK  CT0  CT1  N  M  NC  NC  NC  NC  SDI  GND 
/I  SDO  Q7  Q6  NC  NC  NC  NC  NC  NC  MODE  VCC 

;  String  Definition 

;  Used  in  section  on  output  patterns 

STRING  CQS  VQ7*Q6' 

STRING  SQ6  VQ7*  Q6' 

STRING  SQ7'  Q7VQ6' 
;  State  machine  type :  Moore  machine,  output  pattern  is  dependent 
;  on  current  state  only. 

;  Feature  or  architecture  pin  is  used  as  reset  (or  initialization)  pin. 

STATE  MOORE  MACHINE  MASTER  RESET 

;  This  design  spec  starts  with  description  for  state  transitions,  fol- 
;  lowed  by  output  patterns,  and  conditions  for  the  branches. 

;  State  transitions 
;  Power-up  state 

;  The  PROSE  device  is  initialized  to  all  l's.  On  the  first  clock,  it  will 
;  perform  unconditional  branch  to  state  INIT 


14R21.G17 

27rll.eg 

03 

Frank  Lee 
MMI 

3  6 '87 
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POWER  UP:  =  VCC  ->  INIT 

;  The  PROSE  will  stay  at  INIT  until  there  is  a  command  for  it  to 
;  start  encoding  (condition  ENC)  or  decoding  (DEC).  A00  is  the 
;  starting  state  for  encoding  and  COO  is  the  starting  state  for  de- 
;  coding. 

INIT  :=  ENC  ->  A00  +  DEC  ->  COO  +->  INIT 
;  Encoder  design 

;  A00,A10,...,B00,...  are  states  for  the  encoder,  as  defined  in  fig.  7. 
;  EIN  is  defined  in  the  condition  section.  It  is  the  same  as  input  M. 
;  VCC  is  an  always-true  condition. 
;  A00  will  branch  to  All  on  EIN,  else  to  A10 

A00  :=  EIN  — >  All  -\ — >  A10 
;  A10  :  always  branches  to  A20 

A10  :  =  VCC— >A20 

All  :  =  VCC->A21 

A20  :  =  EIN  —  >  A3H  >  A30 

A21  :=  EIN  ->  A33  H  >  A32 

A30  :=  VCC— >A40 

A31  :=  VCC—  >  A41 

A32  :=  VCC->  A00 

A33  :=VCC—  >  BOO 

A40  ;=  EIN  — >  A51  +  — >  A50 

A41  :=EIN->A53  +—  >  A52 

A50  :=  VCC->A00 

A51  :=  VCC->A60 

A52  :=  VCC— >  A00 

A53  :=VCC—  >  BOO 

A60  :=EIN—  >  A71  +—  >  A70 

A70  :=  VCC->A20 

A71  :=  VCC— >  A20 

BOO  :=  EIN  — >  Bll  +  — >  BIO 

BIO  :=  VCC—  >B20 

Bll  :=  VCC->B21 

B20  :=  EIN  — >  A31  +  —  >  A30 

B21  ;=  EIN  ->  A33  -j — >  A32 


;  Decoder  design 

;  COO,C10,...,D00,...,E00,...  are  states  for  the  decoder,  as  defined  in 
;fig.7. 

;  DIN  is  defined  in  the  condition  section.  It  is  the  same  as  input  N. 
;  COO  will  branch  to  Cll  on  DIN,  else  to  CIO 

COO  :=  DIN  — >  Cll  4-  — >  CIO 

CIO  :=  DIN  — >  C21  H  >  C20 

;  ERR  is  the  error  state  for  an  illegal  encoded  pattern  received. 

Cll  :=  DIN  ->  ERR  -\ — >  C22 
C20  :=  DIN  — >  C31  +  — >  C30 
C21  :=  DIN  — >  ERR  J — >  C32 
C22  :=  DIN  — >  ERR  +  — >  C33 
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C30  : 

— 

DIN 

->C41  +- 

->C40 

C31  : 

— 

DIN 

—  >  ERR  + 

->C42 

C32 

DIN 

—  >  ERR  + 

->D00 

C33 

— 

DIN 

->C43  +- 

->E00 

C40 

— 

DIN 

— >C50  +  - 

->  ERR 

C41 

—> 

DIN 

—  >  ERR  + 

->C51 

C42 



DIN 

—  >  ERR  + 

->C52 

C43 

= 

DIN 

—  >  ERR  + 

->C53 

C50 

DIN 

— >  ERR  + 

->C60 

C51 

DIN 

— >  ERR  + 

->C00 

C52 



DIN 

->C61  +- 

->E00 

C53 



DIN 

—  >  ERR  + 

->D00 

C60 



DIN 

— >  ERR  + 

->C70 

C61 



DIN 

— >  ERR  + 

->C71 

C70 



DIN 

—  >  ERR  + 

->E00 

C71 



DIN 

— >  ERR  + 

->D00 

D00 



DIN 

— >D11  +- 

->D10 

D10 



DIN 

—  >D21  +- 

->D20 

Dll 

•   

DIN 

—  >  ERR  + 

—  >  D22 

D20 



DIN 

—  >C31  +- 

->C30 

D21 



DIN 

—  >  ERR  + 

->C32 

D22 



DIN 

— >  ERR  + 

->C33 

E00 



DIN 

— >E11  +- 

->E10 

E10 



DIN 

—  >  E21  +- 

->E20 

Ell 



DIN 

—  >  ERR  + 

—  >E22 

E20 



DIN 

—  >E31  +- 

->E30 

E21 



DIN 

—  >  ERR  + 

—  >  E32 

E22 



DIN 

—  >  ERR  + 

->E33 

E30 



DIN 

—  >  E41  +.- 

->E40 

E31 



DIN 

—  >  ERR  + 

->  E42 

E32 



DIN 

—  >  ERR  + 

->E43 

E33 

DIN 

—  >  E45  +- 

->E44 

E40 

DIN 

->C50  +- 

->  ERR 

E41 

DIN 

—  >  ERR  + 

->C51 

E42 

DIN 

—  >  ERR  + 

->C52 

E43 

DIN 

->D11  +- 

->D10 

E44 

DIN 

->E11  +- 

->E10 

E45 

DIN 

—  >  ERR  + 

->C53 

;  Error  stay  until  initialized 
ERR  :=  VCC  — >  ERR  ;STAY  UNTIL  RESET 

;  <state>.OUTF  refers  to  the  output  pattern  for  the  <state>. 
;  CQS,  SQ6,  and  SQ7  are  output  patterns.  CQS  means  output  bits 
;  6  and  7  are  all  O's.  SQ6  means  output  bit  6  is  1  and  output  bit 
;  7  is  0.  SQ7  means  output  bit  7  is  1  and  output  6  is  0.  In  all 
;  cases,  output  bits  0-5  are  all  don't  cares  as  far  as  this  design 
;  is  concerned. 
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INIT.OUTF 

= 

CQS 

A00.OUTF 

CQS 

A10.OUTF 

CQS 

All.OUTF  ' 

CQS 

A20.OUTF 

SQ6 

A21.0UTF  : 

SQ6 

A30.OUTF 

CQS 

A31.0UTF 

CQS 

A32.0UTF 

CQS 

A33.0UTF 

CQS 

A40.OUTF 

CQS 

A41.0UTF 

= 

CQS 

A50.OUTF 

= 

CQS 

A51.0UTF 

— 

CQS 

A52.0UTF 

= 

SQ6 

A53.0UTF 

CQS 

A60.OUTF 

CQS 

A70.OUTF 

= 

SQ6 

A71.0UTF 

CQS 

B00.OUTF 

= 

CQS 

B10.OUTF 

= 

SQ6 

Bll.OUTF 

SQ6 

B20.OUTF 

CQS 

B21.0UTF 

CQS 

C00.OUTF 

CQS 

C10.OUTF 

- 

CQS 

Cll.OUTF 

= 

CQS 

C20.OUTF 

= 

CQS 

C21.0UTF 

= 

CQS 

C22.0UTF 

= 

CQS 

C30.OUTF 

= 

CQS 

C31.0UTF 

= 

CQS 

C32.0UTF 

CQS 

C33.0UTF 

= 

CQS 

C40.OUTF 

= 

CQS 

C41.0UTF 

= 

CQS 

C42.0UTF 

= 

CQS 

C43  OUTF 

= 

CQS 

C50.OUTF 

= 

CQS 

C51.0UTF 

CQS 

C52.0UTF 

! 

CQS 

C53.0UTF 

= 

CQS 

C60.OUTF 

:  = 

CQS 

C61.0UTF 

:  = 

CQS 

C70.OUTF 

= 

CQS 

C71.0UTF 

CQS 

D00.OUTF 

CQS 

D10.OUTF 

SQ7 

Dll.OUTF 

SQ7 

D20.OUTF 

SQ7 

D21.0UTF 

SQ7 

D22.0UTF 

SQ7 
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E00.OUTF:  =  CQS 
E10.OUTF :  =  SQ7 
Ell.OUTF:  =  SQ7 
E20.OUTF:  =  SQ7 
E21.0UTF:  =  SQ7 
E22.0UTF:  =  SQ7 
E30.OUTF:  =  SQ7 
E31.0UTF:  =  SQ7 
E32.0UTF:  =  SQ7 
E33.0UTF :  =  SQ7 
E40.OUTF :  =  SQ7 
E41.0UTF :  =  SQ7 
E42.0UTF :  =  SQ7 
E43.0UTF:=SQ7 
E44.0UTF :  =  SQ7 
E45.0UTF:=SQ7 
ERR.OUTF:=CQS 


CONDITIONS 

The  conditions  for  state  transitions 

ENC  =  /CT1*  'CTO    ;  encode  command 

DEC  =  /CT1*  CTO    ;  decode  command 

EIN  =  M  ;  encoder  input  (original  NRZ  data) 

DIN  =  N  ;  decoder  input  (encoded  data) 


;  SIMULATION 

;  Simulation  vectors  not  shown  here 
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EPCs  and  PLDs  simplify 
VME  Bus  control  


Designer's  Guide  to 
VME  Bus  Control — Part  1 


By  using  fuse-programmable  controllers  ( FPCs)  and 
PLDs,  you  can  implement  VME  Bus  control  in  your 
computer  system  with  a  minimum  of  hardware.  Just 
a  few  chips  per  plug-in  module,  and  a  bus  arbiter, 
can  perform  all  bus-control  functions.  This  article. 
Part  1  of  a  2-part  series,  describes  the  bus-arbitration 
process  and  control  functions  and  shows  you  how  to 
implement  the  VME  Bus  protocol  in  an  FPC.  Part  2 
will  describe  the  implementation  of  slave  controllers, 
discuss  interrupt  handling,  and  provide  tools  for  pro- 
gramming the  FPC. 

Arthur  Khu,  Advanced  Micro  Devices 


Until  recently,  you  needed  many  ICs — sequencers,  mi- 
croprogrammed control  stores,  and  other  MSI/LSI 
chips — to  implement  Vj  3us  control  in  a  computer 
system.  Now,  however,  you  can  use  a  minimum  of 
hardware  to  handle  the  bus's  intermodule  communica- 
tion functions.  Just  a  few  fuse-programmable  control- 
lers (FPCs)  and  programmable-logic  devices  (PLDs) 
relieve  the  CPU  of  all  bus-control  functions. 

A  typical  VME  Bus-based  computer  system  com- 
prises one  or  more  master  modules  (eg,  CPU  boards), 
one  or  more  slave  modules  (eg,  cache/memory  boards), 
a  bus  arbiter,  and  interrupt-handling  circuitry.  A  mas- 
ter initiates  a  data  transfer  to  a  slave  by  requesting 
control  of  the  data  bus  from  the  bus  arbiter.  Once  bus 
control  has  been  granted  to  a  master,  the  master  and 
slave  exchange  control  signals  according  to  predefined 


protocols  that  guarantee  an  orderly  transfer  of  data 
between  the  communicating  modules.  The  interrupt- 
handling  circuitry  services  all  interrupt  requests. 

By  using  the  streamlined  design  in  Fig  X,  you  can 
implement  most  intermodule  communication  in  a  VME 
Bus-based  system  by  using  just  two  types  of  VLSI 
devices:  the  Am29PL141  fuse-programmable  device 
and  the  AmPAL22V10  programmable-logic  device.  For 
the  remainder  of  the  bus-control  functions,  you'll  need 
some  bus-arbitration  circuitry,  which  must  occupy  a 
particular  slot  on  the  VME  Bus  backplane,  but  which 
can  reside  on  the  same  board  with  the  bus  master  of 
highest  priority. 

When  you  design  VME  Bus  control  into  your  system, 


n 


HIGH- 
SPEED 
RAM 


Fig  1 — You  can  implement  VME  Bus  control  in  a  computer  system 
by  using  just  a  few  fuse-programmable  controllers  (FPCs)  and 
programmable-logic  devices. 
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You  can  implement  VME  Bus  control  in 
your  computer  system  by  embedding  the 
bus-control  functions  in  state  machines  that 
comply  with  the  VME  Bus  protocol. 


your  first  step  is  to  consider  the  VME  Bus  protocols. 
These  protocols  specify  the  steps  your  circuitry  must 
take  to  perform  any  bus-related  operation,  such  as  the 
transfer  of  data  between  two  modules.  You  can  describe 
these  protocols  by  means  of  flow  diagrams  that  show 
how  the  various  interface  signals  reflect  the  interac- 
tions between  the  communicating  modules.  Fig  2,  for 
example,  shows  how  the  priority  bus  arbiter  resolves 
simultaneous  bus  requests  from  two  modules  that  use 
the  same  request  line. 

Next,  you  analyze  the  flow  diagrams  to  pick  out  the 
functions  that  can  be  incorporated  into  FPCs  or  PLDs, 
and  you  define  state  machines  for  those  functions.  You 
can  describe  the  state  machines  abstractly,  by  Boolean 
logic  equations,  or  diagrammatically,  by  means  of 
standard  flow-chart  symbols  (eg,  rectangles  to  repre- 
sent processes  and  diamonds  to  represent  control-flow 
decisions).  Finally,  you  must  write  programs  for  the 
state  machines  in  a  high-level  or  assembly  language. 
You  repeat  this  process  to  design  each  of  the  four  main 
types  of  VME  Bus  controllers:  bus  arbiters,  masters, 
slaves,  and  interrupt  handlers. 

Before  any  master  module  can  perform  a  data  trans- 
fer, it  must  request  control  of  the  data  bus  from  the  bus 
arbiter;  the  arbiter  must  check  to  see  whether  the  bus 
is  free  and  must  resolve  any  contention  between  two 
masters  of  equal  priority.  For  example,  consider  a 
priority-option  bus  arbiter  implemented  on  a  single 
PLD.  The  arbiter  will  monitor  the  four  bus-request 
lines  (BR0-3)  and  assign  the  highest  priority  to  BR3. 

As  the  flow  diagram  (Fig  2)  shows,  the  arbiter 
grants  the  bus  to  the  requesting  module  that's  using 
the  highest  active  request  line,  which  is  BR]  in  this 
example.  To  enable  your  bus  arbiter  to  resolve  simulta- 
neous bus  requests  from  two  or  more  modules  that  use 
the  same  request  line,  you  must  daisy-chain  the  associ- 
ated bus-grant  signal  to  all  the  devices  using  that 
request  line.  Therefore,  the  arbiter  must  be  in  the  first 
slot  of  the  VME  Bus  system.  The  module  that's  closest 
to  the  bus  arbiter  will  have  the  highest  priority:  If  it 
requests  the  bus,  it  will  receive  the  bus  grant  and  lock 
out  any  modules  farther  down  the  chain  (Listing  1). 
The  AND/OR  array  of  the  PLD  processes  all  bus 
requests  in  parallel,  so  that  priority  arbitration  is 
complete  in  only  one  clock  cycle. 


Priority  arbitration  options 

Sometimes,  the  arbiter  must  force  the  current  bus 
master  to  relinquish  control  of  the  bus;  this  situation 
occurs,  for  example,  when  a  bus  master  of  higher 


LISTING  2 

IF  (BBSY*(BR0+BR1  +  BR2+BR3))  THEN 
BEGIN 

IF  (BR3'(/(MASTER[1:0]  -  3)  )  )  THEN 

BCLR  :•  V,     .  "assert  BCLR  if  MASTER  <  >  3" 

IF  (/BR3-BR2-(/(MASTER(1:0l  •  3)  )  )  THEN 
BCLR  :.  1; 

IF(MASTER[1:0]  =  0)  THEN 
BCLR  =1; 

END; 


priority  initiates  a  bus  request.  The  arbiter  examines 
the  priorities  of  both  the  current  bus  master  and  the 
new  requester.  If  these  conditions  meet  the  predefined 
bus-clear  conditions,  the  arbiter  asserts  the  bus-clear 
signal  (BCLR). 

In  the  design  in  Fig  1,  the  bus  arbiter  keeps  track  of 
the  current  bus  master's  priority  by  recording  the 
bus-request  line  that  was  used  to  gain  control  of  the 
bus.  For  example,  if  the  current  bus  master  used  BR2  to 
gain  control  of  the  bus,  it  sets  two  output  registers 
called  Bus_Master  to  the  binary  value  2.  Either  of  the 
two  following  conditions  will  activate  BCLR: 

•  The  value  in  Bus_Master  is  2,  1,  or  0  and  the 
active  bus  request  line  is  3  or  2. 

•  The  value  in  Bus_Master  is  0,  and  any  bus- 
request  line  is  active. 

When  the  value  in  Bus_Master  is  3,  the  arbiter  will 
not  honor  any  bus  requests  until  the  Bus  Busy  line 
(BBSY)  is  high.  Listing  2  gives  the  logical  expression  of 
these  conditions.  Once  having  asserted  BCLR,  the 
arbiter  holds  this  line  in  the  active  state  until  the 
current  bus  master  releases  BBSY. 

Be  sure  to  define  the  BCLR  in  such  a  way  that 
uninterruptible  devices  can  use  bus-request  line  3, 
which  has  the  highest  priority.  Devices  that  temporari- 


LISTING  1 

IF(/BBSV(BRO+BR1  +  BR2-t-BR3))THEN 

"if  bus  not  busy  and  a  " 

BEGIN 

"  request  line  is  active  " 

IF  (BR3)  THEN 

"it  BR3  is  active,  grant  - 

BG3IN      1  ; 

"  bus  to  device  on  8R3  " 

IF  (/BR3-BR2)  THEN 

"activate  bus  grant  * 

BG2IN  :=  1; 

"  daisy  chain  2  * 

IF  (/BR37BR2"BR1)  THEN 

"if  BR1  is  active  and  BR3" 

BG1IN  .=  1; 

"  and  8R2  are  not.  then  " 

"  grant  bus  to  device  * 

"  using  request  tine  1 

IF  (/BR3VBR2VBR1-BR0)  THEN 

"8R0-3  and  BGOIN  to 

BGOIN      1  ; 

"  BG3IN  are  active  low  " 

END  ; 

IF  (/BBSVBG1IN)  THEN  "if  BG1IN  is  asserted  in  response  to  a  request  " 

BG1IN  :=  1;           "over  line  BR1,  then  continue  asserting  BG1IN" 

"until  requesting  device  drives  BBSY  active  " 
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Fig  2 — Flow  diagrams  help  you  define  the  VME  Bus  protocols.  This  flow  diagram  illustrates  how  the  bus  arbiter  chooses  between  requesters 
that  have  the  same  priority  level. 
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LISTING  3 


DEVICE  ■  MASTER-  (Am2»PL141)     "specify  the  device  to  a 
DEFINE 


bus_request 
bus_grant 


-  10 

-  tl 

-  12 

-  13 


"define  the  test  conditions" 
"master  request  input  •' 
"bus  grant  in  line  " 
"data  transfer  acknowledge  line " 
"check  address  strobe  bus  line  " 


.  0000#h 

.  eooo#h 

busmaster  -  4000#h 
arb_req    -  2000#h 


"off  state 
"16th  bit  -  1  : 
"15th  bit  .  1  :  inform  master 
"14th  bit  -  1  :  drive  BRn  line 
"13*  bit  -  1  :  drive  B8SY  line 
1  thru  Inverting  h 


"signals  going  to  the  bus  are 
bus  driver;  these  include  " 


signed      -  OOOIHHi ;    "other  control  signals 
"FPC  assembler  format  is  -  <label:oulput,statement;  > ,  wtlh  label 


BEGIN 

"wait  for  a  bus  request  to  be  asserted  high" 
(5)  start  :otf_atate        .  while  (not  bus_request)  wart 

else  goto  pt(get_ bus)  I 
©  bus_granted  "assert  BBSY  and  BUSMASTER  signals" 
:  bbsy+busmaster 

.  if  (addr_strobe  -  1)  then 
goto  pKbus_granted) ; 
@  -A9(L)  must  be  HIGH  before  continuing;  this  meina  previous  t 
r  is  not  driving  the  bus  i 
"other  statements" 


"BUS  ACQUISITION  microcode  subroutine:  the  control  output 

ARB  REQUEST  Is  asserted  (BRn  line)  until  the  /BQINn  signet 

is  received  active  LOW;  if  the  bus  request  line  from  the  master 
goes  LOW  before  the  bus  Is  granted  (e.g.,  master  cancels  bus 
request),  then  turn  off  the  ARB—REQUEST  output  and  return 
to  the  wait  state  START 


(B)  get_bua:  arb_request 
arb_request 

off  state 

END,  end  of  source  code 


.  if  (bue_grant  -  0)  then 
goto  pKbus_granted) ; 

,  if  (bus_requeat)  then 
goto  p«get_JDue) ; 

,  if  (not  bus_request)  then 
goto  pKetart) ; 







Fig  3 — You  can  implement  the  requester  logic  for  a  master  module 
in  an  FPC.  The  FPC  handles  all  bus-acquisition  and  data-transfer 
protocols. 


ly  can  be  suspended  to  accommodate  interrupts  and 
higher-priority  operations  should  be  assigned  to  bus- 
request  line  0.  The  BCLR  conditions  will  vary  with 
your  application,  but  you  can  modify  them  just  by 
redefining  the  high-level  logic  expressions. 


Master  modules 

The  master  modules  of  VME  Bus  systems  initiate  all 
data  transfers  over  the  bus.  A  system  can  have  one 
master  (the  CPU  unit)  or  several  (multiple  processors, 
DMA  controllers).  A  master  module  must  control  the 
bus  before  it  can  perform  any  data  transfers.  Most  of 
the  bus-control  logic  resides  in  the  requester  section  of 
the  module;  this  section  handles  bus  acquisition  and 
communication  with  other  master  or  slave  devices  in 
the  system. 

You  can  microprogram  all  the  requester  functions 
into  a  single  FPC,  which  serves  as  the  interface  be- 
tween the  master  device  and  the  VME  Bus  (Fig  3).  The 
FPCs  microprogram  handles  the  bus-acquisition  proto- 
col. Upon  receiving  the  bus-grant  signal  from  the 
arbiter,  the  microprogram  informs  the  master  device 
that  it  has  control  of  the  bus  and  may  initiate  a  data 
transfer.  After  completing  the  transfer,  the  master 
device  releases  the  bus-request  signal,  thus  causing  the 
FPC  to  free  the  bus  by  releasing  the  BBSY  signal.  The 
FPC  may  also  relinquish  control  of  the  bus  at  the 
request  of  the  bus  arbiter. 

In  designing  your  application,  you  need  to  extract  the 
bus-acquisition  phase  of  the  requester  from  the  flow 
diagram  in  Fig  2  and  translate  this  information  into  a 
state  diagram  (Fig  4),  from  which  you  must  write  the 
corresponding  FPC  assembler  source  code.  Listing  3 
gives  an  example  of  this  code. 

FPC-controlled  data  transfers 

Once  a  master  module  becomes  the  bus  master  (that 
is,  once  it  gains  control  of  the  bus),  it  can  begin 
transferring  data  to  a  slave  module.  A  data-transfer 
flow  diagram  (Fig  5)  shows  the  necessary  handshaking 
signals  for  transferring  32-bit  data  between  master  and 
slave.  The  state  diagram  for  the  master  module's  FPC 
(Fig  4)  combines  the  bus-acquisition  (requester)  and 
data-transfer-control  functions  of  the  master  module. 
An  FPC  assembler  (which  the  manufacturer  of  the  FPC 
provides)  simplifies  the  task  of  microprogramming  this 
state  machine  into  an  FPC. 

Handling  unanswered  data-transfer  requests 

To  prevent  bus  lockups,  which  malfunctioning  slave 
devices  may  cause,  you  should  implement  a  bus-time- 
out (BTON)  option  in  the  master  module  by  writing  a 
microcode  routine  that  uses  the  6-bit  counter  in  the 
master's  FPC.  (Listing  4  gives  an  example  of  such  a 
routine.)  The  bus-time-out  option  permits  a  bus  master 
to  abort  a  data-transfer  cycle  if  the  slave  does  not 
respond  within  n  microseconds. 

At  the  start  of  every  data-transfer  operation,  the 
FPC  microprogram  loads  a  value  into  the  6-bit  counter, 
tests  for  the  data-transfer  acknowledge  signal 
(DTACK),  decrements  the  counter  and  tests  it  for  zero, 
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LT  MASTER  BUS 
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Fig  4— You  can  derive  detailed  slate-machine  diagram*  from  your  flow  diagrams.  In  this  diagram  for  a  master  requester,  the 
bus-acquisition  phase  is  surrounded  by  a  dashed  line.  Note  the  close  correspondence  between  the  state  diagram  and  the  assembly-language 
program  in  Listing  S. 
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Fig  5— This  (low  diagram  shows  the  signals  you'll  need  to  perform 
a  S2-bit  data  transfer  once  a  master  has  acquired  control  of  the 
bus. 


The  bus-time-out  option  permits  a  bus 
master  to  abort  a  data-transfer  cycle  if  a 
slave  does  not  respond  within  a  specified 
time. 


LISTING  4 

"Begin  transfer :  Bus  time  out  loop  contains  instructions  1  and 
2  If  count  reg  is  loaded  with  63  and  FPC  is  running  at  20  MHz. 
then  total  time  before  bus  times  out  is  (63+1)"2-50  ns 
wait_loop:  DSO  +  DS1  ,  if  (DTACK  >  0)  then  "1" 
goto  pl(transler„complete) ; 
DSO  +  DS1  .  while  (creg  <>  0)  "2" 

loop  to  pl(wait_loop) ; 
DSO  +  DS1  ,  if  (creg  -  0)  then 

goto  pl(bus_time_out) 


trans!er_complete: 


"goto  to  this  section  if  handshaking  signal 
was  received" 


bus_time_out:  "execute  this  section  when  counter  is  0" 

BTO_signal  "activate  bus  time  out  signal" 


and  finally  loops  back  to  the  instruction  that  tests 
DTACK.  As  soon  as  DTACK  is  detected,  the  program 
branches  to  the  section  of  code  that  handles  a  normal 
data-transfer  operation. 

If  the  counter  value  reaches  zero,  however,  the 
microprogram  must  branch  to  a  section  of  code  that  can 
handle  situations  in  which  data-transfer  requests  are 
not  completed;  such  situations  are,  of  course,  entirely 
dependent  on  the  user  and  the  application.  To  calculate 
the  time  that  will  elapse  before  the  program  generates 
a  bus-time-out  signal,  multiply  the  number  of  instruc- 
tions in  the  time-out  loop  by  the  system  cycle  time  by 
the  initial  value  (plus  1)  that  the  program  has  loaded 
into  the  count  register.  EDM 
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FPCs  and  PLDs 
implement  VME  Bus 
slave  controllers 


Designer's  Guide  to 
VME  Bus  Control — Part  2 


By  using  fuse-programmable  controllers  (FPCs)  and 
PLDs,  you  can  implement  VME  Bus  control  in  your 
computer  system  with  a  minimum  of  hardware.  Part 
1  (October  2,  pg  187)  of  this  2-part  series  described 
the  bus-arbitration  process  and  control  functions  and 
showed  how  to  implement  the  VME  Bus  data-transfer 
protocol  in  an  FPC .  This  second  and  final  part  de- 
scribes the  implementation  of  similar  controllers  for 
slave  modules  and  interrupt  handlers,  and  it  pro- 
vides tools  for  programming  the  FPC. 


Arthur  Khu,  Advanced  Micro  Devices 

When  you  offload  a  system's  bus-control  functions  from 
the  CPU  to  a  hardware  bus  controller,  you  considerably 
reduce  the  time  these  functions  require,  thereby  im- 
proving the  data-transfer  rate  over  the  bus.  You  can 
substantially  reduce  the  cost  and  chip  count  of  such  a 
bus  controller  by  implementing  the  bus-control  logic  in 
VLSI  devices  such  as  fuse-programmable  controllers 
(FPCs)  and  programmable  logic  devices  (PLDs). 

The  VME  Bus  protocol  requires  that  all  data  trans- 
fers over  the  bus  be  initiated  by  a  master  module.  Your 
system  can  include  several  master  modules,  such  as 
CPUs  or  DMA  controllers;  a  bus-arbiter  module  arbi- 
trates simultaneous  data-transfer  requests  from  two  or 
more  masters.  No  data  transfer  can  take  place  until  the 
requesting  master  has  been  given  control  of  the  bus  by 
the  bus  arbiter. 


MEMORY  BOARD  (SLAVE) 


DATA 


ADOBE; 

oer-- 


SERA} 


INTERRUPT 
REQUEST  1 


LATCHJJATA 
STORE-OATA 


PRESENT_DATA 


AmPAL22V10 
PLD 

i7r—x: — 


FPC  Am29PL14! 

ADORJiALID 

OPER, 

OPERi 

;ts  ds,., 

IR  OTACKBERfl;, 

ADDRESS  MODIFIERS  AM,., 


Fig  1 — This  device  controller,  which  is  implemented  with  an  FPC 
and  a  PLD,  handles  data-transfer  ajid  interrupt  protocols  for  a  slave 
device  in  a  VME  Bus  system. 

Because  a  slave  can't  initiate  a  data  transfer,  your 
system  must  include  an  interrupt  mechanism  that  al- 
lows a  slave  to  request  service  from  a  master.  You  w 
implement  such  a  mechanism  in  your  system  by  design- 
ing a  slave  subsystem  like  the  one  in  Fig  1.  This  slave 
subsystem  comprises  a  slave  module  and  a  slave  inter- 
rupt controller.  The  slave  interrupt  controller,  which 
consists  of  an  Am29PL141  FPC  and  an  AmPAL22V10 
PLD,  serves  as  the  interface  between  the  slave  subsys- 
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The  interrupt  controller  for  a  slave  device 
implements  the  interrupt  protocol  in 
hardware. 


tem  and  the  VME  Bus  structure.  Once  the  master  has 
initialized  the  slave,  it  can  issue  commands  to  the  slave. 
The  slave  performs  these  tasks  in  the  background, 
leaving  the  master  free  to  continue  its  own  operations. 
When  the  slave  is  ready  to  return  the  status  or  result  of 
a  task  to  the  master,  it  issues  an  interrupt  request. 

The  VME  Bus  single-cycle  data-transfer  protocol 
defines  the  interactions  between  master  and  slave  con- 
trollers (Fig  2).  When  the  master  drives  the  address 
strobe  (AS)  low,  the  slave  controller  latches  the  address 
presented  on  the  bus,  and  a  separate  address-decoding 
unit  on  the  slave  board  decodes  the  address.  If  the 
address  is  not  valid  (ie,  if  it's  not  in  the  range  associated 
with  this  slave),  the  slave  takes  no  further  action. 
However,  if  the  address  selects  this  slave  subsystem, 
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nam  wansjers~over  we  om  oe  imnatea  ay 
a  master  module. 


the  slave  looks  for  signals  from  the  master  that  specify 
whether  a  read  or  a  write  operation  should  take  place. 
After  presenting  or  storing  data,  the  slave  controller 
drives  the  data-transfer-acknowledge  signal  (DTACK) 
low  to  inform  the  master  of  the  successful  transfer. 

From  the  VME  Bus-protocol  flow  diagrams  (Fig  2), 
you  can  derive  a  state  diagram  (Fig  3)  for  the  slave- 
controller  state  machines;  Fig  4  shows  the  resulting 
timing  pattern  for  the  slave  controller.  You  can  develop 
microcode  for  the  slave  controller's  single-cycle  transfer 
mode  from  the  state  diagram  and  the  timing  pattern. 
Use  the  address-modifier  lines  (AMj.s)  to  specify  the 
other  three  slave  operating  modes  (sequential,  or  block, 
transfers;  read-modify-write  transfers;  and  interrupt 
cycle).  To  recognize  these  special  modes,  the  slave 
controllers  on  each  slave  board  constantly  monitor  the 
six  address-modifier  lines. 

When  the  code  presented  on  the  address-modifier 
lines  specifies  a  block-transfer  operation,  the  master 
retains  control  of  the  bus  throughout  the  operation  by 


holding  AS  and  BBSY  low.  For  a  block  transfer,  bus 
arbitration  takes  place  only  once,  before  the  start  of  the 
operation;  for  single-cycle  transfers,  bus  arbitration 
takes  place  before  the  transfer  of  each  word.  A  block 
transfer,  therefore,  takes  less  time  than  does  the  corre- 
sponding number  of  single-word  transfers. 

At  the  start  of  a  block-transfer  operation,  all  the 
slaves  load  the  address  presented  on  the  bus  into  their 
address  counters  and  decoders,  but  only  the  slave 
whose  memory  range  encompasses  the  decoded  address 
responds  to  the  data-transfer  request.  As  each  word 
transfer  is  completed,  all  the  slaves  increment  (or 
decrement)  their  address  counters  and  decode  the  new 
address.  This  procedure  is  necessary  because  the  mem- 
ory block  being  transferred  may  reside  on  more  than 
one  slave  memory  board. 

In  the  slave  subsystem  in  Fig  1,  the  PLD  decodes 
control  signals  from  the  bus  and  slave  board  and  sends 
two  signals,  OPER„  and  OPER,,  to  the  slave's  FPC.  The 
FPC  decodes  these  two  signals,  along  with  inputs  from 


won 

VALID 


WRITE 
lACK 


SERR 


STORE 


mi 


DTACK 


READY  WHEN  7S  ACTIVE 


-  MASTER  SPECIFIES  A  WRITE  OPERATION 


SLAVE  COMPLETES  OPERATION  WITHOUT  ERROR 


MASTER  HAS  SUPPLIED  OATA 


STORE  DATA  ON  NEXT  CYCLE 


SLAVE  HAS  LATCHED 
AND  STORED  DATA 


MASTER  DRIVES  AS.  SBi 
HIGH;  - 


I;  SLAVE  S 


BUS  ERROR  SIGNAL  HIGH.  NO  DATA-TRANSFER  ERRORS 


DEVICE-SELECT  SIGNAL 
SENT  TO  SLAVE  


Fig  4 — You'll  need  to  generate  a  timing  diagram  for  each  of  the  slave  controller's  operating  modes.  This  diagram  shows  the  timing  for  a 
single-cycle  data  transfer. 
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the  slave,  bus,  and  address-decoding  units  on  the  slave 
board,  to  determine  which  of  the  four  possible  slave 
operating  modes  to  execute.  The  four  modes,  which  are 
designated  by  binary  codes,  specify  the  following  oper- 
ations: 

•  (00)  Perform  a  single-cycle  transfer 

•  (01)  Perform  a  sequential-cycle  transfer 

•  (10)  Perform  a  read-modify-write  cycle 

•  (11)  Perform  an  interrupt  cycle. 

If  OPERo  and  OPER,  are  both  high,  the  FPC  operates 
as  an  interrupter  by  branching  to  an  interrupt  subrou- 
tine (Fig  3). 

When  the  slave  requests  an  interrupt,  the  FPC 
generates  an  interrupt-request  signal  and  waits  for  the 
interrupt-aeknowledge  signal  (IACK)  and  daisy-chain 
signal  (IACKIN).  When  the  data  strobes  become  ac- 
tive, the  PLD  reads  a  3-bit  value  from  the  address  bus 
(A13);  this  value  indicates  which  interrupt-request  line 
was  acknowledged.  The  PLD  decodes  these  three  bits 
to  determine  whether  their  value  matches  its  own 


request  level;  if  it  finds  no  match,  no  further  action 
occurs.  If  it  does  find  a  match,  however,  the  PLD  routes 
a  valid  signal  to  the  FPC,  indicating  that  the  interrupt 
handler  has  acknowledged  the  slave's  interrupt  re- 
quest. The  FPC  then  signals  the  slave  board  to  put  its 
status  or  identification  byte  on  the  data  bus  for  the 
interrupt  handler  to  use  as  an  interrupt  vector.  The 
slave  FPC  waits  in  a  loop  until  the  interrupt  handler 
drives  the  IACK  signal  high  to  signify  that  interrupt 
service  is  complete. 

Besides  containing  master,  slave,  and  bus-arbiter 
modules,  a  VME  Bus  system  usually  has  an  interrupt  - 
handler  module  that  handles  external  I/O  or  special 
system  events  (time-out  or  overflow  errors,  for  exam- 
ple). You  can  reduce  the  logic  complexity  of  the  inter- 
rupt-handler  module  in  your  system  by  offloading  some 
of  the  initial  interrupt-recognition  tasks  to  an  inter- 
rupt-handling  preprocessor  (I  HP).  You  can  use  a  PLD 
as  the  IHP,  programming  it  to  preprocess  interrupt 
requests,  obtain  control  of  the  bus,  and  handle  hand- 


DEVICE  DRIVES  INTERRUPT-REQUEST  LINE  LOW 


BUS  REQUEST 


INTERRUPTER  PUTS  STATUS 
BYTE  ON  BUS 


BUS-GRANT  SIGNAL  IS  NOT  DAISY-CHAINED  OUT 
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2 


SERVICE -DONE  SIGNAL  FORCES  IHP 
TO  RELEASE  5S5Y  SIGNAL 


-  ACKNOWLEDGE  INTERRUPT  LEVEL  6  (1 10) 


5 


INFORM  INTERRUPTER  THAT  3-BfT 
INTERRUPT- ACK NO WLE DGE  LEVEL  IS  ON  THE  8  US' 


LATCH  STATUS/IDENTIFICATION  BYTE 


START 
SERVICE 


IHP  TELLS  INTERRUPT  HANDLER 
TO  BEGIN  INTERRUPT  SERVICE 


SERVICE 
DONE 


INTERRUPT  HANDLER  TELLS  IHP  THAT 
INTERRUPT  SERVICE  IS  COMPLETEO  ~ 


NOTE: 

INTERRUPT- HANDLING  P 


Fig  5 — The  interrupt-handling  preprocessor  monitors  the  seven  VME  Bus  interrupt-request  lines  (IR, :),  identifies  the  interrupting  device, 
and  tells  the  interrupt  handler  when  to  begin  servicing  the  interrupt.  This  timing  diagram  shows  the  signal  states  that  exist  before  and  during 
the  transfer  of  the  identification  and  status  bytes  from  the  slave-interrupt  subsystem  to  the  interrupt  handler. 
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When  the  slave  module  is  in  block-transfer 
mode,  bus  arbitration  takes  place  only  once 
before  the  start  of  the  operation. 


shaking  signals  (such  as  interrupt-acknowledge  sig- 
nals). Only  when  the  IHP  latches  the  interrupt  vector 
will  control  pass  to  the  interrupt-handler  module. 

To  define  interrupt-request  processing,  bus  acquisi- 
tion, and  the  interrupt  vector's  transfer  phase,  you'll 
have  to  use  logic  equations  written  in  high-level  Boole- 
an Inotation.  In  the  design  in  Fig  1,  the  PLD  monitors 
seven  interrupt-request  lines  and  four  data-transfer 
control  inputs,  and  it  sends  10  control  signals  to  the 
interrupt  handler  and  the  VME  Bus  drivers.  The  PLD 
monitors  all  interrupt-request  lines  according  to  the 
following  logic  equation: 


If  any  interrupt-request  line  is  active,  the  PLD  asserts 
BR3  to  initiate  the  bus-acquisition  phase. 

The  next  step  is  to  wait  for  the  bus-grant-in  signal. 
Only  when  BGIN3  is  active  will  BBSY  be  active.  The 
logical  expression  of  this  condition  is  given  in  the 
following  equations: 


IF  (BR3*/BG3IN)  THEN 
BR3 :  =  1  ; 


(2) 


IF  (BR3*BG3IN 
BBSY  : 


BBSY*/SERVICE_DONE)  THEN  (3) 
:  1  : 


IF  (IR1  +  IR2  +  IR3  + 

IR6  +  IR7)  THEN 
BR3 :  =  1  ; 


IR4  +  IR5  + 


(1) 


"THIS  INTERRUPT  HANDLER  USES" 
"THE  BR3  REQUEST  LINE" 


Eq  2  continually  asserts  BR3  as  long  as  BG3IN  is  not 
active;  Eq  3  asserts  BBSY  only  when  request  line  BR3 
and  BG3IN  are  active,  or  if  service  is  not  complete  after 
the  IHP  asserts  BBSY. 
When  the  IHP  is  the  bus  master,  it  puts  the  3-bit 


n 


evelopment  tools  help  you  program  FPCs 


The  Am29PL141  is  a  single-chip  fuse-programma- 
ble controller  (FPC)  that  can  implement  state  ma- 
chines and  distributed  control  functions.  You  can 
express  and  functionally  verify  these  state  ma- 
chines and  functions  by  using  three  FPC-develop- 
ment  tools  provided  by  the  manufacturer:  an  as- 
sembler, a  test-vector  generator,  and  a  simulator. 
These  development  tools,  which  are  written  in  C, 
run  on  an  IBM  PC  or  compatible  computer  under 
MS-DOS.  Source  code  for  the  programs  is  avail- 
able, so  you  can  port  the  software  to  other 


be  high,  low,  or  disabled.  The  lower  eight  output 
bits  are  always  enabled.  The  op-code  field  speci- 
fies which  of  the  29  possible  instructions  the  FPC 
will  execute  if  the  condition  selected  by  the  test 


The  FPC  has  a  control  store  that's  resident  in 
its  PROM;  the  store  is  64  words  long  and  32  bits 
wide.  Instructions  such  as  jumps,  loops,  and  sub- 
routine calls  in  the  PROM  control  store  are  condi- 
tionally executed  by  the  20-MHz  internal  seq- 
uencer. Each  32-bit  instruction  is  partitioned  into 
the  following  format: 

BITS:      15        13       6  16 
FIELD:  OE  OPCODE  POL  TEST  DATA  OUTPUT 

The  output  field  contains  the  16  control  outputs 
that  the  FPC  generates  when  it  executes  an  in- 
struction. The  upper  eight  output  bits  are  con- 
trolled by  the  output  enable  (OE)  bit;  they  may 
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Ta  program  the  Am29PLUl  FPC  to  implement  VME  But- 
control  function,  you  can  use  the  assembler,  test-vector  genera- 
tor, and  simulator  provided  by  the  FPC's  manufacturer.  These 
development  tools,  which  are  written  in  C.  run  on  an  IBM  PC  or 
compatible  computer  under  MS-DOS. 
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LISTING  1 


IF  (BBSY)  THEN 

BEGIN 
(1)  IF  (IR7)  THEN 

INTftp^fl  :  -  7  : 

IF  (/iR7*IR8>THEN 
INTH{2:)1 :  •  6  ; 

IF  (/M7*/IM*IR6)  THEN 
INTRf2:0] :  -  S  ; 


"IR7  w 

"3-btt  value  Acknowledging 
Interrupt  line  7  " 
"IR6  active,  IR7  Inactive 
"acknowledge  Interrupt  line  6" 
"IR5  highest  priority  " 
"Interrupt  Hne  active  " 


(2)  IF  </IR7*/tR«VIRS7'R«7IR3*/iR2-IR1,  THEN 

1NTR(2:0] :  -  1  ;       "acknowledge  Interrupt  Hne  1" 

END; 


interrupt-line-acknowledge  value  on  the  bus  and  as- 
serts  the  data  strobes.  Upon  receipt  of  the  DTACK 
signal  from  the  interrupter,  the  IHP  strobes  the  status 
byte  from  the  bus  into  a  register  on  the  interrupt- 
handler  card.  The  IHP  then  informs  the  interrupt 
handler  that  a  status/identification  byte  is  ready  and 
that  the  handler  should  begin  servicing  the  interrupt. 
The  IHP  resolves  interrupt  priorities  within  a  single 
clock  cycle  because  all  interrupt/input  signal  lines  are 
processed  in  parallel  by  the  logic  array  in  the  PLD. 
Listing  1  shows  how  you'd  express  the  procedure  in  a 
logic-description  language. 

As  you  can  see  from  Listing  1,  if  both  the  IR7  and  the 
BBSY  signals  are  active  in  section  1,  then  the  3-bit 
value  generated  by  the  IHP  will  be  binary  111  (decimal 
7),  regardless  of  the  state  of  the  other  interrupt  lines. 
In  section  2,  if  IRl  and  BBSY  are  active  and  the  other 
six  control  signals  are  inactive,  then  the  3-bit  output 
value  will  be  binary  001  (decimal  1).  In  section  2,  IRl  is 
the  highest  priority  line  that  is  active. 

The  remaining  interrupt-preprocessing  steps  com- 
plete the  transfer  of  the  interrupt-yector  byte;  this 
transfer  is  described  logically  in  the  following 


IF  (BBSY)  THEN  (4) 
BEGIN 

IACK  :=  1;  "begin  interrupt  acknowledge  daisy" 

(X)  IF  (DTACK)  THEN  "chain;  if  device  sends  data 

LATCH. STATUS  :=  1;.    "transfer  acknowledge  (DTACK)  " 
"signal,  then  latch  the  status  " 

END: 
IF  (IACK)  THEN 

AS  :=  1;  "assert  the  address  strobe  signal  to  inform  the 

interrupter  that  the  interrupt  acknowledge 

level  is  ready" 


field  is  true.  The  polarity  bit  (POL)  determines 
whether  a  high  input  or  a  low  input  represents 
true  for  the  test  input  selected.  You  can  use  the 
data  field  as  an  argument  for  the  op-code  field. 
For  example,  if  you  want  the  FPC  to  jump  (op 
code  25)  to  location  34  (22HEx)  when  the  second  bit 
in  the  test  field  is  high  (true),  you  put  the  follow- 
ing 32-bit  word  into  the  control  store  (though 
youH  also  have  to  define  the  OE  bit  and  the 

OE  11001  0  010  100010  OUTPUT 

The  assembler  simplifies  system  design  by  al- 
lowing you  to  use  a  high-level  language.  Instead 
of  specifying  the  bit  patterns  shown  above,  you 
can  write  the  following  section  of  code: 

OUTPUT  ,  IF  (T2  =  1)  THEN 
GOTO  PL  (34); 

The  assembler  translates  this  statement  into  the 
appropriate  bit  patterns;  when  the  FPC  executes 
the  nstruction,  it  generates  the  bit  pattern  de- 
fined by  the  symbol  OUTPUT. 

Every  microinstruction  written  in  assembly 
language  follows  this  format: 


<  LABEL  :  >  OUTPUT  ,  STATEMENT  ; 

The  label  field,  which  is  optional,  simplifies  the 
writing  of  subroutine  calls  and  conditional  branch 
instructions,  which  you  can  express  in  IF-THEN- 
ELSE,  WHILE-DO,  or  COMPARE  forms.  Once 
you've  written  all  the  instructions  with  the  editor 
and  translated  them  to  executable  form  with  the 
assembler,  you  can  cause  the  assembler  to  gener- 
ate a  JEDEC  fuse  map. 

Before  physically  programming  the  PROM  con- 
trol store  that  you've  designed,  you  can  test  your 
design  with  the  help  of  the  test-vector  generator 
and  simulator.  The  test-vector  generator  produces 
test  vectors,  from  a  user-generated  truth  table, 
and  converts  them  into  a  JEDEC-standard  test- 
vector  file  that  the  simulator  can  use. 

The  simulator  performs  two  important  func- 
tions: It  verifies  the  control  flow  of  your  design, 
and  it  checks  the  fuse  map  that  the  assembler 
generates.  By  using  the  simulator's  interactive 
mode,  you  can  inspect,  modify,  or  preset  any  of 
the  internal  registers  in  the  FPC.  The  simulator 
also  has  single-stepping  and  break-point  features 
so  that  you  can  trace  the  operation  of  your  design 
in  detail. 


load  some  of  the  initial  interrupt-recogni- 
tion  tasks  of  your  system's  interrupt-han- 
dler  module. 


Once  the  assertion  of  the  DTACK  signal  indicates  the 
successful  transfer  of  the  8-bit  status  or  identification 
byte,  the  IHP  instructs  the  interrupt-handler  module 
to  begin  the  interrupt-service  routine.  This  instruction 
from  the  IHP  is  logically  defined  as  follows: 

IF  (BBSY*LATCH_STATUS  +  BBSY*  (5) 

START_SERVICE*/SERVICE_DONE)  THEN 
STA RT_SE RV ICE  :  =  1; 

This  definition  states  that  the  IHP  constantly  asserts 
the  START-SERVICE  signal  until  the  interrupt  han- 
dler generates  a  SERVICE_DONE  signal,  at  which 
point  the  IHP  drives  START-SERVICE  low.  The  logic 
described  above  generates  the  timing  diagram  shown  in 
Fig  5. 


Development  tools  provided  by  FPC  and  PLD  manu- 
facturers simplify  the  task  of  programming  these  de- 
vices as  VME  Bus  controllers.  Once  you've  analyzed  the 
bus  protocols  and  converted  these  into  state-machine 
diagrams,  you  can  write  assembly-language  programs 
and  high-level  logic  equations  to  describe  the  state 
machines.  The  assembler  and  logic  software  will  then 
process  these  programs  and  equations  to  fit  into  the 
FPC  or  PLD.  For  a  summary  of  the  programming  tools 
available  for  the  Am29PL141  FPC,  see  box,  "Develop- 
ment tools  help  you  program  FPCs."  EDM 
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The  first  fuse-programmable  controller 
eliminates  bulky  and  expensive  designs,  freeing  distributed 
intelligence  to  carve  out  a  greater  niche  for  itself. 


■  n  much  the  same  way  that  cars  and  highways 
I  spawned  the  suburbs,  standard  microprocessor 
I  buses  and  add-on  boards  have  distributed  pro- 
cessing intelligence,  revolutionizing  the  design  of 
digital  systems.  Breaking  systems  into  independent 
modules  shortens  the  design  cycle,  eases  upgrades, 
and  accelerates  fault  diagnosis.  But  despite  these  ad- 
vantages, an  essential  element  has  been  missing:  a 
one-chip  controller  geared  specifically  to  the  needs  of 
distributed  intelligence. 

Without  that  critical  ingredient,  engineers  have 
been  forced  to  turn  to  less  than  optimal  solutions. 
One  approach  relies  on  boards  packed  with  as  many 
as  35  SSI  and  MSI  devices.  Another  tack  is  to  go  with 
a  powerful— yet  costly— VLSI  chip.  Alternatively,  a 
programmable  logic  device  can  be  pressed  into  ser- 
vice, but  such  circuitry  lacks  the  computing  power  to 
control  peripherals. 

The  missing  element,  a  fuse-programmable  con- 
troller chip,  is  now  here.  By  mixing  intelligence  and 
control,  the  Am29PL141  stakes  out  new  territory  for 
distributed  systems.  The  20-MHz  IC  combines  for  the 
first  time  all  of  the  elements  of  an  intelligent  micro- 
code controller.  Its  powerful  sequencing  logic. steps 
through  the  controller's  64-by-32-bit  pipelined 
PROM.  That  fuse-programmable  memory  stores  a 
user-defined  microprogram  drawn  from  a  set  of  29 


Om  Agrawal  and  Deepak  Mithani 

Advanced  Micro  Devices  Inc.,  901  Thompson  PI., 
P.O.  Box  3453,  MS  47,  Sunnyvale,  CA  94088; 
(408)  749-2903. 


ftepnrted  with  permission  from  Elector*:  Design  Vol.33,  No.24 
CopyrtgJt  Hayden  Publish^  Co.,  he.,  1985. 


microinstructions,  including  a  repertoire  of  jumps, 
multiple  branches  (or  case  statements),  and  sub- 
routine calls.  All  can  be  executed  conditionally,  de- 
pending upon  the  outcome  of  one  of  eight  tests.  In 
addition,  a  serial  shadow  register  on  the  28-pin  chip 
helps  designers  diagnose  system  troubles  right  down 
to  a  particular  IC.  (In  the  past,  expediency  often  dic- 
tated that  complex  trouble-shooting  be  avoided  for 
as  long  as  possible.) 

Four  basic  blocks 

The  controller  comprises  four  main  functional 
blocks.  Three  of  them  — the  microaddress  control 
logic,  condition  code  selector,  and  microinstruction 
decoder— form  the  cornerstone  of  the  controller,  the 
address  sequencer.  The  fourth  is  a  microprogram 
memory  (64  by  32  bits)  with  a  pipelined  register  and 
serial  shadow  register  (Fig.  1). 

For  the  most  part,  the  elements  of  the  address 
sequencer  are  fairly  typical.  Nevertheless,  the  way  in 
which  they  are  organized  and  connected,  as  well  as 
the  instruction  set,  make  the  chip  unique.  For  ex- 
ample, the  microaddress  control  portion  of  the 
sequencer  contains  one  register  for  counting  loops 
and  another  for  stacking  subroutine  return  address- 
es. Yet  either  register  can  be  employed  to  double  the 
capacity  of  the  other.  Consequently,  the  chip  can  nest 
two  levels  of  loop  counting  or  two  levels  of  subroutine 
branching  (the  instruction  set  reflects  those  abili- 
ties). And  the  high  degree  of  interaction  between  ele- 
ments, particularly  within  the  microaddress  control 
logic,  makes  necessary  a  highly  sophisticated  micro- 
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instruction  set. 

The  microaddress  control  logic  is  the  brain  of  the 
address  sequencer,  since  it  generates  the  addresses 
that  access  the  microinstructions.  At  any  time,  the 
address  that  is  called  depends  on  the  preceding  in- 
struction and  the  outcome  of  any  conditional  tests. 

Within  the  control  logic,  a  program-counter  multi- 
plexer supplies  the  PROM's  6-bit  address.  The  multi- 
plexer takes  the  address  from  a  microprogram 
counter,  incremented  program  counter,  branch  con- 
trol logic,  or  subroutine  register.  Because  the  pro- 
gram counter  contains  the  address  of  the  currently 
executing  instruction,  that  instruction  is  executed 
again  when  the  program  counter  is  selected  as  the 
address  source.  As  a  result,  the  counter  plays  a  fun- 
damental role  in  tallying  loops  and  executing  "wait 
until  true"  instructions. 

The  incrementer  holds  the  next  address  in  the 
sequence,  and  is  the  expected  source  when  no  jumps 
are  executed  and  no  branch  or  subroutine  conditions 
exist.  When  conditional  statements  like  "if  .  .  . 
then  . . .  else"  and  multiple  branches  pass  the  re- 
quired tests,  or  when  unconditional  jumps  are  exe- 
cuted, the  branch  control  logic  supplies  the  address. 
Finally,  when  the  program  calls  a  subroutine,  the 
subroutine  register  supplies  the  necessary  address. 

A  multiplexer  selects  one  of  three  address  sources. 
If  only  one  stack  level  is  needed,  the  value  stored  in 
the  subroutine  register  is  chosen.  When  the  count 
register  feeds  the  subroutine  register,  however,  it 
furnishes  an  additional  stack  level.  The  third  source, 
the  incrementer,  supplies  the  subroutine's  return 
addresses. 

Doing  double  duty 

If  not  needed  for  a  second  subroutine  level,  the 
count  register  can,  among  other  functions,  execute  it- 
erative loops  and  time  external  events.  To  accom- 
plish the  former,  the  controller  loads  the  register 
with  the  number  of  iterations  to  be  run.  Each  iter- 
ation decrements  the  register  until  it  reaches  zero. 
The  zero-detection  logic  associated  with  the  counter 
informs  the  chip's  microinstruction  decoding  logic 
when  the  register  "bottoms  out." 

Using  the  same  logic,  an  instruction  can  be  re- 
peated a  set  number  of  times.  Repeated  executions  of 
the  same  instruction  is  a  simple  way  to  insert  wait 
states  and,  therefore,  build  an  interface  to  different 
microprocessors  and  peripherals. 

The  count  register  is  loaded  from  any  of  four 
sources:  a  decrementer,  for  normal  loops;  an  instruc- 


tion field;  the  subroutine  register;  and  the  branch- 
control  logic.  The  last  derives  a  6-bit  value  from  a 
data  field  in  a  microinstruction. 

The  branch-control  logic,  a  powerful  block  within 
the  sequencer,  calculates  the  6-bit  value  either  by  ap- 
plying the  microinstruction  data  field  directly  or  by 
using  it  to  mask  the  chip's  six  test  inputs,  T0  through 
T5.  In  the  second  case,  the  masked  input  actually  be- 
comes the  branch  address.  Moreover,  either  the  data 
field  or  masked  test  value  serves  as  both  a  branch  ad- 
dress and  a  count  value. 

The  same  control  logic  also  compares  the  masked 
test  inputs  to  a  constant  in  a  microinstruction  field. 
The  outcome  of  this  check  affects  a  flip-flop.  The  lat- 
ter's  condition  itself  becomes  a  factor  in  deciding 
conditional  branch  and  subroutine  instructions.  If  a 
match  occurs,  the  flip-flop  is  set.  Alternatively,  the 
flip-flop  remains  unchanged  if  there  is  no  match. 
Because  the  flip-flop  does  not  change  when  there  is 
no  match,  it  is  particularly  useful  for  comparing 
ASCII  characters  and  other  6-bit  fields,  as  well  as  for 
successively  checking  the  chip's  test  inputs. 

Controlling  conditions 

A  set  flip-flop  is  one  of  the  eight  aforementioned 
tests  that  fulfills  a  conditional  branch  or  subroutine. 
Through  its  condition-code  selection  logic,  the  con- 
troller is  able  to  check  each  of  its  six  test  input  lines, 
as  well  as  the  Condition  Code  input.  Further,  an 
exclusive-OR  gate  within  the  selection  logic  switches 
the  meaning,  or  interpretation,  of  a  test  result.  In 
other  words,  with  no  external  hardware,  a  test  condi- 
tion can  be  asserted  either  when  a  match  occurs  or 
when  one  does  not. 

The  final  component  of  the  chip's  sequencer  sec- 
tion is  the  microinstruction  decoder.  That  pro- 
grammable logic  array  generates  the  IC's  internal 
control  signals  based  on  the  microinstruction  being 
executed  and  the  test  results  reported  by  the  con- 
dition-code logic. 

The  IC's  fourth  functional  block  comprises  the 
fuse-programmable  microprogram  memory,  pipe- 
lined register,  and  serial  shadow  register.  The  pipe- 
line register  is  32  bits  wide,  and  stores  the  micro- 
instruction being  run.  The  next  address  is  calculated 
by  the  sequencer  and  its  contents  is  fetched  from  the 
microprogram  memory.  The  upper  16  bits  of  the 
pipeline's  output  remain  within  the  chip  to  sequence 
addresses  and  control  internal  functions. 

Only  the  16  low-order  bits  link  to  the  outside,  as 
user-defined  control  lines.  Of  these,  the  upper  byte  is 
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1.  The  20-MHz  Am29PL141  is  the  first  complete  microprog ram m able  controller  chip,  making  it  an  important 
building  block  for  distributed  processing  systems.  Its  powerful  sequencing  logic  steps  the  controller  through 
its  pipelined  PROM.  The  fuse-programmable  memory  is  64  by  32  bits. 


can  be  cascaded  readily  if  more  than  16  control  bits 
are  needed  (Fig.  2). 

The  serial  shadow  register,  also  32-bits  wide,  sim- 
plifies device-  and  system-level  diagnostics.  It  can  be 
loaded  in  parallel  with  the  contents  of  the  pipelined 
register  or  loaded  serially  from  the  Serial  Data  Input 
pin.  On  the  other  hand,  the  serial  register  can  also 
load  the  pipeline  or  shift  data  out  serially.  It  also  may 
simply  hold  the  data  sent  to  it. 

To  check  out  the  chip,  an  instruction  is  shifted 
serially  into  the  shadow  register  and  then  loaded  in 
parallel  into  the  pipeline.  Doing  so  forces  the  instruc- 
tion to  be  executed,  and  its  results  transferred  back 


registers  in  a  system  are  tied  together,  a  series  diag- 
nostic loop  is  created  that  isolates  a  problem  down  to 
a  single  chip. 

The  shadow  (use 

A  separate  fuse  must  be  blown  to  set  up  the  serial 
shadow  register.  When  that  is  done,  four  pins  are 
redefined  to  handle  diagnostics.  Specifically,  the 
Condition  Code  and  Zero  lines  and  Output  Data  Bits 
6  and  7  become,  respectively,  the  Serial  Data  In 
(SDI),  Serial  Data  Out  (SDO),  Diagnostic  Clock 
(DCLK),  and  Mode  control  lines. 

The  strength  of  any  controller— and  the  advantage 
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2.  When  an  application  calls  tor  more  than  16  control  bits,  two  or  more  chips  can  be 
cascaded  horizontally.  The  lower  16  bits  ol  each  ot  the  chip's  32-bit  microinstructions 
(of  which  there  are  29)  serve  to  control  a  system's  components.  Eight  of  these  16  con- 
trol bits  can  be  put  into  a  high-impedance  state  under  microinstruction  control;  the  oth- 
er 8  bits  are  always  enabled. 
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of  a  microprogrammed  system  that  employs  it— lies 
in  an  engineer's  ability  to  specify  the  sequence  in 
which  microinstructions  are  executed.  To  ensure 
that  ability,  the  controller  executes  all  the  basic 
high-level  constructs  required  for  structured  micro- 
programming. Its  29  op  codes  include  sequential  in- 
structions, conditional  instructions,  dual  branching 
forks,  and  multibranching  case  statements.  Iterative 
executions,  like  For,  While,  When,  and  Until,  round 
out  the  set.  In  addition,  Jump,  Jump  to  Subroutine, 
Loop,  and  Compare  instructions  allow  designers  to 
store  very  complex  algorithms  in  the  chip's  64-word 
memory. 

Instruction  formats  fall  into  two  categories.  The 
first  is  for  general  microinstructions;  the  second  is 
for  the  chip's  Compare  instructions.  The  latter  com- 
pare a  6-bit  test  input  to  a  masked  constant.  The 
Compare  instructions  are  well-suited  for  character 
searches,  as  well  as  key  searches  in  a  look-up  table. 

A  single-precision,  floating-point  peripheral  board 


(Fig.  3)  presents  a  good  example  of  the  part  that  the 
controller  plays  in  a  distributed  system.  As  a  micro- 
programmed design,  the  peripheral  serves  as  an  add- 
on math  accelerator  card  that  plays  with  different 
hosts  and  buses.  The  controller  orchestrates  the  ac- 
tions of  the  floating-point  processor,  and  various  reg- 
isters, register  files,  and  memory  chips. 

Simple  arithmetic 

The  processor  is  simple  to  use,  partly  because  it  in- 
curs no  pipeline  delays.  It  conforms  to  IEEE  and  oth- 
er industry  standards,  and  takes  only  a  single  clock 
cycle  to  add,  subtract,  or  multiply.  It  needs  five  cycles 
to  divide,  using  the  Newton-Raphson  method  that  in- 
verts one  of  the  factors  and  multiplies.  In  operation, 
to  divide  X  by  Y  the  chip  fetches  the  approximate  in- 
verse of  Y  from  a  PROM-based  table  and  multiplies 
it  by  X.  One  or  two  iterations  of  this  method  increase 
the  initial  accuracy. 

The  floating-point  board  works  with  a  microword 
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3.  tn  a  typical  application,  the  controller  oversees  the  workings  of  a  floating-point  pro- 
cessor board.  Host  instructions  sent  to  the  FIFO  and  instruction  registers  initiate  sub- 
routines in  the  controller  that  generate  the  signals  that  run  the  board.  Two  controllers 
are  employed  to  supply  the  necessary  number  of  control  signals. 
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of  at  least  25  bits,  9  more  than  available  with  one  con- 
troller. Thus  the  design  employs  two  controller  chips. 
The  floating-point  processor  requires  five  command 
bits.  Three  are  instructions  and  two  select  the  input 
source.  It  also  needs  three  control  bits  to  enable  its 
trio  of  data  registers.  Two  other  chips  (each  a  dual- 
access  four-port  register  of  64  words  by  18  bits)  tem- 
porarily store  commands.  They  accept  12  register  ad- 
dress bits  (6  for  source  and  6  for  destination)  from 
the  host  or  the  controller's  microprogram  memory. 

Data  passes  to  and  from  the  host  through  input 
and  output  registers  on  the  board,  which  call  for 
their  own  enable  signals.  Another  bit  is  needed  to  ad- 
vance the  FIFO  instruction  register.  One  is  necessary 
to  enable  and  another  to  select  a  status  word.  (The 
floating-point  chip  supplies  status  information, 
which  is  available  to  the  controller  through  its  test 
inputs  as  well  as  to  the  host  through  the  register  file.) 
Seven  control  bits  are  left  for  miscellaneous  tasks. 

Operation  begins  when  at  least  one  16-bit  instruc- 
tion is  loaded  from  the  host  into  the  peripheral's 
FIFO  register.  The  instruction  consists  of  a  4-bit  op 
code,  a  6-bit  source-register  address,  and  a  6-bit  ad- 
dress for  the  second  source  register,  which  also  stores 
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4.  Loading  an  instruction  into  the  FIFO  starts  the  pe- 
ripheral and  activates  the  controller,  which  loads  an 
external  instruction  register  and  decodes  the  op 
code  lield.  The  op  code  initiates  a  subroutine  in  the 
controller,  issuing  the  proper  control  signals,  ad- 
vancing the  FIFO  register,  and  loading  the  next 
instruction. 


the  results.  Until  an  instruction  is  received,  the  con- 
troller is  in  the  wait  state  (Fig.  4).  When  an  instruc- 
tion arrives,  however,  the  FIFO's  Empty  signal  acti- 
vates the  controller,  which  then  reads  the  command 
from  the  FIFO  into  a  separate  instruction  register. 
The  controller  also  loads  the  instruction's  op  code  in- 
to its  test  inputs.  It  then  masks  the  two  unused  test 
bits  and  jumps  to  a  subroutine  that  performs  the 
operation  specified  by  the  op  code.  After  completing 
it,  the  controller  advances  the  FIFO  register  to  load 
the  next  instruction. 

The  peripheral  executes  up  to  16  op  codes.  The  first 
eight  are  single-cycle  operations  and  identical  to 
those  of  the  floating-point  processor.  They  consist  of 
addition,  subtraction,  multiplication,  and  format 
conversion  instructions.  The  remaining  op  codes  are 
used  to  load,  store,  and  divide  data,  and  a  multiple  cy- 
cle instruction  multiplies  and  accumulates  values. 
The  four  remaining  op  codes  can  be  defined  by  the 
user  to  implement  application-related  operations. 

Software  and  hardware  tools  are  a  necessary  part 
of  such  proj  ects  as  the  foregoing  peripheral.  The  soft- 
ware assembles  high-level  microprograms  and  a 
JEDEC  output  file  that  specifies  the  fuse  pattern  to 
be  burned  into  the  PROM  array.  Currently,  a  pro- 
gram called  Fuse  Formatter,  which  runs  on  the  IBM 
PC  personal  computer,  lets  designers  enter  hexadeci- 
mal code  that  corresponds  to  PROM  data.  From  that 
code,  the  program  creates  a  file  that  is  downloaded 
directly  to  one  of  several  PROM  programmers.  The 
latter  blow  the  corresponding  fuses  in  the  micro- 
program memory  and  are  the  only  required  hard- 
ware tools.  □ 
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Designers  who  build  state  machines  from  pro- 
grammable logic  devices  (PLDs)  must  develop 
good  juggling  skills.  The  job  usually  takes  a  careful 
balancing  of  the  I/O  pins,  the  product  terms,  and 
the  registers  that  store  states  and  output  bits.  But 
because  these  resources  all  are  limited  in  a  PLD, 
performance  and  compactness  often  have  to  be  sac- 
rificed in  favor  of  the  desired  function. 

Ideally,  state-machine  designers  need  a  PLD 
with  buried,  or  internal,  state  registers,  compact 
  packaging  and  high 


Housed  in  a  20-pin 
DIP,  the  smallest  PAL 
device  to  date  for 
building  state  ma- 
chines sports  registers 
that  save  I/O  pins,  yet 
it  is  easy  to  test. 


speed.  Buried  registers 
free  valuable  I /O  pins;  a 
small  package  saves 
precious  board  space; 
and  high  operating 
speed  speaks  for  itself. 

Approaching  the 
ideal  is  the  AmPAL- 
23S8,  the  first  20-pin 
programmable  array 
logic  (PAL)  device  de- 
signed specifically  for  building  state  machines  and 
sequencers.  Its  architecture  is  more  flexible  than 
any  24-  or  28-pin  PAL  device  or  programmable 
logic  array  (PLA).  For  example,  it  has  14  edge- 
triggered,  D-type  flip-flops  (Fig.  1 ).  That  alone 
compares  favorably  with  24-pin  bipolar  PAL  de- 
vices, which  have  up  to  10  registers,  and  with  24- 
and  28-pin  PLAs,  which  have  12  to  14  registers. 
Only  40-pin  PAL  devices  with  16  registers  surpass 
the  new  chip. 

As  significant  is  the  fact  that  of  the  chip's  regis- 
ters, six  hold  buried  state  bits.  By  definition,  the 
buried  registers  do  not  merely  free  I/O  pins:  In- 
deed, they  actually  enhance  the  chip's  value  as  a 
state  machine.  Driven  by  the  chip's  AND-OR 
array  the  buried  state  registers,  along  with  the  eight 
registers  dedicated  to  I/O  pins,  make  possible  a 
wide  range  of  states  and  outputs. 

Moreover,  the  buried  state  bits  can  be  easily 
accessed,  controlled,  and  observed.  Although  the 


first  two  attributes  are  common  among  PAL  de- 
vices, the  third  has  always  presented  problems.  The 
23S8  is  the  first  chip  whose  product-term  array 
makes  the  buried  state  registers  observable. 

Of  the  other  eight  registers,  four  serve  in  output 
logic  macrocells  and  four  store  output  states.  Each 
macrocell  contains  three  main  blocks  (Fig.  2).  One 
block  is  the  register,  a  rising-edge-triggered  flip- 
flop  sharing  asynchronous  reset  and  synchronous 
preset  inputs  with  all  the  other  flip-flops.  The  other 
two  blocks  are  multiplexers,  one  of  them  selecting 
from  one  of  four  output  paths,  the  other  from  one  of 
two  feedback  paths. 

The  logic  chip  is  small,  fast,  and  easy  to  program. 
Packaged  in  a  0.3-in.  DIP,  it  is  half  the  width  of 
28-  and  40-pin  PAL  and  PLA  type  devices.  More- 
over, it  comes  in  33-  and  28.5-MHz  versions,  with 
propagation  delays  of  20  and  25  ns,  respectively. 
The  faster  version  is  an  ideal  companion  for  new 
8-MHz,  16-bit  microprocessors,  as  well  as  for 
16-MHz,  32-bit  units..  Even  at  28.5  MHz,  the  new 
chip  is  faster  than  existing  bipolar  PLA-based  se- 
quencers, which  operate  from  20  to  25  MHz.  It  also 
beats  equivalent-density  PAL  devices  operating  at 
or  below  25  MHz. 

Programming  takes  place  by  blowing  reliable 
and  fast  platinum  silicide  fuses,  and  using  available 
software  packages.  Like  all  PAL  devices,  the  23S8's 
fusible  AND-OR  array  is  a  direct  measure  of  its 
capability.  With  6200  fuses,  it  is  three  to  six  times 
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chips,  which  are  three  to  four  times  as  large  and  have 
some  8000  fuses.  The  new  chip's  fuse  array  handles  all  14 
registers,  with  6  to  1 2  logic  products  for  each  output,  and 
up  to  23  inputs— 9  of  them  dedicated.  8  for  feedback,  and 
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1.  The  AmPAl23S8  Is  the  first  20-pln  PAL  device  de- 
signed specifically  for  building  state  machines 
and  sequencers.  It  contains  14  registers,  6  of 
them  burled  and  requiring  no  I/O  pins,  4  dedi- 
cated to  output  macrocells,  and  4  storing  output 
bits.  The  states  of  the  burled  registers  are  easily 
observed,  which  simplifies  debugging. 


terms  are  two  other  measures  of  its  capability.  It  has  alto- 
gether 1 35  product  terms  for  logic  and  control  functions. 
Moreover,  the  AND  array  that  drives  the  14  flip-flops  is 
variably  distributed  (see  the  table ). 

The  distribution  of  the  1 24  logic  product  terms  helps 
adapt  the  chip  to  system  requirements,  while  optimizing 
its  internal  resources.  (Of  the  1 1  other  terms,  8  control 
output,  the  remaining  3  control  preset,  reset,  and  observ- 
ability, respectively.)  For  additional  flexibility  in  de- 
signing powerful  state  machines,  the  chip  can  tradeoff  the 
product  terms  of  its  buried  registers. 

OBSERVABILITY  CLOSE  UP 

The  chip's  observable  product  term  is  generated  by  the 
AND-OR  array,  along  with  an  ability  to  preload  the 
buried  registers.  Observability  helps  check  a  buried  regis- 
ter's state,  so  that  a  designer  can  monitor  any  register  dur- 
ing debugging. 

Under  control  either  of  a  single  pin  or  of  a  combination 
of  six  input  signals,  the  observable  product  term  drives  a 
set  of  inverting  buffers.  In  normal  operation,  the  buffers 
also  serve  the  four  dedicated  output  registers  and  two  of 
the  four  macrocells.  When  activated,  therefore,  the  ob- 
servable product  term  disables  signal  flow  from  those  reg- 
isters and  macrocells  and  simultaneously  connects  the  six 
buried  registers  to  their  respective  I/O  pins. 

Flexibility  in  configuring  a  state  machine  derives  from 
the  independent  control  of  output  and  feedback  multi- 
plexers. Each  output  macrocell  has  three  fuses,  two  to 
determine  the  output  path  and  one  the  feedback  path.  The 
fuses  can  configure  the  macrocells  in  eight  ways:  a  combi- 
natorial or  a  registered  output  with  either  an  I /O  pin  or  a 
registered  feedback.  Each  of  those  four  variations  offers 
either  active  high  or  active  low  outputs. 

In  the  output  path,  fuse  S>  determines  whether  the  out- 
put is  active  high  or  active  low.  The  output-nature  fuse, 
S2,  selects  sequential  or  combinatorial  operation.  When 
both  of  those  fuses  are  intact,  a  flip-flop's  Q  pin  passes 
through  an  inverting  buffer,  making  the  output  active  low 
and  registered. 

On  the  other  hand,  program  ming  or  blowing  S ,  renders 
the  output  term  active  high;  blowing  S2  causes  the  output 
to  bypass  the  register,  making  it  combinatorial.  A  combi- 
natorial output  with  feedback  can,  moreover,  provide 
more  than  one  level  of  ORing  logic.  Finally,  fuse  S3  con- 
trols the  feedback  path,  sending  either  the  flip-flop's  Q 
output  or  the  I/O  pin  back  into  the  array. 

Controlling  each  output's  enable  line  through  a  prod- 
uct term  from  the  AND  array  creates  further  options  by 
transforming  an  I/O  into  a  dedicated  input,  a  dedicated 
output,  or  a  dynamically  controlled  I/O  pin.  A  designer 
can  thus  adjust  the  number  of  input  and  output  pins,  and 
have  the  dynamic  I/O  capability  required  by  most  buses. 
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Each  oulpul  enable  producl  term  in  further  associated 
with  an  individual  polarity  fuse.  This  polarity  fuse  allows 
designers  to  modify  a  control  signal  to  enable  outputs  us- 
ing De  Morgan's  theorem,  which  is  especially  useful  for 
bus  control  applications. 


2.  Each  programmable  macrocell  has  three  main 
blocks:  a  rising  edge-triggered  D  type  flip-flop, 
an  output  multiplexer,  and  a  feedback  multi- 
plexer. The  output  multiplexer  selects  from  one  ol 
tour  output  sources;  the  feedback  multiplexer 
feeds  either  an  I/O  pin  or  the  flip-flop's  Q  output 
into  the  programmable  AND-OR  array. 
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3.  Two  23S8s  lie  at  the  heart  of  an  advanced  dynamic  RAM  controller  that 
compares  well  in  its  data-transfer  features— such  as  dynamic  bus  sizing  and 
handling  misaligned  data— with  recent  32-bit  microprocessors.  One  of  the  two 
main  PAL  devices  controls  the  timing  and  arbitration  of  the  memory  cycles; 
the  other  sizes  and  aligns  data  transfers. 


The  chip's  high  speed  and  buried  slate  registers  make  it 
an  ideal  foundation  for  a  flexible  and  fast  dynamic  RAM 
controller  that  meets  the  complex  requirements  of  the  lat- 
est microprocessors.  Even  at  a  16-MHz  processor  clock 
rate  it  can  become  a  controller  with  few  or  no  wait  states 
that  might  otherwise  have  to  be  built  from  scratch. 

There  is.  for  example,  no  ready-made  controller  that 
can  lake  advantage  of  dy  namic  bus  sizing  the  auto- 
malic  adjustment  lo  the  amount  of  data  movable  in  one 
bus  cycle.  Nor  can  any  off-the-shelf  unit  handle  misal- 
igned data,  that  is.  when  an  operand  falls  outside  its 
proper  memory  boundaries.  Bus  sizing  and  data  align- 
ment are  features  of  the  recently  arrived  Motorola  6X02(1 
and  the  Intel  80386,  bolh  32-bil  microprocessors.  But  a 
controller  able  to  perform  the  two  functions  can  he  buill 
around  two  23S8s,  one  for  liming  and  arbitration  logic 
and  the  other  for  data  sizing  and  alignment  ( Hig.  3 ). 

The  liming  and  arbitration  chip  arbitrates  among 
present,  refresh,  and  CPU  cycles;  executes  read  or  write, 
read-modify-write,  and  refresh  cycles;  and  asserts  inter- 
face signals.  The  chip  first  arbitrates  between  the  proces- 
sor's Memory  Request  (MREQ)  and  Refresh  Request 
(RKCLK)  signals.  It  gives  priority  to  the  cycle  currently 
being  executed,  follows  this  with  the  refresh  cycle,  and 
concludes  with  the  processor  cycle.  Bolh  intermittent  and 
burst-mode  refresh  schemes  are  therefore  possible. 

The  design's  timing  relationships  and  arbitration  re- 
quirements define  a  stale-machine  diagram  (Fig.  4).  The 
actual  stale  machine 
employs  five  of  the  six 
available  buried  regis- 
ters. The  sixth  register 
is  a  flag  and  so  acts  as  a 
small,  independent  slale 
machine.  Either  of  two 
software  packages 
Cupl  from  Personal 
CAD  Systems  Inc.'s 
Assisted  Technology 
Division  (San  Jose, 
Calif. )  or  Abel  from 
Dala  I/O  Corp.' (Red- 
mond, Wash.)  easily 
describes  the  slale  oper- 
ations. (Cupl  is  the  lan- 
guage used  in  this  par- 
ticular example. ) 

No  arbitration  lakes 
place  during  the  exe- 
cution of  a  cycle,  which 
is  indicated  by  a  slate 
other  than  0.  The  lim- 
ing and  arbitration  chip 
stores  any  refresh  re- 
quest and  responds  to  il 
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later,  continuing  lo  follow  the  current  cycle  through  its 
various  states.  Thus  the  current  state  automatically  takes 
priority,  and  no  other  cycle  is  started  until  the  current  one 
is  complete. 

At  the  end  of  the  cycle,  at  state  0.  the  timing  and  arbi- 
tration chip  arbitrates  between  MREQ  (a  CPU  cycle) 
and  RFCLK.  In  the  small  independent  state  machine, 
the  buried  register  flag,  Bs  latches  RFCLK,  which  may 
last  no  longer  than  one  clock  cycle.  When  RFCLK  is 
high.  B,  toggles  to  state  1  and  stays  there  until  the  refresh 
cycle  is  executed.  Then  it  toggles  back  to  state  0  to  be 
reads  for  the  next  RFCLK. 

During  refresh  cycles,  the  timing  and  arbitration  chip 
asserts  the  Refresh  signal  (RFSH),  which  supplies  the  re- 
freshed row  address  to  the  memory  address  bus.  The  chip 
also  generates  the  Row  and  Column  Address  Enable 
Lines,  ROWADD  and  COLADD,  which  multiplex  ad- 
dresses onto  the  memory  bus.  At  state  16.  the  chip  re- 
moves ROWADD  and  asserts  RFSH;  state  17  asserts  the 
Row  Address  Strobe  signal  (RAS)  and  refreshes  all 
memory  banks  before  removing  RAS  and  RFSH.  Re- 


SIG.  COLADD,  ROWADD 


RAS,  RFSH.  ROWADD 


RAS.  COLADD.  ROWADD 


Note:  When  signal 

above  and  below  a  line,  the  upper  Indicates 
Inputs  and  the  lower,  outputs. 


4.  A  state  diagram  tor  the  timing  and  arbitration  chip  reflects  its  oper- 
ation (or  selecting  and  orchestrating  the  memory  cycles.  The  chip 
waits  in  state  0  to  handle  either  a  processor  cycle  or  a  refresh  cycle; 
the  latter  takes  priority  over  any  but  a  cycle  currently  In  progress.  The 
chip  performs  refresh,  read,  write,  and  read-modlfy-wrlte  cycles. 


moving  RFSH  increments  an  external  1 0-bit  refresh-row- 
address  counter  (AmPAL22VT0)  to  the  next  row. 

If  no  refresh  request  is  pending  during  state  0  and 
MREQ  is  asserted,  a  processor  cycle  begins  and  the  ma- 
chine jumps  to  state  I,  Otherwise  the  machine  stays  in 
state  0  and  polls  for  either  a  refresh  or  processor  cycle. 

The  PAL  chip  responds  to  the  processor's  signal  RMC 
to  execute  a  read,  a  write,  or  a  read-modify-write  cycle, 
with  the  internal  registers  controlling  timing  for  the  dif- 
ferent cycles.  If,  for  instance,  the  memory  consists  of 
4  Mwords,  organized  as  four  equal  banks  of  32-bit  words, 
the  chip  asserts  a  RAS  signal  for  each  bank — RAS0 
through  RAS3. 

The  strobe  signal  being  asserted  depends  on  address- 
select  signals  ADSELo  and  ADSEL,,  which  in  most  sys- 
tems are  the  high  bits  of  the  processor's  address  bus.  After 
asserting  the  proper  strobe,  the  machine  jumps  to  state  2 
to  manipulate  the  address-handling  signals.  For  instance, 
to  multiplex  addresses,  ROWADD  and  COLADD  are 
again  asserted. 

In  state  2,  the  timing  and  arbitration  chip  removes 
ROWADD  and  asserts  COL- 
ADD, along  with  SIG,  the  refer- 
ence that  tells  the  data-sizing  and 
alignment  chip  that  a  processor  cy- 
cle  is  in  progress.  SIG  syn- 
chronizes the  two  23S8s  and  in- 
structs the  data-sizing  and 
alignment  PAL  to  send  a  column- 
address  strobe  (CAS)  to  memory. 

The  state  machine  remains  in 
state  2  for  an  extra-cycle,  and  then 
moves  to  state  4  to  al  low  time  to  ac- 
cess the  data.  There  it  decides, 
based  on  processor  signal  RMC, 
whether  to  execute  a  read,  a  write, 
or  a  read-modify-write  cycle.  Be- 
fore completing  a  read  or  write  ac- 
cess, the  timing  and  arbitration 
chip  removes  RAS  and  waits  for 
one  more  cycle  to  allow  for  the 
RAS  precharge  time.  Simulta- 
neously, it  switches  back  to  ROW- 
ADD  from  COLADD  in  prepara- 
tion for  the  next  processor  cycle, 
and  disables  SIG. 

When  the  RMC  signal  requests 
a  read-modify-write  cycle,  the 
timing  and  arbitration  chip  auto- 
matically lengthens  the  cycle.  It 
holds  RAS  until  DSACK  and  WE 
signals  have  been  asserted,  and 
only  then  completes  the  cycle. 

The  data-sizing  and  alignment 
chip  works  only  during  a  CPU  cy- 
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cle.  It  creates  WE  for  the  write  and  read-modify-write  cy- 
cles, and  it  accepts  SIG,  which  synchronizes  the  chip's 
different  states  (Fig.  5)  with  those  of  the  timing  and 
arbitration  chip. 

Four  of  the  data-sizing  and  alignment  chip's  buried 
state  registers  generate  its  state-machine  function.  The 
machine  starts  polling  in  state  0  and  stays  there  until  it  re- 
ceives SIG,  the  start  of  a  processor  cycle. 

The  68020  has  a  32-bit  maximum  data  width  and  a 
data  bus  divided  into  four  byte-wide  segments,  each  inde- 
pendently controlled  by  a  CAS  signal  from  the  data- 
sizing  and  alignment  chip.  The  processor  can  therefore 
read  and  write  byte-wide  data  over  any  segment. 

The  four  CAS  signals,  CASo  to  CAS,,  strobe  the  col- 
umn address  from  the  high-  to  low-order  memory  bytes. 
The  signals  also  allow  the  microprocessor  to  read  and 
write  across  the  low-low,  low-middle,  high-middle,  and 
high-high  bytes  of  a  32-bit  data  port.  The  data-sizing  and 
alignment  chip  must  assert  all  four  CAS  signals  for  a 
32-bit  port;  only  CAS2  and  CAS)  are  needed  to  supply  a 
1 6-bit  port's  low-  and  high-byte  strobes;  while  for  an  8-bit 
port,  CAS3  is  enough. 


5.  The  data-sizing  and  alignment  chip  works  only 
during  a  processor  cycle  and  waits  in  state  0  until 
one  is  requested.  If  the  processor  calls  tor  a  read 
or  a  write  cycle,  the  chip  asserts  DSACK,  CAS, 
and,  tor  a  write  cycle,  WE.  For  a  read-modlly-write 
cycle,  it  asserts  two  DSACK  signals,  one  each  tor 
the  read  and  write  segments. 


The  data-sizing  and  alignment  chip  asserts  the  CAS 
signals  in  state  1 ,  and  selects  them  according  to  the  size  of 
the  port  in  use,  the  width  of  the  data  being  moved,  and 
how  the  data  aligns  with  the  port.  Coding  on  input  pins 
PRT0  and  PRT,  shows  the  port  size;  signals  SIZ0  and 
SIZ,  from  the  microprocessor  show  data  width;  and  the 
microprocessor's  least-significant  address  bits,  A0  and 
A,,  give  the  alignment. 

THE  PROCESSORS  PREFERENCE 

The  size  of  the  port  dictates  the  amount  of  data  that  can 
be  transferred  in  any  one  cycle.  Based  on  the  PRT  signals 
the  state  machine  asserts  DSACK  signals,  which  ac- 
knowledge the  CPU  and  tell  it  how  much  data  it  can 
move,  since  it  will  always  try  to  move  as  much  as  possible. 
The  state  machine  synchronizes  the  acknowledgment 
timing,  asserts  the  required  CAS  signals,  and  holds  them 
until  the  read,  write,  or  read-modify-write  cycle  is  com- 
plete. 

A  read  cycle's  DSACK  signal  is  asserted  in  state  1  and 
remains  until  the  end  of  the  cycle  at  state  4.  A  write  cycle 
is  identical,  except  for  the  additional  WE  signal.  But  a 
read-modify-write  cycle  requires  two  DSACK  signals, 
one  for  reading,  the  other  for  writing.  The  first  DSACK  is 
the  same  as  that  for  a  read  or  a  write  cycle;  the  second 
DSACK  occurs  in  state  8  and  remains  until  state  1 1,  with 
a  delayed  WE  asserted  in  state  8  by  an  internal  4-bit 
counter.  □ 

Om  Agrawal  is  product  planning  manager  for  pro- 
grammable logic  devices  al  Advanced  Micro  Devices. 
He  has  designed  16-bil  and  32-bit  minicomputers  and 
is  co-author  of  a  book  on  high-speed  memory  systems. 
He  holds  a  PhD  in  electrical  engineering  and  computer 
science  from  Iowa  Stale  University  and  an  MBA  from 
the  University  of  Santa  Clara. 

Kapil  Shankar  is  a  senior  product  planning  engineer 
for  programmable  logic  and  memory  devices,  and  has 
designed  advanced  graphics  systems.  He  has  an  MS  in 
computer  and  systems  engineering  and  an  ME  in  elec- 
trical power  engineering,  both  from  Rensselaer 
Polytechnic  Institute. 
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When  it  comes  to  generating 
complex,  high-resolution  dig- 
ital waveforms,  Advanced 
Micro  Devices'  Am2971  picks  up 
where  older  parts  leave  off.  Precise 
time  delays  once  required  the  use  of 
either  a  hybrid  structure  consisting 
of  an  analog  delay  line  combined 
with  digital  logic  or  costly  counters 
driven  by  high-frequency  clocks.  On 
the  one  hand,  analog  delay  lines  have 


r  ,  PROGRAMMABLE 
+5  v   INTERNAL  CLOCK 


TTL 

He 


PLL 


*0 


Affl2971 

PROGRAMMABLE 
A i    EVENT  GENERATOR 


CLKOUT 
TTL  PLL 

GND  GNO  FLTR  CLK/X  i 


<I 


^  3s  pF^8  pf 


short  enough  tap-to-tap  time  delays, 
or  resolutions,  to  be  usable  for  han- 
dling the  tighter  timing  relation- 
ships of  dynamic  and  static  RAMs. 
However,  those  lines  cannot  also  ac- 
commodate the  longer  total  delays 
needed  by  these  devices. 

On  the  other  hand,  the  desired 
waveforms  can  be  obtained  digitally 
from  the  outputs  of  one  or  more 
binary  counters.  But  to  achieve  reso- 
lutions of  10  ns,  counters  must  be 
driven  with  100-MHz  clock  frequen- 
cies, which  are  difficult  to  distribute 
around  a  board. 

The  programmable  event  genera- 
tor (PEG)  a  monolithic  IC,  offers 


a  more  elegant  solution  for  the  prob- 
lem of  generating  complex  high- 
speed timing  waveforms  (see  Fig. 
1).  Applications  for  the  PEG  (see 
box,  "A  new  PEG  in  the  designer's 
tool  box")  range  from  simply  cor- 
recting the  clock  skew  that  results 
from  distributing  a  clock  signal  on 
a  backplane  to  generating  complex 
state-machine  timing.  The  PEG'S  12 
output  lines  can  be  programmed  by 
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Fig.  1.  In  response  to  a  logic  tran- 
sition on  its  trigger  (TRIG)  input,  the 
PEG  will  generate  12  simultaneous 
user-programmed  timing  waveforms 
on  its  T>  to  Tn  output  lines. 

the  user  to  assume  either  a  logic- 
high  or  a  logic-low  level  within  each 
of  32  time  slots  or  events.  The  fre- 
quency of  these  events — that  is,  the 
frequency  of  the  output  waveforms 
— is  also  programmable.  But  it  is  the 
precision  and  resolution  of  these 
waveforms  that  is  the  PEG'S  claim 
to  fame  (see  Fig.  2  ). 

Internal  operation 

The  PEG  is  an  edge-triggered 
logic  device  that  generates  a  pre- 
programmed digital  waveform  in 
response  to  a  triggering  signal.  It 
consists  of  four  basic  blocks :  a  next- 
address  and  event  store,  a  start-ad- 
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dress  store,  an  oscillator  and  clock- 
control  block,  and  control  logic  (see 
Fig.  S).  A  total  of  623  user-program- 
mable platinum-silicide  fuses,  simi- 
lar to  those  used  in  AMD's  bipolar 
PROMs  and  programmable  array 
logic  devices,  are  located  throughout 
these  blocks. 

When  the  PEG'S  fuses  are  being 
programmed,  the  12  timing-tap  out- 
puts (T0  to  T,i)  are  used  as  fuse- 
address  inputs.  A  thorough  descrip- 
tion of  the  procedure  for  program- 
ming the  PEG  can  be  found  on  its 
data  sheet,  so  for  the  purpose  of 
understanding  what  goes  on  inside 
the  chip,  let's  assume  that  it  has  al- 
ready been  programmed, 
j  An  output-timing  sequence  is  ini- 
tiated by  a  logic  transition  at  the 
TRIG  input.  Two  user-program- 
mable fuses  are  located  in  the  trig- 
ger-polarity block.  One  of  them  is  a 
polarity-select  fuse,  which  when  un- 


programmed  will  cause  the  timing 
sequence  to  start  during  a  negative 
transition  of  the  TRIG  input.  If  it 
is  programmed,  the  sequence  starts 
when  a  positive  transition  occurs. 

The  second  fuse  in  this  block  is 
used  to  define  the  end  of  a  timing 
sequence.  If  this  fuse  is  left  unpro- 
grammed,  the  timing  sequence  is 
stopped  on  the  trailing  edge  of  TRIG 
pulse.  Otherwise  if  the  second  fuse 
is  programmed,  the  end  of  the  tim- 
ing sequence  is  defined  by  the  stop 
bits  as  programmed  by  the  user  into 
the  next-address/event  generator 
functional  block. 

Customized  waveforms 

A  transition  at  the  TRIG  input 
latches  three  address  bits,  A0  to  A.,, 
which  are  decoded  by  the  start-ad- 
dress store,  which  serves  as  an  8  x 
5-bit  mapping  PROM.  Each  of  its 
eight  addresses  represents  a  differ- 


INPUT 


ent  starting  point  for  the  12  output 
waveforms.  For  example,  three  dif- 
ferent addresses  in  this  store  could 
be  used  to  initialize  output  wave- 
forms that  implement  the  read-, 
write-,  and  page-mode  timing  se- 
quences of  a  dynamic  RAM. 

The  five  bits  of  each  address  are 
used  to  select  one  of  thirty-two  18- 
bit  locations  from  the  next-address 
and  event  store  (also  a  PROM). 
Each  18-bit  string  contains  three 
groups  of  information.  The  first  5 
bits  define  the  next  address  of  the 
desired  timing  sequence.  The  next 
12  bits  define  the  logic  levels  on  each 
of  the  timing-output  lines  (T0  to 
Tj  j ) .  The  18th  bit  is  a  sequence-stop 
bit.  As  the  timing  sequence  pro- 
gresses from  the  first  to  the  thirty- 
second  event-store  address,  the  out- 
put lines  will  produce  12  independent 
waveforms. 

The  assumption  so  far  is  that  the 


Fig.  2.  Tap-to-tap  resolution  of  an 
analog  delay  line  (a)  is  not  indepen- 
dent ol  its  total  delay  time.  However, 
the  PEG'S  12  programmable  output 
waveforms  (b)  are  independent  of 
one  another.  Resolutions  down  to 
10  ns  can  be  easily  obtained. 
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Fig.  3.  The  PEG  consists  of  tour  basic  blocks:  a  next-address  and  event  store, 
a  start-address  store,  an  oscillator  and  clock-control  block,  and  control  logic. 


stop-function  fuse  has  not  been 
programmed.  If  it  has,  the  timing 
sequence  will  no  longer  stop  on  the 
trailing  edge  of  the  TRIG  pulse.  The 
end  of  a  timing  sequence  will  instead 
be  defined  by  the  stop  bit.  When  a 
timing  sequence  encounters  a  stop 
bit,  the  entire  sequence  comes  to  a 
halt,  and  the  12  timing-output  lines 
remain  at  their  current  logic  level. 

Each  of  the  timing-waveform  out- 
puts has  a  minimum  useful  cycle 
time  of  40  ns  (or  20  ns  per  change) . 
When  the  PEG  is  operating  at  its 
maximum  internal-clock  frequency 
(f,  =  100  MHz),  the  outputs  must 
be  programmed  to  remain  un- 
changed for  at  least  two  clock  pe- 
riods for  each  change  of  logic  level. 
That  is,  although  an  output  can  only 
change  a  minimum  of  every  20  ns, 
the  timing  resolution  between  events 
among  taps  may  be  as  low  as  10  ns. 
When  the  PEG's  internal  clock  is 
programmed  to  operate  at  50  MHz  or 
slower,  the  timing  waveform  can  be 
programmed  to  change  once  each 
clock  period.  And  its  clock-frequency 
range  is  low  enough — from  7  to  20 
MHz — for  the  user  to  drive  the 
PEG's  CLK  input  from  the  system 
clock  via  an  internal  clock  input  pin. 

Alternatively,  the  designer  may 
opt  for  crystal  control  by  connecting 
a  7  to  20-MHz  crystal  to  the  X,  and 
X2  inputs  of  the  PEG.  And  thanks 
to  the  five  fuses  within  the  oscillator 
and  clock-control  block,  the  user  can 
also  set  the  frequency  of  the  output 


waveforms.  Four  of  these  fuses  are 
used  to  select  fc. 

When  an  external  crystal  is  used, 
the  PEG's  internal  phase-locked  loop 
fPLL)  will  treat  the  crystal's  fre- 
quency as  an  input-reference  fre- 
quency, ft,  and  will  multiply  it  by  a 
programmable  value  (1,  5,  10,  %, 
or  >%)  to  obtain  the  desired  value  of 
tt  (  see  Fig.  4) .  Also  an  output-clock 
frequency,  f0,  is  available  on  the 
CLKOUT  pin.  This  output  clock— 
whose  frequency  is  either  Ms  or  Mo 
of  the  internal-clock  frequency — 
may  be  used  to  synchronize  the  out- 

A  new  PEG  in  the 
designer's  tool  box 

Designers  no  longer  have  to  grap- 
ple with  low-resolution  analog  de- 
lay lines  or  costly  timing  logic 
when  high-speed  waveforms  are 
needed.  The  Am2971  program- 
mable event  generator,  trade- 
marked  PEG,  from  Advanced  Mi- 
cro Devices  has  12  output  lines 
that  can  produce  and  accurately 
place  logic  transitions  as  short  as 
20  ns,  with  a  minimum  waveform- 
to-waveform  resolution  of  10  ns. 


puts  of  the  PEG  with  the  remainder 
of  the  system.  And  speaking  of  sys- 
tems, the  PEG  comes  in  very  handy 
for  generating  the  specialty  cycles 
offered  by  today's  DRAMs. 

DRAM-timing  application 

Typically,  the  system  designer 
must  use  more  than  one  analog  de- 
lay line  and  some  external  logic  to 
produce  the  multiple  CAS  pulses 
needed  for  the  page-mode  cycles  of 
some  DRAMs.  Timing  designs  using 
high-speed  clocks  and  counters  are 
equally  cumbersome  in  this  case. 
Once  again,  the  PEG  chip  offers  a 
streamlined  solution  because  it  can 
readily  generate  the  multiple  CAS 
pulses. 

Another  job  for  the  PEG  comes 
about  as  a  result  of  the  use  of  multi- 
plexed address  lines  on  high-density 
DRAM  boards.  Because  the  many 
address  lines  present  a  relatively 
large  capacitive  load  to  the  multi- 
plexer, significant  time  delays  are 
introduced  in  the  address-timing 
waveforms. 

Typically,  a  signal  provided  by  the 
system  switches  one  or  more  ad- 
dress-multiplexer ICs.  This  signal 
must  be  accurately  timed  to  change 
state  after  time  interval  1  in  Fig.  5. 
The  performance  of  the  individual 
DRAM — and  ultimately  of  the  entire 
memory  system  in  which  it  resides — 
depends  on  reducing  interval  2  as 
much  as  possible.  Interval  1  is  15  to 
20  ns  in  most  standard  DRAM  spe- 
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Fig.  4.  The  PEG  can  be  clocked  by 
an  external  crystal  (a)  or  by  the  sys- 
tem clock.  On-chip  programmable 
Clock-control  logic  (b)  enables  the 
user  to  set  the  internal-clock  fre- 
quency, tc. 

cifications.  Thus,  a  resolution  of  10 
to  15  ns  is  desirable  for  the  DRAM- 
timing  waveforms — an  easy  job  for 
the  PEG. 

What's  more,  the  PEG  can  be  used 
to  generate  the  necessary  timing 
waveforms  for  multi-array  mem- 
ories too.  In  video-DRAM  systems, 
for  example,  the  rising  edge  of  the 
transfer-enable  signal,  TRG,  must 
be  accurately  placed  relative  to  the 
rising  edge  of  the  serial-port  clock 
signal,  SC.  When  the  PEG  is  used  to 
generate  both  waveforms,  the  de- 
signer has  full  control  over  the  reso- 


0SCILLAT0R 
AND 

CLOCK  CONTROL 


lb) 


CAS 


lution  between  the  rising  edges  of 
the  TRG  and  SC  signals. 

More  events  and/or  channels 

In  some  applications,  12  output 
channels  are  sufficient  but  more  than 
32  event  states  are  necessary.  These 
additional  states  can  be  obtained  by 
connecting  two  PEGs  as  shown  in 
Fig.  6.  After  PEG  1  has  cycled 
through  its  32  states,  its  T0  line 
triggers  PEG  2.  The  second  PEG 
will  then  begin  its  timing  sequence 
at  the  start  address  specified  by  PEG 
1.  If  necessary,  the  clock  frequency 
of  PEG  2  may  be  changed  by  using 
the  clock  output  of  PEG  1  as  the 
clock  input  to  PEG  2.  Otherwise,  the 
internal  clocks  and  TRIG  polarities 
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Fig.  6  By  connecting  two  PEGs,  de- 
signers can  obtain  timing  waveforms 
that  consist  of  more  than  32  events. 

of  the  two  PEG  chips  are  indepen- 
dently programmable. 

If  the  waveforms  must  be  nested, 
one  of  the  output  taps  of  PEG  2  can 
be  used  to  remove  the  TRIG  signal 
from  PEG  1.  PEGs  can  also  be  par- 
alleled for  applications  in  which  32 
event  states  are  sufficient  but  more 
than  12  output  channels  are  needed. 
For  example,  when  12  to  24  channels 
are  needed,  the  designer  can  simply 
drive  two  PEG  chips  with  the  same 
input  signals  and  use  as  many  of  the 
output  lines  as  necessary.  □ 


Fig.  5.  In  this  DRAM  delayed-write  cycle,  interval  1  must  be  kept  to  within  10 
or  15  ns,  and  interval  2  must  be  as  short  as  possible.  When  mounted  on  the 
same  DRAM  memory  board,  the  PEG  could  be  used  to  generate  the  RAS, 
CAS.  and  WE  signals. 
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The  trends  toward  faster  micropro- 
cessors and  denser  dynamic  ran- 
dom-access memories  have 
sparked  a  conflict.  New-generation  mi- 
croprocessors require  machine  cycle 
times  of  ISO  to  250  ns  (Table  1).  But 
traditionally  as  DRAM  density  increases, 
cycle  times  slow.  Needed  is  a  way  to 
combine  high  performance  with  zero- 
wait- state  operation. 

Although  hardware-intensive  cache 
schemes  can  work,  they  are  very  costly. 
It  is  possible,  however,  to  improve  sys- 
tem performance  with  little  additional 
hardware  by  substituting  enhanced- 
page-  and  static-column-mode  cycles  for 
regular  memory  cycles,  thus  cutting 
down  the  number  of  wait  states.  The 
resulting  system  would  be  especially 
useful  for  small  systems  and  subsystems 
with  space  restrictions. 

Our  design  incorporates  a  DRAM  such 
as  the  Am90C256,  a  CMOS  device  that 
offers  uninterrupted  serial  or  random  ac- 
cess to  512  bits  of  data,  or  the 
Am90C257,  which  combines  dynamic 
storage  and  static  decode  to  nearly  dou- 
ble sequential  access  rates.  These  de- 
vices provide  extended-page  and  static- 
column  access  modes  at  a  small  increase 
in  cost.  These  modes  permit  data  access 
times  of  40  to  60  ns,  enabling  the  mem- 
ory controller  to  remove  wait  states  for 
several  processor  cycles. 

The  only  additional  hardware  is  an 
AmPAL22V10,  a  24-pin  programmable 
logic  device  that  offers  an  AND/OR  logic 
structure  for  custom  programming,  22 
inputs,  and  10  outputs  with  12  product 
terms  each  to  provide  timing  and  arbitra- 
tion for  extended-page-mode  and  refresh 
cycles.  Also  needed  are  two  devices  to 
latch  and  compare  row  addresses  and  a 
counter  to  track  the  extended-page- 
mode  cycles'  time  out. 

Usually,  microprocessors  perform  a 
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FIGURE  1.  Wait  state  remover  arbitration  scheme. 


number  of  continuous  accesses  in  a  lo- 
calized address  range  to  fill  up  the  in- 
struction pipeline  or  to  store  or  retrieve 
data.  Our  scheme  is  based  upon  this 
locality  of  memory  access,  somewhat 
similar  to  cache  schemes.  It  involves 
performing  an  extended-page-mode  (or 
a  static-column/ripple-mode)  access  cy- 
cle, instead  of  a  regular  memory  cycle. 
The  time  required  for  the  data  to  be  read 
or  written  for  this  cycle  is  the  same  as  for 
a  regular  memory  cycle.  The  cycle's 
row  address  is  stored  and  then  compared 
to  that  of  the  subsequent  cycle.  This  row 
address  would  be  the  same  for  a  local- 


ized access  cycle  within  the  same  row  of 
upto512(fora256K)  or  1054  (for  a  1M 
DRAM)  column  addresses. 

If  the  row  address  is  the  same,  the 
system  performs  a  fast-access,  en- 
hanced-page-mode  cycle  that  requires 
only  the  column  address  and  no  proces- 
sor wait  state.  If  the  addresses  are  differ- 
ent, the  current  extended-page-mode  cy- 
cle is  completed,  and  a  fresh  cycle  is 
performed.at  the  new  row  address.  Com- 
pleting the  previous  cycle  and  initiating 
a  new  one  adds  20  to  40  ns  to  the  normal 
cycle,  depending  on  the  speed  of  the 
DRAM  controller.  Usually,  one  wait- 
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state  processor  cycle  provides  additional 
time  of  50  to  125  ns,  enough  to  squeeze 
in  this  overhead  and  not  cause  further 
wait  states. 

Because  row  addresses  can  be  com- 
pared during  the  same  time  interval  as 
the  address  decoding  and  generation  of  a 
memory  request  or  select,  this  action 
does  not  increase  cycle  time.  The  result 
is  a  regular  cycle  speed  with  one  (or  two) 
wait  states  for  random  memory  accesses 
at  different  row  addresses  and  fast,  no- 
wait-state  cycles  for  localized  accesses 
at  the  same  row  address.  Depending  on 
the  prqgram,  execution  speed  can  in- 
crease 5%  to  20%. 

The  AmPAL22V  10  also  acts  as  a  tim- 
er and  arbiter  for  the  extended-page- 
mode  and  refresh  cycles  (Figure  1  and 
Table  2).  Its  refresh-cycle  counter  runs 
at  16  fjtis .  But  the  DRAM  controller  does 
not  perform  a  refresh  every  time  it  re- 
ceives a  request:  because  the  page-mode 
cycle  can  be  extended  up  to  75  |xs  to 
allow  the  most  CAS-only  fast  cycles, 
refreshes  at  1 6-fj.s  intervals  would  dis- 
rupt operation.  Instead,  the  arbitration 
scheme  is  designed  so  that,  during  an 
extended-page-mode  cycle,  the  memory 
controller  counts  up  to  four  refresh  re- 
quests before  performing  the  refresh 
cycles. 

Description  of  Circuit 
And  Boolean  Logic 

For  the  sake  of  clarity,  the  circuit 
diagram  shows  a  general-purpose  arbi- 
tration interface,  not  a  complete  DRAM 
controller.  It  can  be  either  integral  to  a 
PAL-based  DRAM  controller  or  linked  to 
a  VLSI  DRAM  controller,  depending  on 
the  user's  needs. 

For  256K  DRAMs,  an  Am29843  with 
9-bit  latches  stores  a  9-bit  row  address. 
The  arbiter  PAL  controls  the  latch-enable 
signal,  LE.  A  9-bit  Am29809  compares 
the  stored,  row  address  with  that  of  the 
current  cycle  directly  from  the  processor 
on  address  lines  A9-A17.  If  the  address- 
es match,  the  Am29809  sends  a  MATCH 
signal  to  the  arbiter  PAL.  The  arbiter  pal 
also  receives  the  memory-request 
(MREQ)  signal  from  the  address-decod- 
ing circuitry  (not  shown  here). 

Of  the  AmPAL22V  1 0's  two  counters , 
the  refresh  counter  is  free  running  and 
generates  refresh  requests  (RFCLK)  at 


Processor 

Cycle  time  (ns) 

ft  uuaH 

u  wan 

1  uuait 

i  wan 

&  wans 

8088/8086  (8  MHz) 

500 

625 

750 

8088/8086  (10  MHz) 

400 

500 

600 

80286  (8  MHz) 

250 

375 

500 

80286  (10MHz) 

200 

300 

400 

80386  (8  MHz) 

188 

250 

313 

68020  (16  MHz) 

188 

250 

313 

68020  (20MHz) 

150 

200 

250 

TABLE  1.  Microprocessor  cycle  times. 
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/CYCFLAG:  = 
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/SLOC: 
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•  RFSC  *  RCT1  •  RCT0 

cycle 

+  /MREQ  '  /CYCFLAG  1  /MATCH  •  SLOC 
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matched  new 
extended  cycle 
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Hold  till  complete 

/FSTC: 

/MERQ  •  /CYCFLAG  *  MATCH  '  SLOC 

Match  CAS  only 
fast  cycle 

/RCT1: 

(RCT1  •  /RCTO  +  /RCT1  »  RCTO) 
•  RFCLK 

Count  up 

+  CT  *  RFSC 

Count  down 

*  (/RCT1  *  /RCTO  +  RCT1  *  RCTO) 
+  /RFCLK  *  (/CT  +  /RFSC  *  /RCT1 

Hold  count 

/RCT0: 

RFCLK  *  RCTO 

Count  up 

+  CT  *  RFSC  *  RCTO 

Count  down 

+  /RFCLK  *  (/CT  +  /RFSC)  *  /RCTO 

Hold  count 

CT: 

/RFSC 

TABLE  2.  Listing  ol  equations  tor 

fixed  16-u.s  intervals.  The  counter  for 
the  extended-page-mode  cycle  is  con- 
trolled by  the  arbiter  PAL,  which  starts 
the  counter  by  sending  a  signal  (CYC- 
FLAG). On  completion  of  the  75-p.s  tim- 
ing count,  the  counter  sends  a  time-out 
signal,  TOUT,  to  the  arbiter,  forcing  it  to 
complete  the  current  cycle  and  remove 
the  CYCFLAG  signal.  If  the  arbiter  re- 
quires a  new  extended-page-mode  cycle 


arbiter. 

at  a  different  row  address  before  this 
timeout,  it  resets  the  counter  by  remov- 
ing the  CYCFLAG  signal.  So  the  counter 
starts  when  CYCFLAG  is  high  and  stops 
or  resets  when  CYCFLAG  is  low. 

The  arbiter's  operation  is  quite  sim- 
ple. It  uses  registers  SLOC,  FSTC,  and 
RFSC  as  flags  to  indicate  a  cycle  request 
to  the  memory  controller.  SLOC  requests 
the  initiation  of  an  extended-page-mode 


CI  MonoHtMo  KM 


2-649 


Wait-State  Remover  Improves  System  Performance 


cycle.  This  new  cycle  continues,  even 
alter  completion  of  a  data  transfer,  as 
long  as  CYCFLAG  is  high.  FSTC  requests 
the  initiation  of  a  fast  CAS-only  cycle  at 
the  same  row  address  at  which  an  ex- 
tended-page-mode cycle  is  in  progress, 
meaning  that  CYCFLAG  must  be  high  for 
the  FSTC  signal  to  he  asserted. 

The  third  RI  SC  requests  the  initiation 
of  a  refresh  cycle.  All  cycle  requests  for 
new  slow  extended-page-mode  cycles, 
fast  CAS-only  extended-page-mode  cy- 
cles, and  refresh  cycles  are  arbitrated  on 
the  basis  of  inputs  MRF.Q  and  MATCH, 
and  status  feedbacks  of  CYCTLAG  and 
(RCTO.  RCTI ).  a  counter  to  keep  track  of 
number  of  refresh  requests.  A  new  cycle 
request  is  arbitrated  only  when  no  cur- 
rent cycle  request  is  active,  indicated  by 
SLOC.  FSTC.  and  RFSC  being  low. 

When  CYCFLAG  is  low.  the  DRAM 
controller  should  complete  the  existing 
extended-page-mode  cycle  (usually  by 
deactivating  both  RAS  and  CAS  signals  to 
the  memory).  As  shown  in  the  Boolean 
equations.  CYCFLAG  goes  low  if  at  least 
one  of  the  following  three  conditions  is 
encountered: 

1 .  Extended-page-mode  cycle  timeout 
(TOUT)  high. 

2.  Refresh  count  up  to  four  (RCTO  =  I 
and  RCTI  =  1 ). 

3.  New  cycle  at  a  different  row  ad- 
dress MRFQ  and  MATCH. 

This  allows  arbitration  to  begin  afresh 
for  a  new  cycle.  Because  CYCFLAG  is 
generated  for  each  new  extended-page- 
mode  cycle,  it  is  also  used  as  a  latch- 
cnablc  signal  for  the  9-bit  row-address 
latch. 

The  counter  made  by  register  RCTO 
and  RCTI  remembers  refresh  requests 
made  during  an  extended-page-mode 
cycle.  It  counts  up  by  one  upon  receiv- 
ing each  RFCLK,  to  the  maximum  of 
four:  it  counts  down  by  one  when  a 
refresh  cycle  is  executed.  An  extra  regis- 
ter. CT,  monitors  refresh-cycle  execu- 
tion by  detecting  the  rising  edge  of  the 
refresh-cycle  request  RFSC.  CT  and  RFSC 
low  indicate  the  refresh  cycle's  comple- 
tion, and  this  is  used  to  decrement 
counter  RCTO,  RCTI.  The  refresh  cycles 
are  repeated,  and  no  memory  cycles  are 
allowed,  until  this  counter  is  reset  to 
zero.  □ 
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By  using  software  to  define  programmable 
logic  dances  as  run-length-limited  encode/ 
decode  systems,  you  can  design  disk-drive 
systems  that  have  50%  more  data-storage 
capacity  than  drives  that  implement  the 
MFM  code. 


Arthur  Khu  and  Rudy  Sterner, 

Advanced  Micro  Devices  Inc 

When  you're  designing  a  disk-drive  system,  you  can 
implement  run-length-limited  (RLL)  2,7  encoding/de- 
coding circuitry  in  your  design  by  using  only  three 
programmable-logic  devices  (PLDs)  and  two  shift  reg- 
isters. You  design  the  encoder  and  decoder  as  state 
machines  and  use  the  timing  diagrams  to  determine 
what  the  timing  and  control  signals  must  be. 

To  create  a  disk  controller  with  encoding/decoding 
features,  you  can  use  three  AmPAL22V10  PLDs  and  a 
disk  controller  such  as  the  Am95807Am9582  chip  set 
(Fig  1).  The  Am9580  hard-disk  controller  and  the 
Am9582  disk-data  separator  perform  all  the  general 
disk-control  functions.  The  PLDs  have  appropriate 
architectures  for  implementing  the  encoding  and  decod- 
ing state  machines.  Further,  you  can  reprogram  the 
PLDs  to  implement  higher  density  ratios  for  data 
encoding,  and  you  can  increase  your  system's  speed 
simply  by  using  faster  PLDs. 

An  RLL  code  is  a  code  in  which  the  number  of  zeros 
between  ones — the  run  length — is  definite.  The  "2,7" 
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designation  means  that  the  code  for  the  binary  data 
string  has  at  least  two  and  at  most  seven  zeros  separat- 
ing the  ones.  RLL  codes  increase  the  density  of  data 
stored  on  a  disk  by  reducing  the  number  of  recorded 
pulses  (ones)  necessary  to  represent  a  given  amount  of 
data.  This  reduction  allows  the  disk-drive  circuitry  to 
pack  the  ones  closer  together,  increasing  the  amount  of 
data  on  the  disk. 

In  comparison  with  the  (de  facto)  industry-standard 
approach,  MFM,  the  RLL  2,7  code  increases  by  50% 
the  amount  of  data  you  can  store  on  a  disk  drive  (see 
box,  "RLL  2,7  code  vs  MFM  code").  In  addition,  RLL 


DRIVE 
CONTROL 


HARD-DISK 
CONTROLLER 
Am9580 


DRIVE 
INTERFACE 


DISK-DATA 
SEPARATOR 
CONTROL 


77 


DISK- DATA 
SEPARATOR 
Am9S82 


DATA 

RLL-PLD 
SYSTEM 

Fig  I — .4  complete  disk  controller  requires  only  two  VLSI  ICs,  a 
PLD  based  encoding/decoding  system,  and  a  drive  interface.  The 
hard-disk  controller  and  the  disk-data  separator  provide  generic 
disk-drive  control,  and  the  PLDs  provide  RLL  2,7  encoding  and 
decoding,  which  increases  disk  storage  capacity.  


TABLE  1  -  RLL  2,7  CODING  RULES 


DATA 

2.7  CODE 

10 

1000 

11 

0100 

000 

100100 

010 

001000 

011 

000100 

0010 

00001000 

0011 

00100100 

2,7  decoding  circuitry  recovers  quickly  from  code-de- 
tection errors. 

As  Table  1  shows,  RLL  2,7  encoding  circuitry  trans- 
lates seven  data  strings  into  2,7  code.  You  can  break 
any  binary  non-return-to-zero  (NRZ)  data  string  into 
combinations  of  the  seven  data  strings.  To  obtain  the 


decoded  data  string,  the  circuitry  matches  the  2,7  code 
patterns  with  the  seven  2,7  code  strings  in  Table  1. 

Because  2,7  code  strings  have  variable  lengths  (they 
can  be  2,  3,  or  4  bits  long),  your  design  will  need  control 
logic  that  controls  the  output  from  the  encoder/decoder 
as  translation  takes  place.  Encoding  and  decoding  state 
machines  (Figs  2  and  3)  implement  this  control  logic 
from  the  code  in  Table  1  (see  box,  "Convert  RLL  2,7 
code  to  a  state  machine").  The  encoding  state  machine 


KEY: 

DATA  BIT  TO 
CODE 

0*(01W-2  LOOK-AHEAD  BITS 
00.^2.7  CODE  FOR  . 
DATA  BIT  0 
AMC  — IF  ADDESS  MARK 
00_    SIGNAL  HIGH, 

^2.7  CODE  PRODUCED 


NOTE:  STATES  0  3.  4  AND  5  ARE  USED  FOR  ADDRESS-MARK  GENERATION 


TO  CODE  00101100  . 
CODE  CYCLE 


CURRENT  STATE 

0 

2 

0 

CONTENTS 

OF  4-BiT 

INPUT 

SHIFT 

REGISTER 

INREG 

BIT  TO  CODE 

BIT  TO  CODE 

BIT  TO  CODE 

SERIAL  OUT-«-|  0  |  0  |  1  |  0  [-^SERIAL  IN 

SERIAL  OUT  -#-|  0  j  1  |  0  |  1  [-*-  SERIAL  IN 

2-BIT  LOOK-AHEAD 
SHIFT  IN  A  NEW  BIT 
BEFORE  GOING  TO 
CYCLE  T, 

2-BIT  LOOK-AHEAD 
SHIFT  IN  A  NEW  BIT 
BEFORE  GOING  TO 
CYCLE  Ts 

2.  7  CODE  TO 
LOAD  INTO  OUTREG 

00 

00 

NEXT 
STATE 

2 

0 

Fig  2 — The  encoder  state  machine  (a!  describe*  tin-  translation  of  input  data  t  ula  RLL  £.7  code.  The  first  two  cycles  in  the  encoding  of  the 
data  string  00 101 100  (b)  demonstrate  how  the  encoder  determines  the  correct  code  by  examining  both  the  bit  to  be  encoded  and  the  next  two 
(look-ahead)  bits. 
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During  decoding,  circuitry  matches  the 
RLL  2,7  code  patterns  with  the  seven  RLL 
2,7  code  strings  to  obtain  the  corresponding 
decoded  data  string. 


produces  two  2,7  code  bits  for  each  data  bit  received. 
The  decoding  state  machine,  on  the  other  hand,  pro- 
duces one  data  bit  for  every  two  bits  of  2,7  code.  The 
clocking  scheme  in  these  encoding/decoding  state  ma- 
chines is  simpler  than  the  familiar  table-look-up  meth- 


od, which  requires  suspended  operation  during  encod- 
ing and  decoding. 

You  can  implement  both  of  these  encoding/decoding 
2,7  state  machines  with  one  PLD  device  (ICi)  and  two 
shift  registers  (INREG  and  OUTREG)  (Fig  4).  To 


KEY: 

2  CODE  BITS  TO  DECODE 
10J00J-2-COOE-BIT  LOOK-AHEAD 
1  —DECODED  DATA  BIT 

00—2  CODE  BITS  TO 

DECODE;  NO  NEED 
FOR  LOOK-AHEAD 
SD6C00ED  DATA  BIT 
00—2  CODE  BITS  TO  DECODE 
AMF— ACTIVATE  ADDRESS- 
MARK-FOUND  SIGN 


NOTE:  STATES  0,  3.  5.  AND  6  ARE  USED  WHEN  DETECTING  ADDRESS  MARKS 


V' 


TO  DECODE  0O1OOG10 


DECODE  CYCLE 

So 

S, 

S, 

CURRENT 
STATE 

0 

3 

2 

2  CODE  BITS  TO  DECODE 

^^o|!|,|ok-^ 

2  CODE  BITS  TO  DECODE 

Suti4LhV||I°I^?nerial 

2  CODE  BITS  TO  DECODE 

^o|o|,|oh- 

CONTENTS 

OF  4-BIT 

INPUT 

SHIFT 

REGISTER 

INREG 

2-CODE-BIT  LOOK-AHEAD 

SHIFT  IN  TWO  NEW  CODE 
BITS  BEFORE  GOING 
TO  CYCLE  S, 

2-CODE-BIT  LOOK-AHEAD 

SHIFT  IN  TWO  NEW  CODE 
BITS  BEFORE  GOING 
TO  CYCLE  S2 

DECODED  DATA 
BIT  TO  LOAO 
INTO  OUTREG 

0 

1 

NEXT 
STATE 

3 

2 

Fig  3 — This  decoder  state  machine  (a)  represents  the  translation  of RLL  2,7  code  into  an  output  data  stream.  To  determine  the  correct  output 
data,  the  decoder  examines  four  code  bits — two  bits  to  be  decoded  and  two  look-ahead  bits,  as  shown  in  the  example  in  b. 
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Because  the  RLL  2,7  code  contains  two  bits 
for  every  one  bit  of  data,  the  timing  cir- 
cuitry divides  the  clock  signal  for  the  coded 
data,  producing  a  data  clock. 


synchronize  the  encoder/decoder  with  the  hard-disk 
controller,  you  use  two  other  PLDs  (IC>  and  Id)  to 
provide  clock-generation  and  address-mark-control 
logic.  ICi  is  a  AmPAL22V10,  which  has  sufficient 
capacity  to  implement  the  two  state  machines.  IC2,  also 
an  AmPAL22V10,  utilizes  the  PLD's  large  capacity  and 
programmable  output  cells  to  implement  the  address- 
mark-control  circuitry.  An  AmPAL16HD8  (IC:I),  which 
is  sufficient  for  implementing  the  clock  circuitry  and  the 
random  logic,  completes  the  design. 

To  understand  the  state-machine  implementation  of 
the  RLL  2,7  encoder,  consider  the  state  machine  in  Fig 


2.  The  encoding  state  machine  begins  in  state  zero;  the 
first  four  bits  of  the  data  to  be  encoded  are  in  the  shift 
register  INREG.  For  the  data  stream  in  the  figure,  the 
encoder,  beginning  in  the  first  cycle  (T„)  reads  the  first 
bit  in  the  stream  as  0  and  sees  that  the  next  two  bits 
(the  look-ahead  bits)  in  INREG  are  0  and  1,  respective- 
ly. According  to  the  state  diagram,  the  encoder  pro- 
duces a  00  output  because,  as  Table  1  shows,  input  data 
starting  with  001  translates  to  a  character  string  that 
starts  with  00.  The  encoder  then  enters  state  2,  shifts 
the  encoded  0  out  of  INREG,  and  shifts  in  the  next 
(fifth)  bit  of  the  data  to  be  encoded. 


RLL  2,7  code  vs  MFM  code 

Although  the  RLL  2,7  and  MFM 
coding  methods  can  both  in- 
crease a  disk  drive's  capacity, 
RLL  2,7  code  is  more  compact. 
A  disk  drive  that  implements 
the  RLL  2,7  code  can,  therefore, 
store  50%  more  data  than  can  a 
drive  that  implements  the  MFM 
code. 

On  the  magnetic  medium  in 
the  disk  drive  (hard-disk  or  flop- 
py-disk drives),  binary  data  ap- 
pears as  a  change  in  flux  (repre- 
senting a  one)  or  as  no  change  in 
flux  (representing  a  zero).  Be- 
cause the  disk  density  is  limited 
by  the  minimum  distance  be- 
tween flux  transitions,  the  maxi- 
mum data  density  depends  on 
how  the  data  is  encoded. 

MFM  is  an  RLL  code  with  the 
designation  1,3;1,2;1.  RLL  2,7 
code  (its  full  designation  is 
2.7;1,2;3)  allows  a  minimum  of 
two  zeros  between  each  one,  and 
MFM  code  allows  a  minimum  of 
one  zero.  RLL  2,7  code  can 
store  as  much  as  50%  more  data 
in  a  given  number  of  flux 
changes  than  can  MFM  code; 
therefore,  RLL  2,7  code  can 
store  as  much  as  50%  more  data 
on  a  given  section  of  magnetic 


DATA  H0011001 


I      I      I      I      I      I      I      I      I       I      I      I      I      I      I      I      I      I      I      I  I 


C  CLOCK  PULSE 

OATA  I  1 
CODE 


MINIMUM  FLUX- 
TRANSITION  SPACING 

II    0OI1  0011 
0100  00100100  00100100 


Fig  A — Data  storage  that's  50%  more  dense  is  the  principal  advantage  of  RLL  2,7  code 
over  the  de  facto  standard  MFM  code.  Because  it  needs  fewer  transitions  to  describe 
data,  the  RLL  2.7  cade  takes  up  only  67%  of  the  disk  storage  space  that  MFM  code 
occupies. 


medium. 

Consider  the  example  in  Fig 
A.  The  data  stream  1100110011 
is  represented  by  eight  transi- 
tions in  MFM  code,  and  by  five 
transitions  in  RLL  2,7  code. 
When  the  disk  drive  uses  the 
minimum  distance  between  tran- 
sitions to  record  the  RLL  2,7 


code,  the  RLL  2,7  code  takes 
67%  of  the  space  that  the  MFM 
code  takes,  so  it  has  space  avail- 
able to  store  50%  more  data.  In 
most  applications,  the  actual 
storage  increase  is  between  35% 
and  40%,  because  some  disk 
space  is  reserved  for  sector  and 
data-field  markers. 
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In  the  second  cycle  (T|)  of  the  encoding  process,  the 
encoder  sees  that  the  data  bit  is  0  and  the  two  look- 
ahead  bits  are  1  and  0,  respectively.  According  to  the 
state  diagram,  when  the  encoder  is  in  state  2  and  sees 
010,  its  output  is  00.  As  before,  the  encoder  then  moves 
to  the  next  state  (in  this  case,  state  0)  and  shifts  a  new 
bit  into  INREG.  The  rest  of  the  encoding  process 
proceeds  similarly. 

The  decoding  process  is  similar  to  the  encoding 
process.  The  decoder,  which  is  described  as  a  state 
machine  (Fig  3),  accepts  two  input  bits  in  RLL  2,7 
format  and  sees  the  next  two  coded  bits  as  look-ahead 
bits.  In  contrast  to  the  encoder,  the  decoder  produces 
one  output  bit  for  every  two  input  bits. 

To  implement  this  encoder  and  decoder  circuitry  with 


PLDs,  you  can  use  PLD-design  tools  such  as  CUPL 
from  Personal  CAD  Systems  Inc  (San  Jose,  CA)  and 
Abel  from  Data  I/O  Corp  (Redmond,  WA).  These 
software  tools  include  syntaxes  that  you  can  use  to 
describe  state  machines  as  well  as  general  logic  equa- 
tions. 

To  control  the  rate  at  which  ICi  (in  Fig  4)  receives 
data  and  code,  IC,.  and  ICi  implement  the  timing  signals 
shown  in  Fig  5.  Three  signals  (Read,  Write,  and 
Code_Clock)  from  the  hard-disk  controller  and  disk- 
data  separator  form  the  basis  of  the  timing  signals. 

First,  the  timing  circuitry  produces  a  clock  signal, 
Rd_Clk,  from  the  Code_Clock  signal  that  originates  at 
the  disk-data  separator's  FDDAM  output.  Because  the 
RLL  2,7  code  contains  two  bits  for  every  one  bit  of 
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Fig  4 — The  encoding/decoding  circuitry  includes  three  PLDs.  IC,  implements  thv  encoding  und  decoding  state  machines.  /(*,.  monitors  l(\ 
and  provides  special  timing  and  input  signals.  IC.,  implements  glue  logic  ami  timing  signals. 


Before  the  disk-drive  system  can  decode  data 
on  the  disk,  it  must  synchronize  itself  with 
the  disk's  data  clock  to  find  out  when  the 
data  bits  begin. 


data,  the  timing  circuitry  divides  the  clock  signal  for 
the  coded  data  (Code_Clock),  producing  a  data  clock 
signal,  Rd_Clk.  The  timing  circuitry  then  uses  Code- 
Clock  and  RcLClk  to  control  the  timing  of  shift- 
register  control  signals  Shift—Out  and  Shift— In. 

To  obtain  the  equations  you  need  to  program  the 
PLDs,  examine  the  timing  diagrams.  The  timing  dia- 
grams show  that  the  state-machine  design  requires 
only  a  few  timing  signals  to  implement  the  controller. 


Because  the  timing  circuitry  loads  and  shifts  data 
produced  by  ICi  at  the  points  indicated  on  the  timing 
diagram  in  Fig  5,  the  only  clock  signal  that  IC,  requires 
to  code  or  decode  data  in  INREG  is  the  Rd_Clk  signal. 

To  control  the  transfer  of  data  and  code,  the  Shift-In 
signal  latches  data  or  code  into  INREG,  and  the 
Shift-Out  signal  controls  the  output  of  OU.TREG.  For 
example,  when  encoding  data,  the  encoder  produces 
two  bits  of  code  on  each  rising  edge  of  Rd_Clk.  Because 


Convert  RLL  2,7  code  to  a  state  machine 


You  can  use  a  simple  algorithmic 
procedure  to  convert  a  code 
table  to  an  encoding  state  ma- 
chine. For  each  row  of  the  code 
table,  you  create  a  simple  2- 
state  expression  for  the  conver- 
sion of  the  data  into  the  code. 
Then  you  combine  the  expres- 
sions into  one  state  machine  for 
the  entire  table. 

For  the  code  in  Table  1  of  the 
accompanying  article,  you  con- 
sider each  data  bit  and  the  asso- 
ciated pair  of  bits  of  RLL  2,7 
code.  For  example,  the  first  row 
of  the  table  contains  two  data 
bits:  one,  which  is  associated 
with  the  code  10,  and  zero, 
which  is  associated  with  the 
code  00.  The  state  machine  for 
this  row  begins  in  state  zero 
(which  is  arbitrarily  assigned) 
and  changes  state  when  it  sees 
that  the  first  data  bit  is  a  one 
and  the  next  bit  (the  look-ahead 
bit)  is  a  zero.  This  change  of 
state,  which  appears  graphically 
in  Fig  Aa,  results  in  the  out- 
put 10. 

Note  that  the  state  machine 
must  evaluate  the  look-ahead 
bit.  If  this  bit  is  a  one  instead  of 
a  zero,  the  input  data  will  be  11, 
corresponding  to  the  second  row 
of  the  table,  so  your  state  ma- 
chine must  generate  a  01.  For 


RLL  2.7  CODE 


(a)  STATE  TRANSITION  FOR 
2  7  DATA  ROW  10 


(c)  ENCODE  DATA  ROW  1 1  I 
USING  THE  SAME  PROCEDURE  '. 


(b)  DATA  ROW  10  IS 
COMPLETELY  CODED 
BY  THIS  STATE 
MACHINE 


<d)  COMBINE  THE  STATE 
MACHINES  FOR  DATA 
ROWS  10  AND  11 


Fig  A— Creating  a  state  machine  from  a  code  table  is  a  2-part  process.  Firsl,  you 
create  a  simple  state  machine  for  each  row  in  the  table,  and  then  you  combine  the  state 
machines  into  one  state  machine  for  the  whole  table.  


this  code  table,  no  more  than 
two  look-ahead  bits  are  neces- 
sary for  encoding  any  data  bit. 

If  the  state  machine  is  in  state 
one,  and  the  input  data  is  zero, 
the  state  machine  produces  a  00 
output  and  returns  to  state  zero 
(Fig  Ab.)  Note  that  the  data  bit 
encoded  in  this  state  is  the  look- 
ahead  bit  from  the  previous 
state  and  that  no  look-ahead  bit 
is  necessary  for  encoding  be- 
cause the  zero  is  the  last  bit  in 
data  row  10. 


You  use  this  procedure  to  cre- 
ate state  machines  for  the  other 
table  rows  as  well.  Fig  Ac,  for 
example,  contains  the  state  ma- 
chine for  data  row  11.  Because 
all  the  state  machines  have  the 
same  beginning  state  (state 
zero),  you  can  combine  them  to 
form  the  complete  encoding 
state  machine.  Furthermore,  the 
state  machines  may  share  other 
states,  as  shown  in  the  combina- 
tion of  the  two  data  i 
(Fig  Ad). 
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the  Load  signal  is  high  at  the  same  time  that  Rd_Clk  is 
high,  the  bits  are  loaded  into  the  shift  register.  Because 
the  frequency  of  Shift-Out  is  twice  that  of  Rd_Clk, 
Shift-Out  shifts  bith  encoded  bits  out  before  the  next 
encoded  bits  are  loaded.  Shift_In  presents  the  next 
data  bit  to  the  encoder  at  the  same  time  that  the  second 
encoded  bit  is  shifted  out,  starting  the  next  encode 
cycle. 


Before  the  disk-drive  system  can  decode  data  on  the 
disk,  it  must  synchronize  itself  with  the  data  clock  from 
the  disk  and  find  out  when  the  data  bits  begin.  To  assist 
the  circuitry  in  synchronization  and  initialization,  you 
can  place  synchronization  signals,  as  well  as  a  marker 
indicating  the  beginning  of  data,  at  the  beginning  of  the 
RLL  code.  When  the  RLL  circuitry  reads  these  pat- 
terns, it's  ready  to  decode  RLL  data. 
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ON  Am9582  DISK-DATA  SEPARATOR 

READ  WRITE    READ  WRITE  MODE  SIGNALS  FROM  Am9580 
HARD-DISK  CONTROLLER 

RD-CLK   DATA-RATE  CLOCK  DERIVED  FROM  CODE. CLOCK 

SHIFT. IN   COMBINATORIAL  SIGNAL  FROM  ICi 

SHIFT  OUT  LOAD   COMBINATORIAL  SIGNALS  FROM  ICi 


Fig 

that 


S — The  timing  signals  from  ICi  and  Id  control  the  loading  and  shifting  of  code  and  data  in  the  IN REG  and  OCT  REG  rrgwtl'rx,  Xotc 
the  c/ocfr  and  shifting  signals  for  the  coded  hits  have  tutee  the  frequency  of  those  for  the  data  hits,  a  situation  that  corresponds  to  the 


density  ratio  between  code  and  data. 
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If  the  decoder  detects  any  pattern  before  it 
detects  the  address  marker,  the  circuit  resets 
itself  and  begins  the  initialization  sequence 
anew. 
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Fig  6 — The  ability  to  recover  from  an  error  is  a  useful  feature  of 
RLL  1,?  code.  An  error  at  cycle  E,  results  in  incorrectly  translated 
data  until  the  machine  recovers  fully. 


The  synchronization  field  comprises  a  series  of  pat- 
terns that  allow  the  phase-locked  loop  in  the  disk-data 
separator  to  synchronize  to  the  frequency  of  the  incom- 
ing data.  These  patterns  represent  the  maximum  fre- 
quency input.  For  RLL  2,7  code,  that  input  is  100100, 
because  this  code  has  the  fewest  permissible  zeros 
between  ones.  Using  the  highest  possible  frequencies 
minimizes  the  synchronization  time,  so  the  patterns  are 
completed  quickly. 

A  marker  that  follows  the  synchronization  field  indi- 
cates the  beginning  of  data.  This  marker,  the  address 
mark,  must  be  distinct  from  all  other  code  words.  For 
example,  you  can  use  a  pattern  that  violates  the  code 
rules,  such  as  a  string  of  zeros  (this  string  is  called  "dc 
erase"  because  it  removes  all  flux  transitions  from  the 
recording  medium).  Alternatively,  you  can  use  a  pat- 
tern that  obeys  the  code  rules  but  is  not  a  defined  code 
pattern.  In  the  examples  in  this  article,  the  pattern 
00000100  identifies  the  beginning  of  data.  Although  this 
pattern  is  not  a  defined  code  word,  it  doesn't  violate  the 
RLL  2,7  code  rules  as  long  as  it's  preceded  by  no  more 
than  two  zeros  in  a  row. 

When  the  100100  pattern  is  synchronizing  the  circuit- 
ry, the  circuitry  produces  the  Rd_Clk  signal,  which  is 
in  phase  with  the  pattern.  In  order  for  the  decoder  to 
operate  properly,  the  Rd_Clk  signal  must  rise  with  the 
first  one  in  the  pattern  and  fall  with  the  second  one. 
However,  because  of  the  repetitive  nature  of  the  pat- 
tern, the  circuitry  could  produce  a  falling  edge  of 
Rd_Clk  on  the  first  one  in  the  pattern  and  a  rising  edge 
on  the  second  one,  in  which  case  the  circuitry  would  not 


be  in  synchronization  with  the  beginning  of  the  data. 

The  encoding  circuitry  resolves  this  synchronization 
problem  by  inserting  the  pattern  "0100"  eight  times 
between  the  synchronization  field  and  the  address 
mark.  The  phase-locked-loop  system  locks  onto  the 
0100  pattern,  and  the  Rd_Clk  control  circuitry  in  ICz 
sets  itself  to  decode  the  data  correctly. 

To  put  the  synchronization  and  marker  data  into  the 
RLL  2,7  code  on  the  disk,  you  must  design  the  RLL  2,7 
encoder  to  produce  the  signals.  When  the  signal  WG 
(from  the  hard-disk  controller  IC3)  goes  high,  the  data  is 
encoded  to  all  zeros.  The  encoder  translates  the  zeros 
as  the  synchronization  pattern  100100,  which  is  then 
stored  on  disk. 

Next,  the  address-mark-control  (AMC)  signal  from 
the  hard-disk  controller  sets  an  internal  latch  in  IC2, 
producing  the  output  Am_Latch,  which  forces  IC3  to 
put  ones  in  the  INREG  register.  The  encoding  circuitry 
codes  the  ones  as  the  string  0100.  After  the  eighth  0100 
pattern,  IC2  sets  the  Complt  signal  high.  On  sensing 
Complt,  IC,  writes  the  address  mark. 

The  encoding  circuitry  must  have  the  first  four  data 
bits  in  INREG  by  the  time  the  address  mark  is  written. 
ICi  writes  two  patterns  for  the  address  mark.  At  the 
start  of  the  second  address-mark  pattern,  ICi  sets  AMF 
(Address  Mark  Found)  High  and  resets  Am_Latch  low. 
The  assertion  of  AMF  signals  the  hard-disk  controller 
to  begin  shifting  data  into  INREG  for  encoding.  Be- 
cause four  Rd_Clk  cycles  occur  while  the  address-mark 
pattern  is  being  written,  the  first  four  bits  are  shifted 
into  INREG  by  the  time  the  second  address  mark  is 
written. 

The  decoder  circuit  has  two  safeguards  that  prevent 
it  from  identifying  the  address-mark  pattern  incorrect- 
ly. First,  the  decoder  must  detect  at  least  five  0100 
patterns  before  it  acknowledges  the  address-mark  pat- 
terns. Second,  if  it  detects  any  pattern  before  it  detects 
the  address  mark,  the  circuit  resets  itself  and  begins 
the  initialization  sequence. 

IC2  implements  these  two  safeguards  by  monitoring 
the  RLL  code  that  ICi  decodes.  When  the  hard-disk 
controller  asserts  the  RG  and  AMC  signals,  IC2  sets 
Complt  and  Am_Latch  low.  ICZ  sets  Complt  high  when 
it  detects  the  fifth  consecutive  0100  pattern,  enabling 
ICi  to  detect  the  address  mark.  If  the  input  to  ICi  is 
anything  other  than  a  0100  pattern  or  an  address  mark, 
IC2  sets  Complt  low,  and  the  initialization  begins  anew. 

If  ICi  successfully  detects  the  address-mark  pattern, 
it  sets  AMF  high.  The  assertion  of  AMF  causes  IC2  to 
set  Am_Latch  high,  thus  enabling  the  output  of  OUT- 
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REG  to  send  decoded  data  to  the  hard-disk  controller. 
Am_Latch  remains  high  as  long  as  RG  is  high. 

RLL  2,7  encoding  also  provides  for  error  recovery. 
Whenever  a  disk-drive  system  reads  code  from  a  disk, 
the  read/write  heads  or  the  transmission  cables  can 
cause  transmission  errors.  One  of  the  properties  of 
RLL  2,7  code  is  that  any  single-bit  error  (for  example,  a 
coded  one  detected  as  a  zero)  will  correct  itself  after  a 
run  of  at  most  16  correctly  detected  bits. 

In  short,  whenever  a  2-bit  pattern  doesn't  match  any 
of  the  expected  patterns  for  a  particular  state  of  the 
decoding  circuitry,  the  decoding  state  machine  returns 
to  state  zero  and  generates  a  zero  as  a  translation  for 
the  erroneous  code  bits.  Then  the  decoder  shifts  the 
next  two  code  bits  into  the  INREG  register  and  contin- 
ues decoding  from  state  zero. 

In  such  cases,  the  decoding  state  machine  can  cor- 
rectly decode  coded  data,  but  it  may  not  recover 
immediately  upon  returning  to  state  zero.  As  Fig  6 
shows,  although  the  code  bits  that  the  decoder  detects 
may  be  valid,  the  decoded  data  is  incorrect  because  the 
error  has  forced  that  state  machine  into  the  wrong 
state.  In  this  example,  the  decoder  doesn't  recover  until 
6  code  bits  later  (in  cycle  E6),  when  it  again  falls  into 
the  correct  state  and  accurately  decodes  the  data. 
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Chip  designers  are  creating  powerful  CPUs  and  peripherals 
with  16-  and  32-bit  parts.  Mixing  these  with  8-bit  parts 
overcomes  limitations  imposed  by  established  designs, 
incomplete  families,  and  software  incompatibility. 


by  Mark  S.  Young  and 
James  R.  Williamson 


Integrating  16-  and  .12-bit  peripherals  and  CPUs 
into  8-bit  designs,  at  the  simplest  level,  means 
separating  the  control  and  data  paths  from  new 
peripherals  and  the  systems.  Mixing  different  data 
path  widths  and  control  protocols,  however,  makes 
possible  major  improvements  in  function,  perfor- 
mance, and  cost. 

The  price/performance  curve  of  VLSI  chips,  for 
example,  allows  designers  to  obtain  more  and  bet- 
ter functions  for  the  same  amount  of  money  every 
year.  Alternately,  the  functionality  of  a  device  can 
remain  constant  while  the  price  falls. 

Moreover,  these  new  devices  with  wider  data  paths 
can  extend  the  life  of  older  designs.  For  example, 
many  of  the  most  popular  personal  computers  today 
use  the  8088  microprocessor  and,  therefore,  are  con- 
strained to  an  8-bit  data  path.  Designers  of  add-on 
accessories  for  these  personal  computers  prefer  the 
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newer  16-bil  peripherals.  These  peripherals  will  let 
users  preserve  their  software  investments,  improve 
performance,  and  stave  off  obsolescence. 

Mixing  different  data  path  widths  can  also  enhance 
new  designs.  For  example,  it  is  less  expensive  to  use 
an  8-bit  bus  in  a  new  design  because  the  memory 
requirements  are  generally  cheaper.  Only  half  as 
many  dynamic  RAMs  are  necessary  for  the  same 
number  of  kilobytes  of  memory.  In  addition,  an 
8-bit  bus  needs  much  less  control  and  support  logic. 
Designers  can  mix  smaller  data  path  peripherals  with 
wider  data  path  CPUs.  This  allows  them  to  introduce 
systems  based  on  the  newer,  more  powerful  32-bit 
CPUs  even  before  32-bit  peripherals  are  available. 

Designers  can  use  this  mixing  method  to  obtain 
wider  data  paths  from  existing  designs  until  a  new 
system  design  is  warranted.  They  can  also  use  parts 
in  unexpected  applications.  For  example,  cost- 
conscious  terminal  manufacturers  might  want  to  use 
the  Am8052/8l52A  chip  set  (the  8052  is  an  advanced 
CRT  controller  and  the  8152A  is  a  video  system  con- 
troller) in  new  terminals  based  on  the  relatively 
inexpensive  805 1  microprocessor.  Mixing  the  8-bit, 
single-chip  microprocessor  with  the  1 6-bit  CRT  con- 
troller allows  designers  to  maximize  the  cost/perfor- 
mance ratio  of  the  terminal. 

Mixed  data  path  widths  can  improve  bus  utiliza- 
tion as  well.  A  16-bit  peripheral  in  a  32-bit  system 
only  occupies  half  the  data  bus  for  data  transfers. 
If  the  designer  mixes  the  data  paths  correctly,  how- 
ever, the  16-bit  peripheral  could  transfer  data  as 
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WAIT  UNTIL  PEPiPHERAL  TAKES  BUS 
MAKE  SURE  MEMORY  ACKNOWLEDGE  IS 
NOT  I 


WATT  PC*  MEMORY  ACKNOWLEDGE 


WATT  FOR  MEMORY  ACKNOWLEDGE 
TO  GO  AWAY 


STROBE  IN  DATA  TO  PERIPHERAL 

OEASSERT  WAfT 

WATT  FOR  SUCESSFUL  READ  (DS> 


The  Male  flow  control  diagram  lor  u  loc  master  rear] 
operation  illustrate*  the  tnnirol  sequence  employed 
h>  Ihe  H  16-hil  bus  control  logic. 


ji2-bit  chunks  and  improve  bus  efficiency  by  100  per- 
cent for  that  peripheral. 

Two  central  concerns  stem  from  mixing  devices 
jhat  communicate  over  different-sized  buses.  The 
irst  problem  results  when  two  devices  communicate 
n  a  "common"  data  bus.  Consider,  for  example, 
a  32-bit  system  utilizing  8-  and  16-bit  peripherals, 
ivercoming  the  mismatched  data  paths  requires 
me  form  of  controlled  multiplexing/demultiplexing 
f  the  different  data  paths.  In  addition,  extra  con- 
ol  signals  for  partitioning  the  32-bit  word  into  8-, 
6-,  and  32-bit  chunks  may  be  required. 
Many  16-bit  CPU-based  systems  that  use  8-bit 
eripherals  normally  use  just  the  lower  8  bits  of  the 
ata  bus  to  transfer  data  to  and  from  the  peripheral, 
his  method  does  not  work  in  systems  using  16-bit 
peripherals  and  8-bit  CPUs,  however,  and  it  tends 
to  break  down  in  systems  with  8-bit  peripherals  hav- 
ing bus  master  capability. 

A  bus  multiplexing  method  involves  multiple 
transfers  when  taking  data  from  or  adding  data  to 
a  mismatched  data  bus.  For  example,  before  a  16-bit 
peripheral  can  transfer  data  over  an  8-bit  bus,  the 
16-bit  data  must  be  divided  into  two  8-bit  chunks. 
It  is  then  transferred  sequentially.  First,  the  lower 
8  bits  are  transferred  out  on  the  bus.  Then,  in  the 
next  transfer  cycle,  the  upper  8  bits  of  the  16-bit 
word  are  sent  out.  The  major  difference  in  the  oppo- 
site case — a  bus  read  operation  from  an  8-bit  bus 
to  a  16-bit  device — is  that  the  first  byte  read  from 
the  system  must  be  latched.  Once  the  second  byte 
has  been  fetched,  the  16-bit  peripheral  reads  in  the 
assembled  16-bit  (2-byte)  word.  Additional  provi- 
sions may  be  needed  when  the  16-bit  peripheral  only 
wants  to  access  a  single  byte. 


The  other  major  problem  in  mixed  data  path 
transfers  is  the  actual  data  read/write  operation.  The 
nature  of  the  multiple  transfer  forces  designers  to 
guarantee  that  the  stretched  transfer  will  occur  and 
that  it  will  not  be  interrupted.  Two  aspects  of  stretch- 
ing the  transfer  cycle  from  or  to  the  peripheral  illus- 
trate the  complexity  of  this  problem. 

The  first  case,  when  the  peripheral  is  the  bus 
master,  is  the  simplest.  A  16-bit  peripheral  holds  its 
data  available  for  what  normally  would  be  two  com- 
plete bus  transfer  cycles.  This  function  can  be  per- 
formed when  the  transfer  acknowledge  signal  to  the 
peripheral  is  delayed.  If  the  data  was  latched  instead 
of  holding  the  peripheral  in  a  multiple  word  transfer, 
however,  the  device  could  try  to  send  the  next  16-bit 
data  word  and  its  "new"  address.  The  procedure 
of  latching  the  data  and  releasing  the  peripheral 
should  not  be  used,  therefore,  because  it  may  inter- 
fere with  the  addressing  of  the  remaining  (pending) 
8-bit  transfer. 

Whenever  a  device  acts  as  a  bus  slave  to  a  CPU 
that  cannot  access  the  device's  natural  word  width 
in  a  single  operation,  a  different  constraint  appears. 
The  sequence  must  be  set  up  so  the  peripheral  cannot 
obtain  the  bus  while  the  CPU  is  in  the  middle  of  a 
slave  read/write  operation.  In  a  typical  system,  the 
CPU  is  the  last  device  in  the  interrupt  queue.  It  is 
possible  for  the  peripheral  to  become  bus  master 
between  the  first  and  second  read  operations  and  in- 
validate the  results  of  the  first  read  operation  in  a 
realtime  system.  This  is  because  an  8-bit  CPU  would 
have  to  perform  two  consecutive  read  operations  to 
examine  a  16-bit  peripheral  control  register. 

This  function  can  be  handled  two  different  ways. 
If  the  CPU  has  a  bus  lock  instruction,  as  in  the  iAPX 
family  of  CPUs,  the  programmer  must  use  one  of 
these  instructions  before  the  CPU  accesses  the 
peripheral.  Alternately,  the  CPU  needs  to  disable 
the  arbitration  logic  while  it  is  performing  the  unin- 
terruptible access  with  the  16-bit  peripheral. 

Crucial  cycle 

The  uninterruptible  word  transfer  cycle  is  crucial 
for  maintaining  the  integrity  ol  the  data  transferred. 
When  either  the  CPU  or  a  peripheral  on  the  bus 
makes  an  access  using  the  8/16-bit  control  logic,  it 
must  complete  the  larger  device's  word  access  before 
relinquishing  the  bus.  If  this  requirement  is  not  met, 
a  transfer's  integrity  can  be  violated  easily  by  some 
other  device.  This  interrupts  the  transfer,  and  cor- 
rupts or  aborts  the  multiplexing  sequence. 

To  illustrate  this  point,  consider  a  system  consist- 
ing of  an  8-bit  CPU  and  several  8-  and  1 6-bit  periph- 
erals. Assume  one  of  the  peripherals  is  executing  a 
block  transfer  of  16-bit  data  onto  the  8-bit  bus.  If 
the  CPU  interrupted  the  transfer  in  order  to  poll  the 
peripheral  during  a  half-word  transfer,  two  undesir- 
able events  would  occur.  Either  the  multiplexing 
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sequence  would  be  damaged  irreparably  when  the 
CPU  polled  the  peripheral,  or  the  CPU  would  read 
garbage  from  the  peripheral. 

Designing  the  control  interface  to  allow  mixing 
of  8-  and  16-bit  peripherals  requires  attention  to  the 
data  and  control  flow.  During  a  write  dperation,  the 
data  is  written  out  sequentially:  the  lower  byte  comes 
before  the  upper  byte  (or  vice  versa).  The  read  oper- 
ation differs  only  because  the  data  bus  is  8  bits  and 
because  it  forgets  the  last  byte  transferred;  it  knows 
the  current  byte  only.  Hence,  the  interface  requires 
that  one  of  the  bytes  be  latched  until  the  full  16-bit 
word  has  been  assembled. 

The  slave  mode  of  operation  works  almost  the 
same  as  the  peripheral  bus  master  mode.  The  single 
exception  is  the  slave  write  operation.  When  the 
interface  is  defined,  the  designer  must  make  a  con- 
scious choice  about  which  byte  (upper  or  lower)  to 
latch  during  peripheral  read  operations  (or  con- 
\  ersely,  slave  peripheral  write  operations).  Once  this 
decision  has  been  made,  the  CPU  must  always  access 
the  latched  data  byte  first  (during  a  slave  write)  and 
then  access  the  non-latched  byte  to  complete  the 
transfer.  This  restriction  is  minor,  requiring  no  extra 
software  overhead.  It  could  affect  the  ease  of  the 
programmer's  coding  if  not  handled  properly,  how- 
ever. For  example,  if  the  programmer  used  a  com- 
piler to  generate  the  software  for  the  system,  extra 
care  may  be  necessary  to  ensure  the  compiler  gener- 
ates the  correct  addressing  sequence. 

An  alternative  solution  would  be  to  latch  both  the 
upper  and  lower  data  bytes.  In  this  case,  however, 
the  cost  of  the  interface  would  increase,  as  would 
the  complexity,  with  no  appreciable  gain.  The  con- 
trol flow  in  these  designs  derives  from  two  differ- 


ent sources:  the  state  control  flow  itself  and  the  16-bit 
peripheral  interfacing  with  the  8-bit  bus.  A  state  dia- 
gram can  be  used  to  specify  how  uninterrupted  word 
transfers  will  occur  and  how  the  upper  and  lower 
byte  address  is  generated. 

In  addition,  the  specific  bus  timing  of  the  periph- 
eral and  the  data  bus  must  be  examined  to  quantify 
the  state  control  flow.  These  timing  specifics  also 
provide  information  on  data  latching,  read/write 
control  strobes,  and  addressing  to  and  from  the 
peripheral.  The  state  control  flow  is  divided  into  four 
operations:  bus  master  read,  bus  master  write,  slave 
read,  and  slave  write. 

For  a  bus  master  read/write  operation  from  a 
16-bit  peripheral  device  operating  on  an  8-bit  bus, 
four  control  signals  must  be  generated  by  the 
8/16-bit  control  unit:  address  bit  0  (AO),  peripheral 
hold  (WAIT),  bus  read  (RD),  and  bus  write  (WR). 
The  AO  line  is  generated  by  the  8/16-bil  control  logic 
to  indicate  which  byte  is  to  be  transferred  in  bus 
master  modes  only.  Otherwise,  the  AO  generated  by 
the  system  is  used  to  indicate  which  byte  is  being 
accessed.  The  WAIT  line  holds  up  the  peripheral 
during  transfers.  The  RD  and  WR  lines  are  required 
to  indicate  successive  transfer  cycles  on  the  bus. 

Hidden  transfers 

The  peripheral's  signals  will  only  strobe  active 
once  because  it  does  not  know  that  two  transfers  are 
being  executed.  The  slave  transfer  flows  are  almost 
identical,  except  the  CPU  is  generating  the  bus  sig- 
nals and  the  transfer  directions  are  reversed  (ie,  a 
bus  write  goes  into  the  peripheral). 

For  this  16-  to  8-bit  data  flow  example,  the  data 
on  the  upper  byte  only  needs  to  be  latched  when  data 
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In  addition  to  a  slale  flow 
diagram,  a  liming  diagram 
can  be  used  10  describe  such 
data  read/wrile  operations  as 
a  master  bus  read. 
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The  16-to  32-bit  conversation  logic  diagram  indicates 
the  complexity  or  the  bus  and  tunnel  logic  control,  it 
must  convert  between  different  signal  conventions 
and  polarities  as  well  as  generate  extra  functions  and 
bus  arbitration  control  signals. 


is  being  read  (as  bus  master)  or  written  (as  a  bus 
slave).  An  interface  to  handle  this  operation  needs 
to  latch  data  coming  from  the  8-bit  data  bus  into 
the  peripheral,  it  also  needs  to  act  as  transceiver 
when  the  peripheral  is  sending  data  out  to  the  system. 
A  device  wilh  a  clocked,  tri-state  output  that  has  an 
X-bit  wide  latch  in  one  direction  and  a  tri-statc  trans- 
ceiver in  the  other  direction  would  be  ideal  lor 
accomplishing  such  an  interlace. 

The  Am2952  8-bit  bidirectional  I/O  port  provides 
a  good  enough  match  to  the  logic  and  allows  the 
upper  data  bus  latch  and  upper  data  transceiver  chips 
to  be  combined  on  one  IC.  It  provides  two  8-bit 
clocked  I/O  ports,  each  with  tri-statc  output  con- 
trols and  individual  clocks  and  latch  enables.  An 
Am2y49  bidirectional  bus  transceiver  completes  the 
.logic  required  lor  the  data  path  function. 

The  stale  How  control  requires  logic  that  can  move 
sequentially  from  state  to  state,  hold  in  a  particular 
state,  and  be  reset  or  initialized  back  to  a  predefined 
state.  Depending  on  the  number  of  states  required 
(generally  less  than  16  distinct  states  for  a  design  of 
this  complexity),  a  3-  or  4-bit  counter  should  be  able 
to  solve  the  problem  nicely. 

C  onsiderable  bus  control  logic  is  required  to  gen- 
erate the  data  path  flow  logic  and  the  bus  control 
signals.  This  is  especially  true  if  the  peripherals  and 
CPUs  use  different  signal  conventions  (eg,  when  AS, 
DS,  and  R/W  use  address  latch  enable,  RD,  and 
WR),  Conversion  from  one  signal  convention  to 


another,  changes  in  signal  polarity,  and  provision 
for  extra  functions  (such  as  generating  AO)  require 
a  lot  of  logic  synthesis  ability.  I  f  the  peripheral  has 
bus  master  capability,  such  additional  information 
as  bus  arbitration  controls  must  be  led  into  the  next 
state  determination  logic  in  order  to  decide  what  con- 
trol sequence  to  follow. 

Customized  interface  minimizes  cost 

An  8/16-bil  control  interface  between  the  AmX052 
CRT  controller  and  an  X-bit  CPU  provides  a  good 
example  of  how  customizing  a  general  interface  can 
reduce  costs.  (  The  CRT  controller  is  designed  with 
a  16-bil  data  interface.)  The  onboard  DMA  unit 
fetches  data  from  system  memory  and  the  CPU  polls 
the  CRT  controller's  internal  status  and  control 
registers.  Because  the  CRT  controller  does  not 
modify  system  memory,  however,  a  bus  master  write 
operation  is  unnecessary.  Thus,  there  is  no  reason 
to  generate  a  system  write  control  signal  (WR). 

In  addition,  the  control  and  display  information 
must  be  aligned  on  word  boundaries.  This  require- 
ment relieves  the  X/ 16-bit  control  logic  from  tunneling 
the  bytes  and  performing  odd/even  byte  transfers. 
It  also  saves  control  inputs  from  the  CRT  controller 
because  all  transfers  are  words;  that  is,  no  need 
exists  for  upper  and  lower  data  strobes  or  byte  high 
enable  inputs. 

The  bus  master  read  operations  are  standard  16-bit 
data  transfers  divided  into  two  X-bit  transfers.  The 
CPU's  slave  accesses  are  cither  pointer  writes  (to 
select  the  desired  control/status  register)  or  16-bit 
data  read/write  operations.  (Pointer  write  operations 


I  he  slate  mat-hint-  and  the  bus  and  larch  controls 
hase  hi  be  coupled  in  order  in  transfer  data  between 
the  R-  and  16-bil  buses.  This  generalized  machine  is 
designed  with  the  assumption  that  the  peripheral  has 
bus  master  capability.  If  ibis  is  not  the  case,  the 
design  can  he  greatlt  simplified. 
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arc  actually  8-bit  operations  because  only  the  lower 
8  bits  of  the  data  form  the  register  address.)  The  bus 
master  read  operation  can  be  represented  by  a  state 
flow  diagram  or  a  timing  diagram.  Conceptually, 
state  How  diagrams  arc  easier  to  understand,  but  tim- 
ing diagrams  usually  convey  more  information. 
Other  state  flow  diagrams  can  be  derived  directly 
from  the  liming  diagrams  of  the  CRT  controller  to 
8-bii  interface. 

Simplifications  allow  synthesis  on  one  device 

Two  special  conditions  must  be  met  in  the  state 
machine  implemented  in  ihe  8/16  interface.  Kirst, 
before  a  new  transfer  cycle  is  attempted  (when  the 
state  machine  is  waiting  in  t he  initial  siale,  SO), 
memory  acknowledge  (MRDY)  must  be  inactive. 
This  prevents  interference  from  ihe  last  transfer. 

The  second  special  condition  occurs  when  the 
CRT  controller  asserts  the  R/W  line  lo  indicate  a 
write  operation.  Although  ihe  CRT  controller  does 
not  write  data  into  system  memory,  when  il  updates 
the  upper  8  bits  of  the  24-bit  address  latch  ihe  R/W 
line  indicates  a  write  operation  (in  conjunction  with 
AS).  The  CRT  com  roller  is  not  actually  performing 
a  system  data  write,  only  an  address  latch  update. 
The  state  machine,  therefore,  musi  not  siarl  a  bus 
sequence  if  ihe  R/W  line  is  held  aelive  low  by  the 
CRT  controller  during  a  bus  master  operation. 

These  simplifications  in  design  allow  ihe  CR  T  con- 
troller to  8-bii  CPU  control  interface  lo  be  synthe- 
sized in  a  single  Aml'AI.22VIO  programmable  logic- 
array  device.  In  addition,  the  bus  control  signals  are 
converted  from  AS,  DS,  and  R/W  lo  RD  and  WR. 
The  minimum  CR  I  controller  and  bus  control  sig- 
nals lhal  musl  be  generated  are  RI),  AO,  DS,  and 
R/W.  Although  the  CRT  controller  uses  DS  and 
R/W  as  inputs  during  a  bus  master  operation,  the 


PAI.  device  must  convert  the  CPU  RD  and  WR  sig- 
nals lo  DS  and  l/W  for  slave  I/O  operations. 

The  signals  AO  and  RD  are  generated  by  the  con- 
trol logic  when  the  CRT  controller  is  performing  a 
read  access  io  system.  The  WAIT  (or  not  READY) 
signal  to  the  CRT  controller  must  also  be  generated 
by  ihe  control  logic.  The  dala  flow  controls  require 
six  additional  controls  io  load  and  strobe  the  latch, 
and  lo  enable  transceivers  to  pass  dala  lo  and  Irom 
ihe  8-bit  bus.  Theoretically,  4  more  bits  (outputs) 
arc  required  to  represent  all  the  control  stales  needed 
lo  manipulate  the  8/16-bil  control  logic.  This  means 
ihe  design  appears  io  need  14  output  logic  units  in 
a  PAI.  device  lo  perform  ihe  required  task. 

Reducing  ihe  14  output  cells  to  the  10  cells  avail- 
able in  ihe  PAI.  device  requires  a  closer  look  at  the 
timing  and  output  switching  functions.  The  AO  and 
RD  control  lines  are  in  effect  pari  of  the  system  bus 
control  and,  therefore,  cannot  he  multiplexed  easily. 
The  DS  and  R/W  lines  io  ihe  CRT  controller  are 
also  fixed  because  they  must  be  valid  throughout  the 
entire  Iransfer  cycle  as  well. 

This  leaves  6  of  Ihe  10  output  logic  cells  of  the 
PAI.  device  lo  represent  the  remaining  10  identified 
control  lines.  This  method  of  minimization  invokes 
careful  stale  synthesis,  analysis  of  ihe  signal  switch- 
ing functions  during  [he  transfers,  and  utilization 
of  several  control  pins  on  the  CRT  controller.  Bv 
using  Ihe  IJRliQ,  HACK  I ,  BACKO.  CS.  and  C  D 
inputs  to  Ihe  PAI.  device,  wc  can  reduce  the  num- 
ber of  unique  stales  required  lo  K  instead  of  1  5.  T  his 
reduces  the  number  of  logic  cells  required  for  the 
state  machine  from  4  to  3  bits. 

Al  this  stage,  the  design  requires  seven  control  sig- 
nals io  manipulate  the  dala  iransfer  registers  and 
WAIT  line.  I  he  two  latch  enables  (Cl:s  and  CDpJ 
on  the  Am2952  bidirectional  I/O  port  can  be 
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permanently  enabled.  By  controlling  the  clock  signal 
to  the  latches,  the  controls  required  tor  three  pins 
can  be  reduced  to  one.  The  interlace  control  state 
machine  v.  ill  only  use  the  correct  side  of  the  dual 

atches  on  the  bidirectional  I/O  port-. 

The  Am8052  CRT  controller  helps  considerably 
with  its  own  control  bus  interlace.  Two  signals 
provided  by  the  CRT  controller,  TBEN  and  RBEN, 
switch  the  data  transceivers  in  the  correct  direction 
regardless  of  the  type  of  data  transfer  (as  a  bus 
master  or  bus  slave).  When  the  controller  is  a  bus 
master  performing  a  read  operation,  or  when  it  is 
a  bus  slave  undergoing  a  write  operation,  therefore, 

he  RBfcN  signal  is  strobed  to  obtain  the  correct 
polarity.  By  using  this  line,  two  of  the  remaining 
six  control  lines  can  be  eliminated  (REN  on  the 
Am2949  and  OEAS  on  the  Am2952).  Although  the 
TBEN  line  performs  a  similar  function,  it  does  not 
function  correctly  in  a  16-  to  S-bit  multiplexed 
bus  environment. 

Two  of  the  remaining  control  lines  (OE.\s  011  lne 
Am2952  and  10  on  the  bidirectional  bus  transceiver) 
must  be  generated  by  individual  cells  in  the  PAL 
device.  The  two  clock  enables  on  the  Am2952  are 
permanently  enabled.  The  two  Am2952  clocks  are 
lied  together  to  minimize  the  amount  of  logic  re- 
quired in  the  PAL  device  used  to  generate  clock 
strobes  to  the  latches. 

This  leaves  the  design  with  three  logic  cells  and 
lour  output  functions  (the  WAIT  line  to  the  CRT 
controller  and  the  3  state  bits).  Careful  analysis  of 


the  state  flows  and  timing  diagrams  indicates  that 
the  WAIT  line  is  only  asserted  in  4  of  the  X  states. 
A  clever  assignment  of  stale  numbers  to  the  state 
flow  sequence  allows  the,  WAIT  line  to  be  absorbed 
into  the  3  state  encoding  bits.  The  logic  equations 
for  the  Am  PA  1.22V  10  device  can  be  derived  directly 
from  the  liming  diagrams. 

An  unusual  problem  might  occur  when  a  periph- 
eral device  operates  as  a  bus  slave  on  a  smaller  data 
bus,  such  as  a  16-bit  peripheral  to  8-bit  CPU.  During 
the  first  slaw  write  operation,  the  chip  select  CS  is 
enabled  by  the  bus  master  making  the  access.  No 
actual  data— just  the  data  latch— is  strobed  into  the 
peripheral,  however.  After  the  first  byte  of  data  has 
been  written,  the  second  access  causes  the  full  16-bit 
data  to  be  strobed  into  the  peripheral. 

If  the  designer  is  using  a  common  CS  function  to 
both  the  peripheral  and  the  8/16-bit  control  logic, 
the  controller  logic  must  be  designed  not  to  glitch 
or  strobe  any  of  the  control  lines  to  the  peripheral 
(it  must  prevent  DS,  R/W  from  being  enabled,  for 
example).  For  some  peripheral  devices,  glitches  on 
the  control  lines  might  cause  the  register  to  be  written 
accidentally  onto  a  register  that  will  be  overwritten 
in  the  next  write  cycle  anyway.  With  other  periph- 
erals this  might  be  a  catastrophic  event.  Many 
devices  acting  as  bus  slaves  have  write  recovery  time 
requirements  (ie,  a  certain  minimum  interval  between 
consecutive  write  operations).  Glitches  on  the  con- 
trol lines  might  force  the  next  (and  final)  w  rite  oper- 
ation to  be  delayed — or  cause  a  violation  of  the 
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device  specifications.  Glitches  might  evade  anj  spe- 
cial addressing/register  accessing  scheme  used  in  the 
peripheral.  This  might  occur,  for  example,  if  Ihe 
slave  device  requires  the  user  to  write  the  address 
of  the  register  that  was  accessed  immediately  before 
the  register  was  written.  In  this  case,  glitches  or  use- 
less control  strobes  could  wreck  the  sequence. 

The  problem  can  also  be  solved  by  using  two  lines. 
In  this  solution,  one  of  the  lines  would  go  to  the 
peripheral  device  and  the  other  would  connect  to  the 
8/16-bit  controller.  The  chip  select  to  [he  peripheral 
is  activated  each  time  a  slave  read  occurs  (for  both 
upper  and  lower  byte  accesses),  or  when  a  slave  write 
operation  occurs  and  [be  unlatched  8-hil  daia  is 
being  written.  The  chip  select  function  to  the  8/16-bit 
controller  is  chosen  each  time  the  peripheral  is 
selected  normally  (for  slave  read/writes  on  both 
upper  and  lower  8-bit  data  transfers).  This  problem 
is  bypassed  completely  when  two  separate  chip  select 
functions  are  used:  one  for  loading  up  the  Am2952 
latch  during  a  slave  write/read  and  one  to  strobe  the 
Am8052  controller  into  action  when  it  is  needed  by 
the  8-bit  CPU. 

Bus  conversion  maximizes  flexibility 

A  data  bus  and  control  interface  to  an  X088  8-bit 
microprocessor  and  Am9516  16-bit  DMA  controller 
can  be  created  using  four  devices:  an  AmPAL22V10 
for  the  control  block,  a  74LS161  counter  for  the  state 
sequencer,  an  Am2952  bidirectional  I/O  port,  and 
an  Am2949  bidirectional  transceiver. 

This  design  incorporates  certain  simplifications. 
The  DMA  controller  requires  word  accesses  only 
during  command  chaining  and  for  slave  register  ac- 
cesses. The  8/16-bit  data  transfer  interface  for  btis 
master  operations  (ie.  DMA  data  transfer  functions) 
is  handled  automatically  as  a  programmable  option. 
During  slave  write  operations,  the  first  byte  output 
to  the  DMA  controller  must  have  an  odd  address 
and  the  following  second  byte  an  even  address.  Con- 
versely, during  a  slave  read  cycle,  the  first  byte  read 
from  the  DMA  controller  must  be  at  an  even  address 
and  the  second  at  the  ne\t  higher  odd  address. 

f  urthermore,  for  bus  master  operations,  the  sys- 
tem must  use  the  latched  address  line  AO  (I  AO)  from 
the  AmPAI.22VI0  as  its  sole  AO.  Because  [he  logic 
is  already  available,  I  he  system  docs  not  hav  e  (o  pro- 
vide this  function.  I. AO  now  becomes  the  system  ad- 
dress hit  0  with  full  24-mA  drive  capability. 

Deciding  on  a  means  lor  controlling  the  tunneling 
of  [he  data  stream— dial  is,  transforming  16-bit  data 
into  K  bit  daia  and  vice  versa  — was  i he  I'jrsl  step  in 
deriving  ihis  example.  As  mentioned  earlier,  simply 
dividing  each  16-bii  access  into  iwo  8-hil  daia  trans- 
fer cycles  presents  one  way  of  doing  this.  On  out- 
going accesses  ( 16-bii  path  from  lite  DMA  controller) 
during  die  first  cycle,  die  upper  half  of  die  16-bit 
path  is  latched  while  die  lower  half  passes  ill  rough 
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IF  (RESET)  THEN  ARESETI  |, 
This  section  defines  Ihe  wiggles  when  the  Am9516  is  bus  mjstet 

IF      [HLOAI  THEN  ENABLE!  I: 
IF      (/SUJ  '  HLOA'  THEN  BEGIN 
IF  (S|l|  '  /S(0||  THEN 

LAO     =  /CK    BW  +  /BW    AO  ' 

ALE  +  /  BW    LAO    /ALE  : 

ELSE 

LAO     =  BW  +  /BW  AO 

ALE  +  /BW  '  LAO  •  /ALE  ; 

END; 

IF      IHLDAI  THEN 
ICASEI  |S[2:01I 
BEGIN 
l|  BEGIN 

RO    =  /RW  DS 
A     =  /BW  '  /RW  ■  /CK 
WR   =  /BW  •  RW  OS 
C     =  /BW  RW 
WAIT  =  I 
END: 
21  BEGIN 

RD    =  'RW  -  OS 
B     =  8W 
A     =  /BW  /RW 
WR   =   BW  -  RW  '  DS 


END: 


C  =  /BW 
WAIT  =  /BW 

END; 
31  BEGIN 
RD  =  /RW 
B  =  BW 
A  ■  /BW 
WR  =  /BW 
C  =  /BW 
WAIT  =  BW 

END: 
51  BEGIN 
RD    -  /BW 
A     =  /BW 
WAIT  =  BW 

END; 
61  BEGIN 
RD    =  'RW 
A     =  BW 

END. 
7|  BEGIN 
RD    =  /RW 
A      =  RO 

END; 


RW 


DS 
CK 
RO 
RW 
RW 


DS 
/CK 


This  section  defines  the  wigjets  when  the  8088  is  bus  mastei 
BEGIN 

LA0=  AO  •  ALE    SEL  +  LAO  '   ALE  '  SEL 

B  •  LAO    WR  SEL 

A   =  /LAO    WR  •  SEl 

OS  =  A  +  /LAO    RO  '  SEL 

C  =  /LAO  '  RD  '  SEl 

0  =  LAO    RD  SEL 

END; 

END 

this  PI. PI  lilt  implements  an  interface  between  ihe 
S-bil  8088  and  Ihe  16-hil  Am95l6. 
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Programming  the  PAL  and  the  counter 


In  writing  the  Programming  Language  for  Program- 
mable Logic  (PLPL)  file  to  control  the  operation  of 
the  AmPAL22V10  and  the  74LS161  counter,  the 
inputs  to  the  PAL  device  from  the  counter  are  as- 
signed SO,  S1,  and  S2,  respectively.  Then,  it  is  pos- 
sible to  apply  a  "sculptured  design"  technique  to 
the  entire  timing  diagram  (see  figure  in  Panel,  "A 
matter  of  timing")  by  using  the  Case  statement  from 
PLPL.  By  assigning  combinatorial  equations  to  only 
one  binary  partition  or  column  at  a  time  (Case),  the 
designer  can  ignore  all  other  aspects  of  the  design 
for  the  time  being  and  generate  simple  equations 
directly  from  the  timing  waveforms. 

During  clock  time  T1  of  the  Am9516's  word  read 
cycle  the  state  of  the  74LS161  (SO,  S1,  S2)  is  cleared 
to  000  by  the  assertion  of  address  latch  enable 
(ALE).  LAO  is  the  only  output  control  signal  from  the 
CRT  controller  asserted  during  this  period.  This  sig- 
nal is  handled  as  a  special  case,  however.  During 
tirtici  T2  of  the  DMA  controller's  word  read  cycle, 
the  RD  and  WAIT  outputs  from  the  CRT  controller 
must  be  asserted.  This  time  partition  corresponds 
to  the  state  inputs  S2,  St,  S0  =  001.  Therefore,  the 
first  Case  equations  are 

CASE  (S|2:0]l 
BEGIN 
1)  BEGIN 

RD=  /RW'DS   ;  Transform  Control 
;  Signals  /RW  and  DS 
;  into  Intel  /RD 


WAIT=  1 


,  Assert  Wait 
,  unconditionally 


END; 


During  time  T2  of  the  DMA  controller's  byte  read 
cycle,  A  is  the  only  additional  output  not  already 


accounted  for  in  the  Case  statement.  This  signal 
allows  a  byte  of  data  to  flow  through  the  bidirec- 
tional bus  transceiver  into  the  DMA  controller 
during  byte  read  operations.  Some  additional  con- 
straints are  placed  on  this  signal,  however:  it  must 
only  be  asserted  in  time  T2  on  byfe  read  operations 
(the  BAA/  input)  and  it  must  be  delayed  by  a  half 
clock  period  from  the  rising  edge  of  T2  (CK  signal). 
Thus  the  Case  statement  becomes 

CASE  (S[20)) 
BEGIN 
1)  BEGIN 

RD  =  /RWDS  ; 
A=  /BW/RW'/CK       ;  enable  the 
;  receiver 


END; 

_Finally,  by  examining  the  last  time  T2  elements 
(WR  and  C)  during  the  DMA  controller's  byte  write 
cycle,  the  remaining  terms  in  Case  1  are  derived. 
With  the  exception  of  LAO,  the  remaining  equations 
were  developed  in  the  same  fashion.  Clearly,  this 
"sculptured"  technique  is  a  very  simple  and 
methodical  means  for  arriving  at  the  Boolean  re- 
quirements for  a  logic  block. 

As  the  PLPL  listing  shows,  the  signal  LAO  was 
handled  slightly  differently  from  the  previously  dis- 
cussed method.  The  number  of  product  terms  gen- 
erated via  the  Case  statement  made  this  approach 
necessary.  The  number  exceeded  the  upper  limit 
(16  terms)  for  a  programmable  logic  array.  As  a  prac- 
tical matter,  therefore,  it  was  necessary  to  optimize 
this  signal  manually.  However,  it  should  be  noted 
that  this  step  will  not  be  necessary  once  the  fully 
optimized  version  of  PLPL  becomes  available. 


a  tri-state  buffer  onto  the  8-bit  bus.  During  the  sec- 
ond cycle,  the  tri-state  buffer  is  turned  off  and  the 
previously  latched  half  of  the  data  is  driven  onto  the 
bus.  On  incoming  accesses  (8-bit  path  to  1 6-bit  path), 
the  process  is  reversed. 

The  control  mechanisms  that  perform  this  cycling 
depend  on  the  WAIT  and  R/W  signals  passing  to 
and  from  the  DMA  controller,  and  on  the  ability 
to  enable  or  disable  the  latches  and  transceivers  selec- 
tively. The  Am2952  bidirectional  I/O  port  was 
chosen  because  of  its  dual  registers  and  its  flexible 
control.  The  AmPAL22V10  device  was  chosen  to 
match  the  required  number  of  control  pins  and  func- 
tions. Since  the  complexity  of  this  design  requires 
the  use  of  all  of  the  PAL's  I/O  pins  for  control  func- 
tions, however,  it  was  necessary  to  use  a  74LS16I 
counter  to  provide  the  state  sequencer  function. 


Programming  with  PLPL 

It  has  long  been  the  logic  designer's  "art"  to  merge 
the  often  very  different  concepts  and  notations  of  tim- 
ing information  with  Boolean  logic.  Yet,  the  evolu- 


tion of  a  syntax  to  fully  express  this  art  has  taken 
a  long  time.  AMD  recently  developed  such  a  language 
for  programming  the  AmPAI.22Vl0,  however. 

"Programming  l  anguage  for  Programmable 
Logic,"  or  PLPL,  allows  the  designer  to  specify  a 
design  using  multiple  input  formats.  This  specifica- 
tion flexibility  supports  the  variety  of  design 
approaches  necessary  to  express  different  design 
problems  efficiently.  These  formats  range  from  sim- 
ple sum-of-products  Boolean  equations  to  high  level 
constructs.  PLPL  also  supports  the  input  specifica- 
tions for  many  types  of  AND  OR  based  de\  ices,  in- 
cluding all  of  the  current  AMD  programmable  logic 
array  and  PROM  devices. 

PLPL  is  block  struclurcd,  and  includes  the  high 
level  language  constructs  If-Then-Llse.  Case,  and 
l  or;  all  familiar  to  many  programmers  of  the  C"  and 
Pascal  languages.  Macro*,  functions,  constants,  and 
variables  may  also  be  used  in  PLPL.  The  language 
also  facilitates  use,  clarity,  and  self-documentation. 

Such  current  programmable  logic  technology  and 
associated  programming  languu;""-  as  PLPL  allow 
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A  matter  of  timing 


The  complex  AmPAL22V10  design  used  the  accom- 
panying timing  diagram  to  correspond  to  the 
desired  waveforms.  They  are  partitioned  by  the 
respective  binary  state  (or  count)  from  the  counter. 

The  desired  timing  requirements  during  the 
period  when  the  DMA  controller  is  bus  master 
appears  below.  During  time  T1,  address  latch 
enable  (ALE)  is  asserted  by  the  DMA  controller  to 
denote  the  beginning  of  the  cycle;  a  short  time  later, 
an  address  is  driven  onto  the  bus.  This  address  is 
valid  at  the  falling  edge  of  ALE.  The  control  signal 
LAO  (latched  AO),  therefore,  must  be  valid  at  this 
time,  as  well.  In  this  phase  of  the  cycle,  it  must  also 
be  high  to  enable  the  odd  byte  from  memory  to  be 
loaded  into  the  bidirectional  I/O  port.  In  addition, 
the  assertion  of  ALE  performs  the  function  of  reset- 
ting the  74LS161  counter  to  0000  in  order  to  syn- 
chronize the  cycle. 

During  time  12,  the  DMA  controller  will  assert  its 
DS  signal.  The  timing  for  this  signal,  in  conjunction 
with  the  RAW  signal  (asserted  in  T1)  must  be  trans- 


formed into  an  8088-equjvalent  RD  signal.  During 
a  word  read  cycle,  this  RD  signal  also  must  be  arti- 
ficially negated  and  then  reasserted  to  accomplish 
a  double  byte  read.  At  the  same  time,  the  DMA  con- 
troller must  be  "parked"  in  order  to  multiplex  or 
assemble  a  word.  Thus,  the  WAIT  signal  is  also 
asserted  at  time  T2.  During  time  TW  (S2,  S1, 
SO  =  010),  the  receiver  clock  enable  control  signal 
B  must  be  asserted  in  order  to  allow  the  next  sys- 
tem clock's  rising  edge  to  strobe  the  upper  byte  into 
the  bidirectional  I/O  port.  This  is  accomplished  dur- 
ing the  next  TW  period  (S2,  S1,  SO  =  011).   

During  the  remainder  of  the  word  read  cycle,  RD 
is  negated  and  then  reasserted  after  LAO  has  been 
forced  low  to  address  the  even  byte.  A  is  then 
asserted  to  allow  both  the  previously  latched  upper 
byte  and  the  current  lower  byte  to  be  driven  onto 
the  DMA  controller  s  pins.  And  finally,  the  WAIT  sig- 
nal is  negated,  allowing  the  DMA  controller  to  finish 
its  read  cycle  by  strobing  in  the  16  bits  of  command 
data  on  its  data  pins. 
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highly  organized  application-oriented  control  blocks 
to  be  formed  easily.  These  tools  can  conceptually 
raise  the  designer  above  the  details  of  the  design  at 
the  logic  level  and  directly  translate  the  necessary 
response  characteristics  from  a  timing  diagram. 

This  approach  can  be  referred  to  as  a  "sculptured 
design"  technique  because  it  is  analogous  to  the  way 
solid  stone  is  formed  according  to  an  artist's  image. 
Raw  logic  can  be  transformed  directly  into  useful  con- 
trol functions  from  the  desired  timing  information. 


The  AmPAL22V10  is,  in  essence,  a  fuse-program- 
mable gate  containing  up  to  22  inputs  and  10  out- 
puts. It  can  define  and  program  that  architecture  of 
each  output  on  a  pin  by  pin  basis.  Thus,  the  designer 
is  free  to  optimize  the  design  mix  between  registered 
and  combinatorial  functions  as  needed. 

The  AmPAL22V10  is  programmed  by  opening  fus- 
able  links  in  any  or  all  of  its  10  output  macrocells, 
as  well  as  in  its  AND  gate  array.  The  AND  gate  struc- 
ture is  very  similar  to  other  PAL  devices;  therefore 
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ii  allows  the  same  powerful,  yet  familiar  features. 
However,  it  is  the  AmPAL22V10's  10  output  logic 
macrocells  that  give  the  designer  substantial  new 
design  freedom.  Moreover,  at  each  macrocell  out- 
put is  a  tri-state  output  buffer  controlled  by  a 
separate  output-enable  AND  gate. 

These  macrocells  provide  the  AmPAL22V10's  key 
features.  They  can  be  configured  to  make  any  or  all 
of  the  I/O  pins  act  cither  in  sequence  or  in  combi- 
nation and  have  either  active-high  or  active-low  char- 
acteristics. Furthermore,  the  output  enables  can 
individually  control  the  direction  of  the  pins  so  they 
act  as  outputs,  inputs,  or  bidirectional  ports. 

A  number  of  trade-offs  and  limitations  are  appar- 
ent in  a  design  that  so  dramatically  affects  the  input 
and  output  of  the  system.  The  most  obvious  limita- 
tion stems  from  under  utilization  of  16-bit  periph- 
erals on  an  8-bit  bus — the  speed  of  all  I/O  operations 
are  cut  in  half.  As  a  result,  bus  utilization  will 
increase  if  the  16-bit  peripheral  represents  a  signifi- 
cant factor  of  the  bus  use.  A  CRT  controller  such 
as  the  Am8052  might  use  5  to  10  percent  of  the  bus 
bandwidth  for  display  information  when  using  16-bit 
I/O.  Converting  to  8-bit  I/O  would  double  bus  use 
to  10  to  20  percent.  Another  factor  that  might  affect 
the  bus  usage  is  the  efficiency  of  the  8-  to  16-bit  con- 


version control  logic.  If  the  state  machine  designed 
to  perform  the  8/16-bit  (or  16/32-bit)  conversion  is 
improperly  designed,  extra  transfer  overhead  might 
be  introduced.  This  might  mean  a  sequential  transfer 
of  two  8-bit  values  would  take  twice  as  long  a  single 
16-bit  transfer. 

The  design  constraints  might  limit  the  use  of  the 
peripheral  to  byte-only  operations  during  data  trans- 
fers (as  in  the  design  using  the  DMA  Am9516  con- 
troller), and  slow  it  down  by  a  factor  of  two  during 
command  operations.  For  such  a  DMA  device  as  the 
Am9516,  the  extra  time  required  for  command  fetch- 
ing is  not  usually  a  significant  portion  of  bus  time. 

System  designers  will  have  to  weigh  the  cost  of 
the  extra  overhead  on  a  case-by -case  basis.  The  ben- 
fits  may  well  justify  these  limitations — particularly 
when  the  bus  is  self-limiting,  but  the  device  charac- 
teristics allow  for  value-added  designs.  In  addition 
to  bus  degradation  lor  certain  configurations,  extra 
logic  and  design  effort  are  involved.  Most  interfaces 
outside  a  system's  immediate  family  require  some 
kind  of  extra  interface  logic,  however.  By  manipulat- 
ing the  signals  and  incorporating  them  into  program- 
mable logic  devices  such  as  the  AmPAL22V10  device, 
therefore,  most  of  this  logic  is  free. 


Programmable  Logic  Chip 
Rivals  Gate  Arrays  in  Flexibility 


Hanging  a  user-customized  macrocell  at  each  output  of  an  LSI-level 
fuse-programmable  array  logic  gives  it  the  flexibility  of  a  gate  array. 
The  result:  Less  hardware  is  needed  for  high-level  functions. 


Raprlfrt  with  permission  from    ELECTRONIC  DESIGN 


Designers  of  semicustom  ICs  have  long  appre- 
ciated the  in-house  programming  capabilities  of 
programmable  array  logic  devices.  But  gate  arrays, 
with  their  LSI  densities  and  wider  opportunities  for 
customization,  are  an  inviting  alternative,  even  if 
they  must  be  sent  out  for  metal-mask  processing. 
Which  approach  should 
the  designer  use? 

An  agonizing  choice  be- 
tween convenience  and 
versatility  is  no  longer 
necessary.  The  first  fuse- 
programmable  chip  capa- 
ble of  implementing  LSI 
circuits  blends  the  advan- 
tages of  both  types.  The 
result  is  more  cost- 
effective  designs  with 
higher-level  functions 
than  previous  pro- 
grammable logic  devices 
offered  and  a  shorter 
turn-around  time  than  is 
possible  with  gate  arrays. 

The  new  chip,  the 
AmPAL22V10,  can  be 
thought  of  as  a  fuse- 
programmable  gate  array, 
since  at  twice  the  density  of  previous  programmable 
array  logic  devices,  it  replaces  the  equivalent  of  log- 
ic circuits  having  500  to  1000  gates  (see  "Blending 
Programmable  Logic  and  Gate  Arrays,"  p.  97).  In 
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fact,  because  of  its  flexible,  programmable  architec- 
ture, the  chip  spans  the  breadth  of  standard  SSI, 
MSI,  and  present-generation  PAL  devices,  often  ac- 
complishing what  could  not  be  practically  done  with 
several  such  chips. 
Designers  program  the  22V10  by  opening  fusable 
links  in  any  or  all  of  its  10 
output  macrocells,  as  well 
as  in  its  AND-gate  array, 
developing  the  needed  log- 
ic functions.  Though  the 
AND-gate  structure 
makes  the  unit  similar  to 
other  PAL  devices,  the 
chip's  10  output  logic  mac- 
rocells afford  a  substan- 
tial new  degree  of  design 
freedom  (Fig.  1). 

What's  more,  at  the 
chip's  output  pins  are  10 
three-state  output  buff- 
ers, each  fed  by  a  mac- 
rocell and  controlled 
through  a  separate  out- 
put-enable AND  gate. 

The  macrocells  provide 
the  22V10's  key  features: 
They  can  be  configured  to 
make  any  or  all  of  the  I/O  pins  act  either  sequen- 
tially or  combinatorially  and  have  either  active 
highs  or  active  lows.  Additionally  the  Output  En- 
able can  individually  control  the  I/O  terminals  to 
act  as  output  pins,  input  pins,  or  bidirectional  ports. 
Thus  the  designer  can  readily  change  the  chip's  ar- 
chitecture. 

Within  the  array,  each  AND  gate  is  fed  by  the 
true  and  complementary  levels  of  the  chip's  12  input 
lines,  one  of  which  doubles  as  a  clock  line.  Also,  each 
AND  gate  is  fed  by  the  true  and  complementary 
levels  of  the  10  feedback  outputs  from  the  output 
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logic  macrocells. 

Each  of  the  AND  gates  therefore  has  44  inputs, 
and  there  are  a  total  of  132  AND  gates  in  the  array. 
The  result  is  over  5800  fuses  that  can  be  pro- 
grammed to  perform  the  desired  function. 

A  total  of  120  of  the  input  AND  gates  are  distrib- 
uted to  drive  10  OR  gates,  forming  the  logical  part 
of  the  AND  array.  In  addition,  one  AND  gate  is 
assigned  to  control  each  of  the  10  three-state  out- 
puts, and  the  remaining  two  gates  activate  synchro- 
nous register  presets  and  asynchronous  register 
resets. 

The  array  itself  consists  of  10  AND-gate  macro- 
cells  (five  similar  pairs),  each  logically  ORing  the 
outputs  of  anywhere  from  8  to  16  AND  gates.  The 
AND  gates  are  distributed  so  that  the  two  out- 
ermost macrocells  in  the  array  logically  OR  to- 
gether eight  AND  gates.  The  next  two  outermost 
macrocells  logically  OR  together  10  AND  gates;  the 


1.  A  fuse-programmable  LSI  device  combines  the  efficiency 
of  programmable  logic  arrays  with  the  flexibility  of  gate 
arrays.  Ten  user-definable  output  macrocells  allow  the 
designer  to  set  the  logic  outputs  individually  for  one  of  four 
configurations.  They  can  even  be  used  as  inputs  or 
dynamically  controlled  I/O. 


next,  12  AND  gates;  and  so  on,  traveling  inward 
along  the  array.  In  other  words,  moving  toward  the 
array's  center,  two  AND  gates  are  added  to  succes- 
sive pairs  of  macrocells,  ending  with  the  two  inner- 
most macrocells,  each  ORing  16  AND  gates  apiece. 

The  variable  distribution  of  AND  gates  is  partic- 
ularly useful  when  implementing  counting,  ex- 
clusive-OR  functions,  and  complex  state  machines. 
For  example,  a  10-bit  binary  counter  has  only  one 
AND  gate  on  the  LSB  but  requires  an  additional 
gate  for  each  succeeding  bit,  for  a  total  of  10  AND 
gates  to  implement  the  MSB.  Should  a  hold,  parallel 
load,  or  specialized  function  also  be  desired,  the 
MSB  requires  even  more  AND  gates. 

To  program  the  array  into  its  familiar  sum-of- 
products  Boolean  function,  a  designer  blows  the  cor- 
responding fuse  links.  Those  links  initially  con- 
nected the  chip's  pins  and  their  complements  to  all 
132  of  the  44-input  AND  gates.  As  mentioned,  the 
designer  can  also  program  any  OR-gate  output  to  be 
a  dedicated  input  or  a  dynamically  controlled  I/O 
pin.  This  is  possible  because  the  connection  from  an 
output  macrocell  to  its  output  pin  is  through  a 
three-state  output  buffer  and  is  controlled  by  an 
output-enable  AND  gate.  Turning  off  the  connec- 
tion turns  the  output  pin  into  an  input.  Otherwise 
the  pin  remains  connected  to  the  output  macrocell, 
fed  by  the  OR  gate. 

An  output  with  a  difference 

The  chip's  output  section  is  where  it  appears  more 
like  a  gate  array  than  a  PAL  device.  The  output 
macrocells  allow  the  designer,  by  individually  set- 
ting the  outputs  to  be  either  sequentially  or  combi- 
natorially  and  either  active  high  or  active  low,  to 
program  either  the  true  or  complementary  version 
of  a  logical  term— whichever  makes  the  most  effi- 
cient use  of  the  chip's  AND  array  and  fits  the  chip 
to  the  application.  Furthermore,  independent  con- 
trol over  the  active  state  helps  the  designer  conform 
to  common  logic  conventions,  thus  avoiding  con- 
fusion and  error. 

For  example,  certain  functions,  such  as  chip  se- 
lects and  resets,  tend  to  be  active  low.  The  output 
macrocells  avoid  contradictions  with  convention  by 
inverting  outputs,  individually  changing  them  when 
necessary  to  conform.  Finally,  by  individually 
selecting  either  sequential  or  combinatorial  out- 
puts, the  macrocells  impart  to  the  chip  the  capacity 
for  implementing  more  functions  than  is  possible 
with  existing  PALs. 

The  macrocell  (Fig.  2a)  contains  three  main  logic 
blocks:  a  rising-edge-triggered  D-type  flip-flop 
with  asynchronous  reset  and  synchronous  preset  in- 
puts, a  four-to-one  output-path  selection  multiplex- 
er, and  a  two-to-one  feedback-path  selection  multi- 
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control  the  four  possible  output  configurations:  K„ 
selects  sequential  or  combinatorial  operation;  l'„  se- 
lects active  low  or  active  high. 

With  both  fuses  intact,  the  output  is  sequential 
and  active  low  (Kin  2b).  The  y  output  of  the  flip- 
flop  passes  to  an  inverting  output  buffer,  and  the  Q 
output  feeds  back  to  the  AND  array  internally.  This 
configuration  is  particularly  suitable  for  state- 
machine  designs.  Alternatively,  by  programming 
(blowingl  P„,  the  designer  can  make  the  output  ac- 
tive high. 

On  the  other  hand,  by  programming  only  R„,  the 
designer  makes  the  output  combinatorial  and  active 
low.  For  this  scheme,  the  OR-gate  output  bypasses 
the  flip-flop  and  feeds  the  inverting  output  buffer 
directly.  At  the  same  time,  the  feedback  multiplexer 
passes  the  output  back  into  the  array  (Fig.  2c). 

By  programming  both  fuses,  the  designer  makes 
the  output  combinatorial  and  active  high,  and, 
again,  the  feedback  multiplexer  passes  the  output 
back  to  the  array.  This  configuration  serves  best  for 
"glue"  logic  and  for  generating  complex  control 
functions. 

Finally,  when  the  device  is  programmed  in  the 
combinatorial  configuration,  the  programmable 
Output  Enable  can  be  used  to  transform  an  output 
pin  into  a  dedicated  input  or  a  dynamically  con- 
trolled I/O  terminal  Once  enabled  as  a  dedicated 


nii>uL,  iiwwi-vi-i ,  a  pm  .1  uutpuL  luncuon  is  sacrmceu. 
This  capability  is  extremely  valuable  when  par- 
titioning functions  into  the  device.  The  limitation  is 
no  longer  the  numbers  of  input  devices,  but  the  total 
number  of  device  pins.  Lastly,  as  a  dynamically  con- 
trolled I/O,  the  pin  serves  both  functions,  meeting 
most  bus  requirements. 

Intelligent  use 

One  of  the  most  challenging  jobs  in  designing 
microprocessor  systems  is  installing  intelligent  pe- 
ripheral controllers— that  is,  controllers  that  can 
transfer  data  themselves  and  so  unburden  the  pro- 
cessor from  inefficient  polling  operations.  One  solu- 
tion is  to  build  a  custom  intelligent  controller  from 
scratch  using  the22V10  (Fig.  3).  This  controller  gov- 
erns data  flowing  between  a  microprocessor  and  up 
to  eight  peripheral  devices,  all  conforming  to  the 
Small  Computer  Systems  Interface  standard  (SCSI, 
the  ANSI  X3T9.2/82-2). 

The  interface  consists  of  an  8-bit  bidirectional 
data  port  (DB7-DB[,);  three  control  outputs- 
Acknowledge  (ACK),  Select  (SEL),  and  Reset 
(RST);  and  five  control  inputs-Request  (REQ), 
Control/Data  (C/D),  Message  (MSG),  Busy  (BSY|, 
and  Input/Output  (I/O).  There  ate  additional 
signals,  unrelated  to  the  SCSI  standard.  They  in- 
clude several  microprocessor  controls,  such  as  ad- 
dress lines  A,  and  A2  for  setting  the  data's  desti- 


Blending  programmable  logic  and  gate  arrays 


Centered  on  a  fuse-programmable  AND  array,  the 
architecture  of  a  PAL  (programmable  array  logic) 
device  offers  designers  a  powerful,  instantly  mod- 
ifiable logic  building  block.  As  a  result,  PAL  devices 
are  used  in  a  wide  variety  of  applications,  ranging 
from  microprocessor-based  systems  to  supermini- 
computers. 

Since  gate  arrays  are  not  programmed  in  house, 
they  have  a  much  slower  turnaround  time  and  higher 
cost.  Yet  gate  arrays'  simple  logic  structure  makes 
them  easier  to  customize  than  PAL  devices,  and  their 
larger  size  makes  for  denser  logic  structures. 

A  gate  array's  simpler  structure  can  be  a  draw- 
back, however.  For  example,  typical  gate  arrays  are 
based  on  just  two-  or  three-input  gates;  therefore,  to 
implement  complex  functions,  they  often  require 
many  gate  levels,  which  results  in  slow  logic  struc- 
tures. In  contrast,  PAL  devices,  like  memories,  make 
more  efficient  use  of  their  silicon  area  and  can  be 
configured  into  fast  logic  structures.  Consequently, 
even  with  a  slight  overhead  for  programming  cir- 
cuitry, PAL  densities  are  beginning  to  rival  those  of 
gate  arrays. 

However,  innovative  circuit  techniques  and  an 
oxide-isolated  bipolar  process  team  up  in  the 
AmPAL22V10  ;>nd  combine  the  advantages  of  PAL 


devices  and  gate  arrays.  For  example,  the  pro- 
grammable output  macrocells  are  designed  to  add 
only  one  Schottky  diode  in  the  delay  path.  As  a 
result,  the  chips  selected  for  high  speed  have  a  worst- 
case  input-to-output  propagation  delay  and  setup 
time  of  only  25  ns,  as  well  as  a  worst-case  clock-to- 
output  delay  of  a  speedy  15  ns,  over  the  commercial 
temperature  range.  Also,  the  oxide-isolated  process 
fits  the  chip's  ability  to  implement  complex  func- 
tions into  a  24-pin,  0.3-in.-wide  package. 

For  convenience,  a  feature  called  Preload  loads  the 
chip's  internal  registers  with  an  arbitrary  value,  so 
that  the  designer  can  perform  a  complete  post- 
programming  functional  test.  In  fact,  Preload  is  a 
necessity  for  testing  state  machines,  where  the  suc- 
cessive states  depend  on  past  and  present  values,  as 
well  as  on  the  inputs.  Without  Preload,  complex  and 
sometimes  impossible  sequences  would  have  to  be 
entered  before  the  first  test  could  be  performed. 

What's  more,  platinum  silicide  fuse  technology 
makes  programming  faster  and  increases  reliability, 
and  a  well-controlled  melting  rate  yields  large,  sta- 
ble, nonconductive  gaps.  Address  circuitry,  much  like 
that  of  a  PROM,  blows  one  fuse  at  a  time,  and  pro- 
gramming is  done  with  an  algorithm  that  is  easily 
adapted  to  a  wide  variety  of  available  machines. 
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nation,  read  and  write  controls  (IORD  and  IOWR), 
Interrupt  Request  (INTREQ),  Chip  Select 
(CSSCSIl,  System  ClockjSYSCLK),  and  buffer  and 
latch  enables  (BE  and  LEl  for  passing  bus  data  be- 
tween the  processor  and  its  peripherals. 

In  operation,  the  controller  assigns  one  of  eight 
peripheral  devices  to  be  a  master  by  activating  SEL 
and  at  the  same  time  sends  the  master's  address  to 
the  SCSI  data  port.. From  then  on,  when  a  device 
notifies  the  controller  that  it  needs  to  transfer  ei- 
ther a  command  or  data,  the  controller  interrupts 
the  host  processor.  In  turn,  the  processor  reads  the 
controller's  status  register  to  determine  which  de- 
vice is  requesting  service  and  the  type  of  request  it 
is  making.  After  its  status  register  is  read,  the  con- 
troller removes  the  interrupt  request  and  carries 
out  the  required  transfer. 

Meanwhile  the  selected  device  issues  a  BSY 
signal,  letting  the  other  devices  Know  that  the  bus  is 
in  use.  The  data  transfer  is  managed  by  the  hand- 
shake signals  ACK  and  REQ,  and  the  processor  ac- 
tivates  RST  as  needed  by  pulling  both  WR  and  RD 
low  at  the  same  time. 


Matching  the  chip  to  its  application  starts  with 
an  assessment  of  the  available  resources  against 
those  needed  to  do  the  job.  In  general,  that  means 
establishing  that  the  chip  has  enough  input  and 
output  pins  and  product  terms  to  accommodate  the 
logic  functions  required  by  the  application. 

As  to  the  pinouts,  the  10  outputs  required  by  the 
application  do,  in  fact,  match  the  10  available  on  the 
chip.  Four  of  the  pins  are  used  for  the  bidirectional 
data  bus  and  the  rest  are  dedicated  outputs.  As  for 
inputs,  the  SCSI  controller  requires  11,  which  the 
chip  more  than  satisfies  with  its  total  of  12. 

Next  the  designer  develops  the  product  terms 
needed  to  execute  the  controller's  job  and  compares 
the  number  and  size  of  the  required  terms  with  the 
number  available  from  the  chip.  The  resulting  con- 
troller equations  (see  Table  1 )  number  10,  and  these 
nestle  neatly  into  the  chip's  10  available  AND-gate 
macrocells.  Also,  the  maximum  number  of  terms  in 
any  one  equation— in  this  case,  those  making  up  the 
acknowledgment  function  — is  only  6,  compared 
with  the  chip's  capacity  of  8  to  16.  Thus  the  control- 
ler function  fits  easily  into  a  single  programmable 
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2.  Each  of  10  output  macrocells  (a)  are  independently  fuse-programmed  (or  one  of  four 
configurationa.  Fuae  Rn  control*  whether  the  outputa  are  aequential  or  combinatorial,  and 
fuae  P„  daterminea  whether  they  are  active  high  or  active  low.  With  no  fuae*  blown,  the 
output  is  aequential  and  active  low  (b).  Blowing  only  R„  givea  a  combinatorial  active-low 
output,  and  blowing  only  P„  give*  a  sequential  active-high  output.  Finally,  blowing  both 
fuses  changes  the  output  to  combinetorial  and  active  high  (c). 


aMono,itMcmMemoriBsa 


2-673 


Programmable  Logic  Chip  Rivals  Gate  Arrays  in  Flexibility 


Semicustom  ICs:  Programmable  Logic 


logic  array,  as  can  more  complicated  micro- 
processor peripheral  interfaces. 

The  22V10's  real  power,  however,  becomes  appar- 
ent when  it  is  needed  to  fill  an  application  that 
appears  to  demand  more  than  the  chip's  resources 
can  deliver.  Insufficient  input  or  output  pins  or 
product  terms  are  frequent  shortcomings  of  many 
devices  when  the  job  is  very  formidable.  With  its 
highly  versatile  I/O  architecture,  however,  the 
22V10  can  accommodate  conditions  that  would 
make  chips  even  larger  than  itself  inadequate. 


Table  1.  Peripheral  controller  equations 

Combinatorial  equations 

BE  -  CS  •  HD  •  A2  •  A, 
LE  =  CS  •  WR  ■  A2 
D3  -  A,  •  C  +  A,  •  ACK 
D2  -  A,  ■  M  +  A,  •  SEL 
D,  -  A,  •  B  +  A,  ■  RST 
Do  -  A,  •  I  +  A,  •  R 

Sequential  aquations 

INTREQ  -  R  •  (I  •  C  • M  +  J  •  C  •  M  +  I  •  C  •  M '+  ICM) 

ACK-CS  •  WR  •  A2  ■  A,  ■  D3+  R  ■  ACK  •  I  CS  '  WR  -  *2  -  Al  >  + 
R  •  CS  •  RD  •  A2  •  A, 

SEL  -  CS  •  WR  ■  A2  •  A,  •  D2  +  8  •  SEL  •  (  cs  •  WR  ■  A2  •  A,  ) 

RST  -  RD  •  WR  *  CS  •  WR  •  A2  •  A,  •  D,  + 
"ST  •  (  CS  '  WR  •  A2  •  A,  ) 


3.  The  problem  of  designing  an  intelligent  peripheral 
controller  is  simplified  by  the  22V10's  flexible  I/O  structure. 
With  the  treedom  to  assign  different  architectures  to 
different  pins,  the  designer  can  make  better  use  of  the 
chip's  input  and  output  pins  and  product  terms. 


For  example,  one  of  the  most  complicated  func- 
tions to  perform  with  a  digital  computer  is  floating- 
point arithmetic,  or  more  specifically  the  nor- 
malizing of  numbers  in  the  process.  As  a  result,  no 
standard  MSI  or  LSI  logic  for  normalization  exists. 
Yet  the  programmable  array  is  well-suited  to  inte- 
grating at  least  part  of  this  complicated  logic  func- 
tion. 

Normalization  is  necessary  because  a  floating- 
point number  is  represented  in  two  parts:  a  man- 
tissa and  an  exponent.  The  mantissa  is  a  fractional 
number  that  is  adjusted,  or  normalized,  so  that  a 
binary  1  always  occupies  the  most  significant  bit.  To 
normalize  a  floating-point  number  without  chang- 
ing its  value,  a  compensating  adjustment  must  be 
made  to  the  exponent.  In  effect,  the  exponent,  a 
power  of  two,  is  incremented  for  each  bit  position 
that  the  mantissa  is  shifted  to  the  left. 

Because  the  process  of  normalization  can  be  very 
costly  if  done  wholly  in  parallel,  it  is  more  efficient 
to  process  a  number  one  byte  at  a  time.  Further- 
more, because  of  the  function's  complexity,  it  is  rea- 
sonable to  partition  the  normalization  unit  into  two 
units:  an  execution  unit,  which  acts  on  the  incoming 
data,  and  a  control  unit,  which  directs  the  execution 
unit.  Thus  the  two  units  together  form  a  byte-wide 
floating-point  normalization  (BFN)  unit,  with  the 
programmable  logic  array  making  up  the  execution 
unit  (Fig.  4). 

The  design  of  the  execution  unit  breaks  up  log- 
ically into  the  four  main  instruction  categories;  De- 
tect, Normalize,  Merge,  and  Shift.  Detect  tests  the 
incoming  data  for  a  binary  one  in  any  bit  position. 
When  such  a  nonzero  byte  appears,  a  flag,  F, 
informs  the  control  unit  and  tells  it  to  begin  the 
normalization  process.  Next,  the  Normalize  com- 
mand determines  the  number  of  bit  positions  to 
shift  the  incoming  byte,  in  order  to  put  its  first  bit 
in  the  most  significant  position.  Then  the  Shift  com- 
mand adjusts  the  input  data  to  the  left  by  the  spec- 
ified number  of  bit  positions,  filling  the  lower  bits 
with  Os.  Finally,  the  Merge  command  adds  in  the 
full  numbers'  remaining  bits,  adjusting  them  so 
that  the  entire  number  is  intact  and  shifted. 

From  the  beginning,  even  before  the  equations  of 
the  execution  unit  have  been  fully  developed,  it  ap- 
pears that  the  resources  of  the  programmable  array 
will  be  insufficient  for  the  job  (see  Table  2).  Four- 
teen input  pins  are  needed,  against  the  chip's  12, 
and  12  output  pins  are  needed,  whereas  the  chip  has 
only  10.  Nevertheless,  a  closer  look  at  the  execution 
unit  equations,  with  an  eye  toward  consolidating 
I/O  pins  and  functions,  turns  up  paths  around  the 
seeming  shortages. 

One  "problem"  that  proves  not  to  be  one  is  the 
number  of  outputs.  Although  they  number  13,  not 
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all  are  active  at  the  same  time.  For  example,  the 
outputs  Vu-V2,  which  specify  the  number  of  shift 
positions  as  a  three-bit  binary  number,  are  active 
only  when  no  other  outputs  are.  Thus'they  can  share 
their  output  circuitry  with  other  signals,  relieving 
the  output  shortage. 

However,  combining  the  shift  outputs  with  other 
terms  creates  a  product-term  deficiency.  Yet,  again, 
careful  observation  reveals  a  solution.  One  way  to 
reduce  the  product  terms  of  a  function  is  to  invert  it, 
using  the  programmable  array's  output  macrocell. 
The  most  likely  candidate  for  such  a  reduction  is  the 
function  with  the  most  product  terms.  Although 
there  are  several,  the  best,  because  it  is  relatively 
removed  functionally  from  the  others,  is  the  detec- 
tion flag,  F.  Its  function  is  given  by: 

F  =  nv  +  D.  +  D-.+  D.  +  D.+  D.+  D.  +  D,, 

where  D7  through  D„  are  input  data  bits,  which,  if 
any  are  high,  indicate  a  nonzero  byte. 


When  I)e  Morgan's  theorem  is  applied,  the  long 
OR  string  of  active-high  data  reduces  to  a  single 
AND  term,  saving  a  total  of  seven  product  terms. 
Still,  two  input  pins  must  be  found  for  the  pro- 
grammable array  to  fully  incorporate  the  logic  of 
the  execution  unit.  Since  De  Morgan's  theorem  can- 
not minimize  the  logic  any  further,  the  only  answer 
is  to  locate  any  product  terms  that  underuse  the 
chip's  resources,  combine  them,  and  lend  their  out- 
put pins  to  the  cause  by  making  them  I/O  pins. 

Examination  of  the  instruction  input  specifica- 
tion (see  Table  3)  reveals  that  during  Detect  only 
three  of  the  five  instruction  inputs  are  necessary, 
freeing  the  two  others  to  be  used  as  outputs.  There- 
fore instruction  bits  I,,  and  I,  can  share  the  outputs 
that  were  previously  dedicated  to  shift  bits  V„  and 
V,,  thus  meeting  the  execution  unit's  final  require- 
ments in  a  single  chip.a 


Table  2.  Allocation  of  chip  resources 


Initial 
require- 
ments 


Fnal 
require- 
ments 


Input  pins 


CLK, 
t>7-D0. 

U-lo 


CLK, 

D7-O0, 

I4-I2. 

b'v,, 
Iq/Vq 


Output  pins 


F  {detection  flag). 

V2-V0, 

R7-R0 


l0/V0. 
R7/V2. 
Bo/F, 
R«-R, 


Product  terms 


108  required 


Table  3.  Instruction  definitions 

Instruction 

Detection 

code  (U-lo) 

flsg 

Detect 

ooxxx 

0 

Normalize 

ooxxx 

1 

Merge 

01N2N2N0 

X 

Shift 

10N2N,No 

X 

Data  input  (100  ns).  Exponent  input 

D7-D0  I 


vaiuws.  _ 

V  _V  Exponent 

— I  ,  »  adjustment 

functional 
unit 


T  

Result  output,  Exponent  output 

R,-Ro 


4.  Floating-point  arithmetic  is  a  hardware-intensive  process  that,  for  the  moat  part,  has 
defied  integration.  Yet  by  partitioning  the  tasks  into  smaller  pieces,  a  designer  can 
apply  the  power  of  the  programmable  array  to  the  job.  The  chip's  flexibility 
contributes  to  its  successful  implementation  as  a  floating-point  execution  unit,  even 
though  its  resources  may  seem  insufficient. 
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XOR  PLDs  Simplify  Design  of 
Counters  and  Other  Devices 


XOR  PLDs'  unique  architecture  is  better 
suited  than  that  of  conventional  AND- OR 
PLDs  to  certain  applications.  Once  you  un- 
derstand the  basic  theory  behind  their  oper- 
ation, you  can  easily  configure  them  to  per- 
form the  tasks  of  such  devices  as  binary 
counters  and  video  shift  registers. 

Chris  Jay,  Monolithic  Memories  Inc 

Many  designs — notably  high-speed  counters  for  ac- 
cessing memories,  for  counting  events,  or  for  sequential 
control  in  high-speed  processor  systems — require  state 


machines  that  step  through  their  sequence  of  operation 
by  a  binary  count.  XOR  PLDs  are  ideal  devices  for 
these  types  of  jobs,  but  they  are  rarely  considered  for 
such  applications — perhaps  because  design  engineers 
don't  understand  their  useful,  but  unusual,  architec- 
ture. 

Like  the  more  sophisticated  of  today's  conventional 
devices  (Fig  la),  exclusive-OR  (XOR)  versions  have  a 
registered  architecture  (Fig  lb).  The  distinguishing 
features  of  the  latter  are  two  pairs  of  product  terms 
(multiple-input  AND  gates),  which  drive  two  summing 
terms  (OR  gates),  which  in  turn  feed  two  inputs  of  an 
XOR  gate.  The  combinational  XOR  gate's  output  con- 
nects directly  to  the  D  input  of  the  register.  (For  a 
theoretical  discussion  of  the  advantages  of  XOR  PLDs 
over  AND-OR  PLDs,  see  box,  "Principles  of  XOR  PLD 
operation.") 


INOUTS,  FEEDBACK  AND  I/O 


(a) 


(b) 


INOUTS,  FEEDBACK  AND  I/O 


Fig  I— You  can  easily  distinguish  between  a  conventional  ANDOR  PLD  (a)  and  an  XOR  PLD  (b).  The  latter  has  an  XOR  gate  preceding  the 
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XOR  PLDs  Simplify  Design  of  Counters  and  Other  Devices 


The  distinguishing  features  of  an  XOR 
PLD  are  two  pairs  of  product  terms,  which 
drive  two  summing  terms,  which  in  turn 
feed  two  inputs  of  an  XOR  gate. 


The  applications  described  here  make  use  of  MMI's 
20X8A  and  20X10A  PAL  devices.  These  24-pin  XOR 
PLDs  have  an  improved  22.2-MHz  clock  and  worst-case 
propagation  delay  of  30  nsec  compared  with  older, 
12.5-MHz,  50-nsec  parts. 

XOR  PLDs  bring  benefits  to  counters 

The  prime  use  of  XOR  PLDs  is  in  counters.  Fig  2a 
shows  the  pinouts  and  Fig  2b  the  Palasm  logic  equa- 
tions of  a  10-bit,  loadable,  up/down  counter.  You  can 
design  such  a  counter  into  a  single  20X10A  PAL  device. 
The  counter  has  ten  registers,  Q0  through  Q9,  capable 
of  binary,  up/down  counting.  The  up  control  is  high  for 
an  up  count  and  low  for  a  down  count.  Registers  Ql 
through  Q8  load  synchronously  through  data  inputs  Dl 
through  D8;  the  loading  i&  enabled  by  an  active-low  /LD 
input  (see  box,  "Palasm  notational  conventions").  Be- 
cause of  a  pin  limitation  of  the  package,  Q9  loads  via  the 


up/down-control  pin.  This  pin  (the  D9  input  during 
loading)  controls  the  counter's  up-  or  down-counting 
mode.  Q0  is  always  loaded  with  a  logic  low. 

A  counter  larger  than  10  bits  requires  two  or  more 
XOR  PLDs.  Unlike  MSI  counters,  which  come  with 
carry-in  and  carry-out  pins,  you  must  incorporate  in 
your  design  provisions  for  cascading  XOR  PLDs.  When 
cascading  two  XOR  PLDs,  you  must  provide  a  carry- 
out-enable  signal  from  the  least  significant  counter  to 
the  count-enable  input  of  the  most  significant  counter. 

When  the  least  significant  counter  has  reached  its 
maximum  count,  it  should  enable  the  most  significant 
counter  to  increment  its  count  after  the  next  rising 
edge  of  the  clock  pulse.  However,  time  delays  arising 
from  the  time  needed  to  decode  and  propagate  the 
Carry  signal  could  slow  the  system  down.  To  avoid  this 
delay,  your  design  should  generate  a  look-ahead-carry 
output. 


Fig  2b 
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Fig  2a — This  diagram  shows  the  pin  designations  for  a  20Xl0A  XOR  PLD  configured  as  a  10-bit,  loadable,  up/down  counter.  This  design  is 
not  cascadable. 
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Principles  of  XOR  PLD 

The  configuration  of  an  exclu- 
sive-OR  (XOR)  PLD  simplifies 
counters  and  other  state  ma- 
chines that  progress  through  a 
binary-encoded  counting  se- 
quence. You  do  not,  of  course, 
necessarily  need  a  PLD  with 
XOR  gates  to  design  a  binary 
counter;  you  can  use  the  simpler, 
sum-of-products  (SOP)  architec- 
ture of  a  conventional  PLD  (see 
Fig  la  in  the  main  text)  to 
implement  the  logic  equations 
that  define  a  binary-count 
sequence. 

However,  the  conventional 
PLD  requires  a  larger  number 
and  AND  gates,  or  product 
terms,  for  a  counter's  higher- 
order  registers  than  does  an  XOR 
PLD.  The  Boolean  equations  that 
follow  prove  this  point.  They  are 
required  for  a  generalized 
binary-count  sequence  and  are 
rendered  in  Palasm  notation  (see 
box,  "Palasm,  notation  conven- 
tions"). 

If  QO  is  the  least  significant 
register,  then 

/QO  :=  QO 

This  is  a  straightforward  binary 
function — divide  by  2.  If  Ql  is 
the  nest  significant  register,  it 
requires,  the  Boolean  sum  of  two 
product  terms: 

/Ql  :=  Q0*Q1  +/Q0*/Q1 

For  register  Q2  the  equation 
becomes 

/Q2  ;=  Q2*Q1*Q0  +  /Q2*/Q1  + 
/Q2*/Q0 

which  requires  three  product 


operation 

terms.  From  a  generalized  counter 
willr  N  registers,  the  Nth  regis- 
ter, QN,  requires  N+l  product 
terms: 

/QN  :=  QN*Q(N-1)*  ...  Q1*Q0 
+  /QN*/Q(N-1) 

+  /QNVQN1 
+  /QN*/Q0 

For  example,  the  most  signifi- 
cant register  is  a  10-bit  counter, 
Q9,  would  require  the  summing 
of  10  product  terms,  using  10 
AND  gates,  in  a  convential 
AND-OR  PLD.  Consequently, 
although  you  can  write  abstract 
counter  equations  for  AND-OR 
PLDs,  you  may  not  be  able  to 
find  a  real  PLD  with  enough 
product  terms  to  embody  those 
equations. 

An  XOR  PLD,  on  the  other 
hand,  uses  its  product  terms 
more  economically;  no  matter 
how  significant  the  weighting  of 
the  register  in  a  binary  counter, 
each  register  requires  only  two 
product  terms. 

Before  delving  into  the  de- 
tailed Boolean  equations  that 
support  the  preceding  contention, 
you  should  get  a  feel  for  the 
logic  underlying  the  XOR  PLD. 
Consider  that,  in  a  binary-count 
sequence,  the  polarity  of  a 
register's  output  toggles  one 
clock  period  after  the  state  in 
which  all  of  the  less  significant 
registers  were  a  logic  one. 

Next,  imagine  that  you  have  a 
2-input  XOR  gate  driving  a  reg- 
ister. The  output  of  the  register 
is  fed  back  to  one  XOR  input; 
the  other  input  sees  a  product 


term  comprising  all  the  less-sig- 
nificant digits.  The  XOR  gate 
will  hold  the  register  in  its 
existing  state  until  the  product 
terms  of  the  less-significant 
digits  go  True,  whereupon  the 
XOR  gate  will  toggle  the  output 
of  the  register. 

The  following  is  a  generalized 
equation  for  a  register  (QN)  in  a 
binary-count  sequence  using  the 
XOR  PLD: 

QN  :=  QN  :+:  Q(N-l)*Q(N-2)* 
...  Q2*Q1*Q0 

You  can  consider  the  term 
Q(N-l)*Q(N-2)*... 
Q2*Q1*Q0  as  a  Carry  input  to 
register  QN. 

When  Q(N-l)*Q(N-2)*... 
Q2*Q1*Q0  =  0,  the  register  QN 
is  in  a  hold  condition,  and  QN's 
output  feeds  back  through  the 
fuse  array  and  the  XOR  gate  and 
loads  into  itself.  Only  when 

Q(N-l)*Q(N-2)*... 
Q2*Q1*Q0  =  1 

will  the  XOR  function  invert  the 
polarity  of  QN  and  strobe  the 
inverted  state  into  the  register  in 
synchronicity  with  the  rising 
edge  of  the  next  clock  pulse. 
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Unlike  MSI  counters,  which  come  with 
carry-in  and  carry-out  pins,  you  must  in- 
corporate in  your  design  provisions  for  cas- 
cading XOR  PLDs. 


The  logic  in  the  least  significant  counter  can  decode 
the  penultimate  count  and  feed  the  D  input  of  a 
carry-output-enable  register.  When  the  counter  incre- 
ments to  the  final  count,  the  carry -output-enable  signal 


is  then  clocked  into  the  carry-output-enable  register. 
The  propagation  of  the  carry-output-enable  signal  to 
the  next  counter  stage  is  therefore  coincident  with  the 
generation  of  the  output  of  the  ultimate  count  from  the 
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Fig  3a— Unlike  the 

the  counter  cascadable. 


ov 

illustrated  in  Fig  2a,  this  counter  uses  a  look-ahead-carry -out  technique.  Provision  of  this  signal  renders 
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registers  in  the  least  significant  counter.  By  anticipat- 
ing the  ultimate  count  in  this  fashion,  this  scheme 
avoids  the  propagation  delay  involved  in  decoding  the 
final  count. 

The  counter  designs  specified  by  the  sets  of  logic 
equations  presented  in  Figs  3b  and  3c  use  this 
look-ahead-carry-out  technique.  The  designs  are  cas- 
cadable,  9-bit  up/down  counters;  Fig  3c's  specification 
generates  both  Carry  and  Borrow  signals  on  a  look- 
ahead  basis.  You  can  use  these  specifications  to  create 
an  18-bit  counter  by  routing  the  Carry/Borrow  output 
(CEBEO)  of  the  first  up/down  counter  to  the  count- 
enable  input  (CEI)  of  the  second  up/down  counter  (Fig 
3a).  These  counters  are  capable  of  clocking  at  rates  as 
high  as  22.2  MHz. 

To  cascade  more  than  two  counters  for  high-speed 
counting,  your  design  would  require  a  composite  carry- 
enable  output  to  the  more  significant  counters  in  the 
chain.  You  would  need  to  generate  a  look-ahead-carry 
output  from  only  the  least  significant  counter;  higher- 
order  counters  in  the  chain  would  require  a  count- 
enable  output  generated  from  the  ultimate,  or  final, 
count  rather  than  the  penultimate  count.  Fig  4a  shows 
a  technique  for  cascading  more  than  two  counters  in  a 
synchronous-count  chain.  Depending  upon  the  availa- 


Palasm  notation  conventions 

For  consistency  with  the  design  spercifications, 
the  text  of  this  article  uses  the  Palasm  notational 
conventions  for  Boolean  operators  instead  of  the 
classical  symbols.  Palasm  is  MMI's  PAL  assem- 
bler, which  converts  Boolean-logic  equations  into 
fuse  maps  for  PLDs.  (Ed  Note:  Readers  familiar 
with  other  PLD  compilers  may  be  more  used  to 
the  C-language  conventions  used  in  Cupl  and 
Abel.  One  compiler,  loglCfrom  Kontron,  allows 
you,  thankfully,  to  choose  your  own  favorite  sym- 
bol set.) 

•  /  represents  a  logical  inversion. 

•  *  represents  a  logical  AND. 

•  +  represents  a  logical  OR. 

•  =  represents  a  logical  equality. 

•  :+:  represents  an  exclusive-  OR  (XOR. 

•  :=  represents  the  state-machine  operation 
"Becomes  equal  after  the  rising  edge  of  a 
clock  pulse." 


bility  of  input  pins,  you  could  incorporate  the  required 
gates  into  the  PLD  itself,  as  Fig  4b  shows. 

XOR  PLDs  have  many  applications  outside  of  binary 
counting.  The  example  of  a  video  shift  register  shows 
how  video  applications  can  benefit  from  the 
programmable-polarity  feature  of  the  XOR  gate. 

Fig  5a  shows  a  typical  video  system  using  the 
PAL20X8A.  Fig  5b  gives  the  PLD  design  specification. 
An  XOR  PLD,  when  functioning  as  a  video-shift  regis- 
ter, loads  row  data,  in  parallel,  from  a  character  ROM 
or  RAM.  A  control  input  to  the  XOR  PLD  (reverse 
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Fig  4 — You  can  cascade  two  or  more  XOR-PLD  counters  in  a  synchronous-count  chain  with  either  external  (a)  or  internal  (b)  logic. 
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The  XOR  PLD  series  has  many  applica- 
tions outside  of  binary  counting. 
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Fig  5a — XOR  PLDs  suit  other  applications  besides  counters.  The 

device's  programmable  output  polarity  is  a  vital  part  of  this  design, 
which  uses  an  XOR  PLD  as  a  video  shift  register  that  incorporates  a 
cursor  control. 


Fig  5b 


■■  /DO*LD*/REV*/CURS*/RST 
+  DO*LD*REV*/CURS*/RST 
:+:  /Q1*/LD*/CURS*/RST 
+  RST 


■  /D1*LD*/REV*/CURS*/RST 

+  D1*LD*REV*/CURS*/RST 
:+:  /Q2*/LD*/CURS*/RST 
*  RST 

i  /D2*LD*/REV*/CURS*/RST 
+  D2*LD*REV*/CUHS*/RST 
/Q3*/LD*/CURS*/RST 
+  RST 

■  /D3*LD*/REV*/CURS*/RST 

+  D3*LD*REV*/CURS*/RST 
:+:  /Q4*/LD*/CURS*/RST 


'  /D4*LD*/REV*/CURS*/RST 
+  D4*LD*REV*/CURS*/RST 
/Q5*/LD*/CURS*/RST 
+  RST 

■  /D5*LD*/ REV*/ CURS*/ RST 
+  D5«LD*REV*/CURS*/RST 
/Q6*/LD*/CURS*/RST 
+  RST 

/D6*LD*/REV*/CURS*/RST 
+  D6*LD*REV*/CURS*/RST 
:+:  /Q7*/LD*/CURS*/RST 
+  RST 

/D7*LD*/REV*/CUHS*/RST 
+  D7*LD*REV*/CURS*/RST 
:+:  /LD*/CURS*/RST 
+  RST 


; LOAD  DO 

; LOAD  INVERSE  OF  DO 
;SHIFT  FROM  Ql 
; RESET  QO  LOW 

; 

; CURSOR  LOW  CLEARS  /QO 
; LOAD  Dl 

; LOAD  INVERSE  OF  Dl 

,-SHIFT  FROM  Q2 

; CURSOR  LOW  CLEARS  /Ql 

; RESET  CLEARS  Ql 

r LOAD  D2 

; LOAD  INVERSE  OF  D2 

;SHIFT  FROM  Q3 

; CURSOR  LOW  CLEARS  /Q2 

; RESET  CLEARS  Q2 

; LOAD  D3 

; LOAD  INVERSE  OF  D3 
; SHIFT  FROM  Q4 
; RESET  CLEARS  Q3 
.-CURSOR  LOW  CLEARS  /Q3 
; LOAD  D4 

; LOAD  INVERSE  OF  Di 

; SHI FT  FROM  Q5 

; CURSOR  LOW  CLEARS  /Q4 

; RESET  CLEARS  Q4 

; LOAD  D5 

; LOAD  INVERSE  OF  D5 

; SHIFT  FROM  Q6 

; CURSOR  LOW  CLEARS  /Q5 

; RESET  CLEARS  Q5 

; LOAD  D6 

; LOAD  INVERSE  OF  D6 

;SHIFT  FROM  Q7 

; CURSOR  LOW  CLEARS  /Q6 

; RES ET  CLEARS  Q6 

; LOAD  D7 

; LOAD  INVERSE  OF  D7 
,-SHIFT  HIGH  INTO  /Q7 
; RESET  CLEARS  Q7 


video,  or  REV)  determines  whether  or  not  the  data 
loaded  into  the  register  is  to  be  inverted.  Therefore, 
the  XOR  PLD  can  supply  normal-video  or  reverse- video 
information. 

This  design  also  incorporates  a  cursor  control,  which, 
when  active,  sets  all  of  the  register's  outputs  high.  The 
design  also  has  a  reset  line  to  clear  all  the  registers. 
When  you  release  the  load  input,  the  clock  input  will 
serially  clock  the  data  out  of  register  QO  as  a  video-data 
stream.  The  clock  input  to  the  PLD  determines  the  dot 
rate  of  the  video  information  and  must  come  from  an 
external  dot-rate  generator. 

The  Palasm  logic-equation  files  presented  in  this 
article  omit  comments,  pin  definitions,  and  device- 
simulator  specifications  because  of  space  limitations.  If 
you  want  the  entire  Palasm  file,  you  may  call  the  author 
at  (408)  970-9700. 
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The  PAL20RA1 0  Story— 


Th 


Standard  Product 


Customization  of  a 


Among  the  greatest  challenges  one  faces  when  design- 
ing with  the  new  generation  of  32-bit,  high- 
performance  microprocessors  is  maintaining  both 
byte  and  word  addressability  within  the  microprocessor's 
32-bit-wide  word  format  and  accommodating  both  highl- 
and moderate-speed  memory  components  and  peripherals  in 
the  same  system.  Consider  this  example:  Motorola's  32-bit 
m  croprocessor,  the  MC68020,  has  a  feature  known  as  dy- 
namic bus  sizing.  This  feature  helps  maintain  both  byte  and 
word  addressability  within  the  microprocessor's  32-bit  word 
format.  The  microprocessor  has  internal  logic  that  manipu- 
lates and  directs  words  smaller 
than  32  bits.  However,  the 
MC68020  requires  additional  ex^ 
ternal  logic  to  handle  8-  and 
16-bit  memory  arrays  and  pe- 
ripherals and  to  maintain  byte 
addressability  of  32-bit  memory 
arrays.  Consider  this  as 

well:  The  MC68020  is  available  in  both  12-MHz  and 
16.7-MHz  speed  grades.  The  16.7-MHz  speed  translates  into 
60-ns  machine  cycles  and,  consequently,  into  90-ns  memory 
cycles.  Thus,  the  MC68020  can  keep  up  with  many  of 
today's  fastest  memories.  Moreover,  in  systems  that  use 
both  fast  memories  and  slower,  low-cost  bulk  ones,  the 
16.7-MHz  version  can  interface  with  the  slower  memories 
without  holding  back  the  faster  ones.  But  to  do  this,  the 
MC68020  must  insert  wait  states  to  extend  the  memory  cycle 
for  the  slower  memories. 

Both  design  problems  call  for  additional  circuitry — that 
is,  some  "glue"  logic.  And  because  the  MC68020  employs 
asynchronous  control  signals  and  the  memories  need  syn- 
chronous control,  the  additional  circuitry  must  be  capable 


A  new,  registered,  asynchronous 
PAL  device  originated  with  suggestions 
from  the  manufacturer's 
customers. 


of  both  asynchronous  and  synchronous  operation.  A  prod- 
uct from  Monolithic  Memories,  Inc.,  the  PAL20RA10,  pro- 
vides what  we  believe  is  an  elegant,  single-chip  solution  to 
these  problems. 

The  PAL20RA10  is  one  of  the  most  recently  developed 
members  of  Monolithic's  programmable  array  logic 
product  family.  As  with  all  other  PAL  devices,  it  contains 
an  array  of  programmable  links  that  the  user  can  configure 
to  implement  a  variety  of  logic  functions.  The  PAL20RA10 
is  unusual  in  that  the  customer  not  only  can  configure  the 
internal  logic  transfer  function  but  also  can  implement  logic 
functions  to  control  the  clock, 
the  output  enables,  and  the 
set  and  reset  operations  for  each 
register.  In  addition,  the 
user  can  configure  the  outputs 
to  be  registered  or  nonregistered, 
and  active  high  or  active 
low.  We  will  discuss 
these  features,  as  well  as  a  universal  port  interface  adapter 
application,  in  greater  detail  later  in  this  article. 

PLDs  and  the  semicustom  market 

Programmable  logic  devices,  or  PLDs,  fit  between  stan- 
dard logic  and  customer -designed  circuits  and  thus  are  con- 
sidered semicustom  circuits,  along  with  gate  arrays  and 
standard  cells.  PAL  products  are  a  type  of  PLD.  Discus- 
sions of  semicustom  integrated  circuits  often  center  on 
technological  issues:  process  capabilities,  device  speed  and 
power  dissipation,  number  of  available  gates,  and  so  on. 
But  the  customization  of  a  device  involves  more  than  just 
meeting  the  customer's  functional  requirements.  The  device 


Copyright  ©  1 986  The  Institute  of  Electrical  and  Electronics  Engineers.  Inc. 
Reprinted  with  permission  from  IEEE  MICRO, 
10662  Los  Vaqueros  Circle,  Los  Alamitos,  CA  90720. 


2-683 


The  PAL20RA10  Story— The  Customization  of  a  Standard  Product 


Table  1. 

Logic  devices  rated  by  various  selection 
criteria. 


Standard  Gate   Standard  Full 

Criterion  logic     PLDs  arrays      cells  custom 


Sourcing  and 
availability 

Performance 

Space 
efficiency 

Ease  of 
design* 

Ease  of  design 
changes 

Design 
cost** 

Time  to 
market 

System-level 
mfg.  cost 


*  With  the  exception  of  standard  logic,  this  means  ease  of  use  of  the  design 
tools,  i.e.,  ease  of  understanding  the  design  tool  concepts,  ease  of  doing  the  re- 
quired input,  etc. 

**  Design  cost  is  the  sum  of  the  NRE  (non-recurring  engineering)  costs  and  such 
non-recurring  manufacturing  costs  as  masking,  test  vector  generation,  etc., 
that  are  part  of  the  product  development  cycle. 


should  also  meet  the  customer's  requirements  for  price, 
availability,  risk,  ease  of  use,  and  support.  Only  when  all  of 
these  requirements  have  been  met  has  a  device  been  truly 
customized. 

Within  this  definition,  even  standard  7400  series  logic 
might  be  considered  customized.  It  meets  most  customers' 
requirements  for  low  cost,  availability,  low  risk,  and  high 
ease  of  use,  and  it  provides  just  about  any  desired  function. 

PLDs  and  gate  arrays — semicustom 
alternatives  and  trade-offs 

The  PLD  was  first  conceived  as  a  replacement  for  dis- 
crete TTL  logic  designs.  Customers  wanted  higher  integra- 
tion than  that  possible  with  standard  SSI/MSI  logic,  but 
they  could  not  find  the  functions  they  wanted  in  standard 
LSI  circuits  and  did  not  want  to  pay  for  fully  customized 
ones.  The  PLD  provided  a  new  option— semicustom 
LSI— that  had  the  advantages  of  SSI/MSI  availability  and 
many  of  the  advantages  of  fully  customized  LSI  without 
such  drawbacks  as  high  cost  and  long  development  time. 


Today,  gate  arrays  are  rapidly  becoming  competitive  in 
the  same  applications  as  PLDs.  However,  PLDs  provide 
several  advantages  over  gate  arrays  that  make  them  much 
more  amenable  to  customer  needs  for  many  applications. 

The  primary  advantages  of  PLDs  are  their  immediate 
availability,  the  lower  risk  they  entail,  and  their  lower  over- 
all cost  per  application.  A  design  engineer,  upon  completing 
his  design,  can  immediately  customize  a  PLD  to  his  specifi- 
cations. If  he  discovers  that  he  has  made  a  mistake  in  his 
design,  he  can  correct  it  and  program  another  device.  Since 
he  customizes  a  part  just  before  it  is  used,  he  undertakes  no 
risk  in  committing  to  a  particular  design.  If  he  needs  to 
make  a  change,  he  can  program  a  part  from  inventory  to 
the  new  specification — no  parts  are  wasted.  He  can  program 
the  new  part  in  minutes. 

According  to  a  survey  that  appeared  in  the  January  12, 
1984,  issue  of  EDN,  the  two  greatest  disadvantages  of  gate 
arrays  are  their  high  development  cost  and  the  lack  of  sec- 
ond sources  for  them.  PLDs,  in  contrast,  have  an  extremely 
low  development  cost  and  offer  multiple  sources.  The  short, 
simple  development  cycle  for  PLDs  translates  into  easier 
use,  lower  costs,  shorter  time  to  market,  and  greater  flex- 
ibility. Table  1  rates  standard,  semicustom,  and  full-custom 
integrated  circuits  according  to  various  criteria  used  in 
choosing  an  IC  for  a  design.  A  "1"  indicates  the  best 
choice;  a  "4"  indicates  the  worst. 

Dataquest,  a  market  research  firm,  lists  in  the  December 
28,  1984,  issue  of  its  Nielsen  Dataquest  Research  Newsletter 
("Gate  Array  Impact  on  the  ASIC  Marketplace,"  by  Katy 
Guill  and  Andy  Prophet)  five  needs  of  the  gate  array  user, 
all  of  which  are  met  efficiently  by  PLDs.  The  first  is  for 
quick  delivery  of  prototype  units.  With  PLDs,  prototypes 
are  available  in  minutes  once  the  design  is  complete.  The 
second  need  is  for  easy-to-use  design  automation  tools  that 
require  a  minimum  amount  of  time  to  learn.  PLD  software 
tools  include  assemblers  and  higher-level  languages  that  en- 
able a  design  engineer  to  automatically  convert  whatever  de- 
scription he  has  created  into  a  fuse  plot.  The  PLD  can  then 
be  programmed  as  easily  as  a  PROM. 

The  third  need  is  for  prototype  units  that  work  the  first 
time.  PLDs,  being  standard  parts,  are  fully  tested  during 
production  and  can  be  quickly  verified  after  customization. 
If  a  design  error  has  been  made,  the  design  can  be  quickly 
modified  and  another  prototype  can  be  programmed  in 
minutes.  Using  personal-computer-based  software  and  a 
standard  programmer,  the  design  engineer  can  stay  at  his 
own  desk  and  proceed  to  alter  his  design  on  the  PC,  and 
then  instantly  download  the  new  design  to  the  programmer 
and  the  new  device.  The  device  can  then  be  dropped  into 
the  target  application  and  the  design  verified.  The  ease  with 
which  PLDs  can  be  verified  encourages  experimentation 
with  designs. 

A  fourth  need  is  for  low  development  and  production 
costs.  PLDs  have  extremely  low  development  costs — only 
the  up-front  costs  of  the  software  and  programmer.  Their 
design  cost  consists  simply  of  the  design  engineer's  time. 
The  production  costs  for  PLDs  are  driven  down  by  the  mul- 
tiple sourcing  and  high-volume  production  characteristic  of 
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standard  products.  Gate  arrays  provide  neither  of  these 
advantages. 

The  fifth  need  is  for  convenient  design  centers  and  train- 
ing. The  PLD  design  center  is  very  convenient  indeed — the 
engineer's  personal  computer.  Our  company,  Monolithic 
Memories,  offers  tutorial  aids  in  several  formats.  Moreover, 
the  standard  nature  of  the  PLD  makes  it  easier  to  under- 
stand and  use  than  the  more  customized  and  thus  less 
"friendly"  gate  array. 

There  are  many  other  factors  that  influence  the  choice  of 
semicustom  device  technology,  of  course,  such  as  architec- 
tural efficiency.  The  ease  of  use  of  the  PLD  unfortunately 
leaves  it  much  less  efficient  architecturally  than  other  forms 
of  semicustom  logic.  In  a  gate  array,  the  user  constructs  log- 
ic functions  using  two-input  NAND  gates.  In  a  PLD,  the 
user  constructs  logic  functions  using  higher-level  macrocells. 
Naturally,  there  will  be  more  wasted  silicon  in  the  PLD,  al- 
though its  optimized  macrocell  and  efficient  fuse  array  will 
still  provide  an  effective  structure.  It  is  the  gate  array's  very 
architectural  efficiency  that  is  the  source  of  its  high  develop- 
ment cost,  since  that  efficiency  derives  from  computerized 
trial-and-error  routing  and  a  custom  mask,  both  of  which 
are  expensive. 

Figure  1  gives  the  1985  sales  volume  for  various  types  of 
ICs  and  estimates  the  1990  volume  for  them.  All  types  show 
healthy  growth.  Because  standard  cells  are  starting  at  a  low 
level  and  because  they  are  expected  to  become  very  popular 
as  the  IC  market  matures,  they  are  expected  to  show  the 
highest  growth  rate.  Linear  arrays,  though  relatively  new 
and  technologically  difficult,  should  also  show  impressive 
growth.  PLDs  and  gate  arrays  should  also  show  very 
healthy  growth,  with  the  growth  in  PLDs  (367  percent)  ex- 
ceeding that  in  gate  arrays  (234  percent).  Part  of  the  greater 
growth  of  PLDs  is  attributable  to  their  ease  of  use,  but  part 
is  also  attributable  to  the  ease  with  which  they  can  be  repro- 
grammed— for  circuit  designs  that  are  likely  to  change,  pro- 
grammable logic  is  the  only  practical  solution. 

It  is  very  difficult  to  directly  compare  PLD  cost  and  gate 
array  cost — one  is  comparing  apples  and  oranges.  PLDs 
have  a  minimal  development  cost  and  a  standard  unit  cost. 
Gate  arrays,  on  the  other  hand,  have  a  large  development 
cost  and  a  small  unit  cost.  Moreover,  it  is  very  difficult  to 
determine  the  PLD  architecture  that  is  equivalent  to  a  par- 
ticular gate  array  and  vice  versa.  Currently  available  PLDs 
such  as  Monolithic's  MegaPAL  devices  have  as  many  as 
5000  four-input  NAND-gate  equivalents.  However,  a 
5000-gate  gate  array  probably  cannot  replace  a  5000-gate- 
equivalent  PLD  exactly,  and  such  a  PLD  cannot  replace  a 
5000-gate  gate  array  exactly.  Such  replacement  is  very 
application-dependent.  A  MegaPAL  array,  for  example,  is 
limited  to  64  inputs  and  32  outputs.  If  a  gate  array  has  more 
inputs  and  outputs,  the  MegaPAL  device  cannot  replace  it. 
And  if  the  gate  array  cannot  duplicate  a  feature  of  the 
MegaPAL  array — such  as  its  large  fan-in  on  its  AND  gates, 
for  example — then  the  gate  array  cannot  replace  MegaPAL 
array.  Even  at  lower  gate  densities,  comparisons  remain 
application-dependent . 


1985 


Figure  1.  Growth  of  the  worldwide  application-specific 
IC  market  from  1985  to  1990.  Dollars  are  in  millions. 

(Source:  Integrated  Circuits  Engineering  Corp.) 

For  purposes  of  providing  a  general  idea  of  the  cost  effi- 
ciencies over  volume  of  gate  arrays  and  PLDs,  we  can  com- 
pare them  as  if  they  did  have  one-to-one  equivalency,  how- 
ever. The  nonrecurring  engineering,  or  NRE,  charge  for  a 
gate  array  can  vary  from  $15,000  to  $75,000,  depending  on 
complexity  and  design  methodology.  If  a  gate  array  has  a 
$35,000  NRE  charge  and  a  $1.00  unit  cost,  its  cost  per  unit 
drops  almost  inversely  with  volume,  as  shown  in  Figure  2. 
Now  let  us  examine  a  specific  type  of  PLD,  the  PAL  device. 
If  a  PAL  device  costs  $5.00  for  a  single  unit  and  drops  in 
price  with  volume  according  to  standard  pricing  curves,  it 
shows  a  more  steady  decline  in  price  with  volume  (the  PAL 
plot  in  Figure  2).  The  crossover  point  in  this  comparison  is 
almost  at  100,000  units.  The  graph  shows  quantatively  the 
reasons  why  gate  arrays  become  cost-effective  over  larger 
volumes,  but  it  leaves  out  several  other  factors,  including 
gale  equivalencies,  risk,  and  time  to  production. 

Let  us  compare  the  design  and  production  costs  for  PLDs 
and  gate  arrays.  For  a  PLD,  design  cost  means  the  design 
engineer's  time,  as  we  mentioned  above.  This  is  already  a 
committed  cost.  However,  for  a  gate  array,  design  cost  can 
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Figure  2.  PLD  (PAL  and  HAL  device)  cost  per  unit 
vs.  gate  array  cost  per  unit.  The  break-even  point 
for  PAL  devices  and  gate  arrays  occurs  at  about 
100,000  units. 


100K 
units 


be  a  sum  of  several  factors.  The  time  a  designer  spends  on  a 
computer  to  create  a  design  can  be  very  expensive,  and  the 
software  he  needs  to  create  and  test  the  design  can  also  be 
expensive.  Production  costs  for  both  types  of  device  include 
not  just  the  cost  of  the  silicon  but  also  the  cost  of  masks, 
fabrication,  and  labor.  And  if  a  device  does  not  work  as  ex- 
pected, the  entire  process  must  be  repeated. 

With  a  PLD,  the  only  purchase  cost  is  that  for  the  silicon 
itself.  The  PLD  is  mass-produced,  so  all  the  supporting 
design  efforts— fabrication,  test  generation,  and  reliability 
enhancements — are  spread  over  time  and  many  devices.  The 
result  is  much  lower  cost  to  the  user. 

For  high-volume,  fixed  applications,  a  PAL  device  can  be 
replaced  with  a  mask -programmed  hardwired  array  logic,  or 
HAL,  device.  The  HAL  device  replaces  the  fuses  of  the 
PAL  circuit  with  a  final  mask  step  that  permanently 
metallizes  the  logic  connections.  This  alternative  is  an  ex- 
cellent match  for  the  volume  applications  that  are  attractive 
to  gate  arrays.  The  HAL  circuit  combines  the  lower  unit 
cost  of  the  mask-programmed  logic  device  with  the  pro- 
totyping capabilities  of  the  PAL  circuit.  It  offers  advantages 
unmatched  by  any  other  form  of  semicustom  logic.  Assum- 


Figure  3.  The  basic  PAL  consists  of  two 
arrays— a  fuse-programmable  AND  array 
and  a  fixed  OR  array.  This  arrangement 
follows  the  sum-of-products  format 
familiar  to  the  design  engineer  who  uses 
Boolean  logic. 
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Figure  4.  Two-input  AND  array  with  four  AND 
gates.  An  input  (in  or  /in)  is  disconnected  from  an 
AND  gate  when  the  fuse  associated  with  that  input 
blown. 
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Figure  5.  An  "x"  indicates  an  intact  fuse.  The  two- 
input  AND  array  with  four  AND  gates  (product 
lines)  from  Figure  4  is  shown  along  with  a  two- 
output  OR  array.  Note  that  the  AND  array  is  user- 
programmable,  whereas  the  OR  array  is  hardwired. 
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ing  an  NRE  charge  for  a  HAL  device  of  $5000  and  a  unit 
cost  of  $2.00,  we  can  plot  the  relationship  of  the  HAL 
device  to  the  PAL  and  gate  array  (see  Figure  2  again). 

PAL  devices — historical  background 

PAL  devices  were  introduced  by  Monolithic  Memories  in 
1978.  They  consist  of  two  arrays  of  logic  gates.  The  first 
array  consists  of  AND  gates  that  form  Boolean  products  of 
the  inputs,  and  the  second  array  consists  of  OR  gates  that 
sum  these  products  (see  Figure  3).  Hypothetical^,  there  are 
n  inputs  to  the  AND  array  and  m  outputs  from  the  OR 
array.  The  AND  array  consists  of  p  AND  gates  (and  p  prod- 
uct terms),  and  all  n  device  inputs,  both  inverted  and  non- 
inverted,  are  potential  inputs  to  any  of  the  p  AND  gates.  In 
an  unprogrammed  PAL  device,  every  device  input  is  actu- 
ally connected  to  every  AND  gate.  A  device  input  ceases  to 
be  an  input  to  a  particular  AND  gate  if  the  fuse  that  con- 
nects that  input  to  the  gate  is  blown.  Fuses  are  blown  ac- 
cording to  a  fuse  map  that  is  generated  by  the  user  via  soft- 
ware. Figure  4  shows  a  simple  two-input  array  with  four 
AND  gates.  Figure  5  shows  a  simplified  notation  for  this 
circuit — an  "  x  "  is  used  to  indicate  that  a  fuse  is  intact,  and 
a  single  line  is  used  to  represent  all  lines  of  connection  with 
an  AND  gate.  (Note  that  the  use  of  the  single  line  does  not 
imply  the  connection  of  those  lines  to  each  other.)  Figure  5 


also  shows  a  two-output  OR  array  that  uses  the  same  con- 
ventions for  intact  fuses  and  lines  of  connection.  Most  PAL 
devices  have  a  considerably  greater  number  of  inputs,  prod- 
uct terms,  and  outputs  than  shown  here,  but  this  example 
illustrates  the  general  PAL  structure. 

The  arrangement  of  the  AND  and  OR  gates  follows  the 
sum-of-products  format  familiar  to  the  design  engineer  who 
uses  Boolean  logic  to  express  his  logic  requirements.  As 
implied  above,  the  engineer  can  program  the  AND  array  in 
a  PAL  device  by  using  links  similar  to  those  used  in  a 
PROM  circuit.  He  opens  the  particular  links  needed  to  con- 
figure the  logic  to  the  user's  specification. 

Before  discussing  the  PAL20RA10  in  particular,  we 
should  note  the  differences  between  PAL  devices  and  other 
PLDs.  Most  PLDs  consist  of  an  AND  array  followed  by  an 
OR  array.  This  is  also  the  architecture  of  memory  devices, 
including  bipolar  PROMs.  In  a  PROM,  the  AND  array  is 
fixed  and  completely  decodes  the  inputs.  The  OR  array  is 
programmable  and  holds  the  information  at  each  decoded 
location,  or  the  word  at  every  address  (Figure  6a). 

The  first  PLD  was  developed  by  Signetics  and  is  called  a 
field-programmable  logic  array,  or  FPLA.  The  FPLA  is  an 
implementation  of  the  general  PLD  logic  combination — an 
AND  array  followed  by  an  OR  array — in  which  fuses  are 
used  for  interconnections  in  each  array.  The  FPLA  allows 
both  arrays  to  be  programmed  (Figure  6b)  and  thus  pro- 


Figure  6.  PROM  AND-OR  array  (a) — the  AND  array  is  fixed  to  decode  the  address,  whereas  the  OR  array 
is  programmable  for  the  stored  data;  field-programmable  logic  array,  or  FPLA  (b) — both  the  AND  array  and 
the  OR  array  are  programmable;  PAL  device  (c)— the  AND  array  is  programmable  whereas  the  OR  array  is 
fixed.  The  PAL  device  offers  much  of  the  flexibility  of  the  FPLA  along  with  the  low  cost  and  easy  program- 
mability  of  the  PROM. 
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Figure  9.  PAL16R4  logic  diagram.  This  device  has  four  registered  outputs  with  a  single  (synchronous) 
clock. 
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Figure  10.  History  of  the  speed  im- 
provements and  power  consumption 
reductions  for  devices  in  the  Medium 
PAL  family. 
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New  product  development 

New  product  ideas  in  semiconductor  companies  come 
from  a  variety  of  sources — from  customers  and  those  in 
direct  contact  with  them  to  design  engineers  who  have 
ingenious  ideas  about  how  to  add  features  without 
adding  cost,  power,  or  timing  delays.  Since  customers 
are  usually  willing  to  describe  the  product  that  would  be 
ideal  for  them,  they  are  often  a  good  source  of  ideas. 
Product  ideas  usually  pass  through  the  following  steps. 

First,  the  idea  is  generated.  Typically  this  is  in  response 
to  a  limitation  in  a  current  product  and  is  an  evolutionary 
step.  Revolutionary  ideas  such  as  the  original  PAL 
concept  are  rare  and  usually  come  from  approaching 
old  problems  in  a  new  way.  If  the  idea  is  a  customer's 
suggestion,  it  is  usually  reported  to  the  factory  in  a 
trip  report. 

The  idea,  whether  generated  internally  or  by  a  cus- 
tomer, is  next  submitted  to  a  group  typically  consisting  of 
representatives  from  product  planning  (including  some- 
one from  the  product  area  into  which  the  idea  falls, 
whose  job  is  to  examine  such  ideas) ,  marketing,  design, 
and  product  engineering.  The  group  discusses  the  new 
idea  and  responds  to  it;  it  either  drops  the  idea,  giving 
reasons  for  doing  so,  approves  it  for  further  considera- 
tion, or  asks  for  more  details. 

If  a  product  idea  is  approved  for  further  consideration, 
a  member  of  the  product  planning  group  carries  out 
market  surveys  and  analyses.  He  contacts  customers, 


often  in  person,  and  locates  third-party  market  studies 
and  research.  He  analyzes  internal  capabilities  and  seeks 
out  information  about  products  that  could  compete  with 
the  proposed  product.  At  most  companies,  the  result  of 
his  efforts  is  a  new  product  approval  document  consisting 
of  a  description  of  the  product  and  its  applications,  alter- 
natives to  the  product,  a  discussion  of  the  market  for  the 
product,  and  a  marketing  strategy  for  it. 

If  the  product  is  approved  by  all  the  representatives  in 
the  group,  it  is  assigned  to  design  engineering.  The 
design  engineer  is  free  to  use  whatever  tools  he  chooses 
within  the  definition  set  out  for  him.  This  definition  in- 
cludes such  items  as  performance  requirements  but  rarely 
specifies  the  process  technology.  It  is  often  modified  as 
new  requirements  arise. 

Design  engineering  is  finished  with  the  product  only 
after  it  has  been  successfully  produced  on  the  production 
line.  It  is  then  released  to  product  engineering,  which 
takes  on  full  responsibility  for  the  product  for  its  lifetime. 
Product  engineering  is  typically  involved  before  this  point, 
however,  making  sure  the  product  is  progressing  well  and 
problems  are  being  eliminated  quickly. 

The  final  step  is  release  to  the  marketplace.  For  most 
companies,  release  to  market  means  sending  out  data 
sheets  and  samples  to  the  field  once  production  quan- 
tities have  become  available.  For  programmable  products 
such  as  PAL  devices,  software  and  programmer  support 
must  also  be  in  place. 
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Figure  7.  Some  of  the 
first  PAL  devices 
from  Monolithic 
Memories. 


Figure  8.  The  first 
registered  PAL 
devices. 


vides  a  great  deal  of  flexibility.  However,  its  programming 
circuitry  and  fuses  lead  to  a  large,  slow,  power-consuming 
device  and  make  programming  expensive  and  difficult. 

In  1977,  John  Birkner  of  Monolithic  Memories  got  the 
idea  of  taking  the  FPLA  architecture  and  hardwiring  the 
OR  array,  leaving  only  the  AND  array  programmable  (Fig- 
ure 6c).  The  result  was  called  a  PAL  circuit,  PAL  being  the 
acronym  for  programmable  array  logic.  (Monolithic  Memo- 
ries has  since  registered  PAL  as  a  trademark  describing  its 
products.)  A  PAL  device  offers  higher  speed  and  consumes 
less  power  than  an  FPLA.  The  first  PAL  devices  were  de- 
signed so  that  engineers  could  program  them  just  like  512  x 
4  PROMs,  using  phantom  fuses  to  fill  in  beyond  the  512 
fuses  in  the  actual  devices.  Thus,  engineers  could  program 
them  on  widely  available,  inexpensive  programmers. 

The  first  family  of  PAL  devices  included  both  combina- 
torial and  registered  devices.  Combinatorial  devices  consist 
of  just  a  programmable  AND  array  followed  by  an  OR 
array,  with  no  registers  or  other  macrocells.  They  are  fairly 
simple  and  provide  little  more  than  a  varying  number  of 
inputs  and  outputs  (see  Figure  7).  Registered  devices,  on  the 
other  hand,  provide  clocked  registers  for  the  outputs  and 
allow  the  previous  output  state  to  be  stored  and  used  for 
determining  the  next  state.  These  types  of  devices  are  much 
more  challenging.  A  register  allows  the  device  to  be  used  as 
a  state  machine,  one  in  which  the  outputs  depend  not  only 
on  the  present  inputs  but  also  on  the  previous  state. 


Since  a  state  machine  can  be  implemented  in  a  variety  of 
forms,  the  registered  PAL  architecture  is  difficult  to  define. 
To  do  so,  one  must  specify  the  total  number  of  registers, 
whether  a  register  can  feed  its  output  back  into  the  logic 
array,  the  nature  of  the  control  of  the  output  enable  follow- 
ing the  register,  and  common  register  features  such  as  preset 
and  clear  functions.  These  specifications  have  a  major  influ- 
ence on  the  potential  applications  of  the  device. 

Monolithic's  original  registered  PAL  product  family, 
whose  members  were  designed  to  be  used  as  state  machines, 
is  the  Medium  20  family,  where  Medium  describes  the  logic 
density  and  20  is  the  number  of  pins.  The  family  consists  of 
four  devices,  all  with  16  complemented  inputs  to  the  AND 
array  and  eight  outputs  from  the  OR  array.  They  have  vary- 
ing numbers  of  registers — zero,  four,  six,  or  eight — which  is 
reflected  in  the  part  numbers,  PAL16L8,  PAL16R4, 
PALI6R6,  and  PAL16R8  (see  Figure  8).  The  16R4  directly 
implements  Mealy  state  machines,  whereas  the  16R8  directly 
implements  Moore  state  machines.  In  the  16L8,  16R4,  and 
16R6,  combinatorial  outputs  are  enabled  by  individual 
product  terms  from  the  AND  array.  The  product  term 
enables  allow  the  combinatorial  outputs  to  function  as  in- 
puts. These  inputs,  along  with  feedback  from  the  registers 
and  eight  dedicated  inputs,  provide  the  16  inputs  to  the 
AND  array.  The  logic  diagram  of  the  PALI6R4  is  shown  in 
Figure  9. 

The  architecture  of  this  family  has  proven  capable  of 
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Figure  1 1.  7474  dual  D-type  flip-flop.  Note  the  two 
separate  clocks  that  allow  asynchronous  operation  of 
the  registers. 


handling  a  remarkable  percentage  of  what  customers  re- 
quire in  programmable  logic.  The  improvements  customers 
typically  request  are  higher  speed  and  lower  power  con- 
sumption. We  have  made  progressive  improvements  in 
speed,  to  a  10-ns  propagation  delay  in  the  Medium  20D 
Series  and  a  6-ns  one  in  the  ECL  PAL  IC.  We  have  also  re- 
duced power  consumption  without  sacrificing  speed.  The 
20B-4  Series  requires  one  fourth  the  power  of  the  original  20 
Series  yet  provides  the  same  speed.  With  a  propagation  de- 
lay of  35  ns,  the  20B-4  devices  have  a  maximum  supply  cur- 
rent of  55  mA,  compared  to  180  mA  for  the  original  parts 
at  the  same  speed.  CMOS  mask-programmed  versions  re- 
quiring zero  standby  power  are  also  available.  Figure  10 
shows  the  history  of  speed  and  power  consumption  im- 
provements in  the  Medium  20  family. 

While  we  have  been  making  these  improvements,  we  have 
also  introduced  new  families  with  higher  logic  densities  and 
the  40-  and  84-pin  MegaPAL  families.  We  have  added  new 
features  to  these  families,  including  additional  logic  capabil- 
ities such  as  exclusive-OR  gates,  arithmetic  feedback,  pro- 
grammable output  polarity,  product  term  sharing,  register 
bypassing,  and  register  preloading.  We  made  these 
enhancements  in  response  to  specific  customer  needs,  and 
they  have  opened  new  applications  for  programmable  logic. 

Most  of  these  new  features  were  simply  extensions  of  the 
original  PAL  concept.  The  original  PAL  family  was  a 
breakthrough,  a  new  alternative  offering  completely  new 
advantages  in  semicustom  logic.  In  building  on  this  family, 
we  concentrated  on  speed  and  power  improvements  and  not 
on  finding  revolutionary  new  architectures.  However,  the 
PAL20RA10,  which  we  will  discuss  shortly,  is  another 
breakthrough  concept.  It  did  not  derive  from  extensions  to 


the  original  idea  but  arose  as  a  new  response  to  the  needs  of 
customers  using  PAL  devices. 


The  PAL20RA10 

The  PAL20RA10  was  first  proposed  at  an  internal 
Monolithic  Memories  conference  in  1981 .  The  original  con- 
cept was  described  as  an  "unstructured  PAL"  having  inde- 
pendent programmable  clock,  reset,  and  set  controls  on 
each  flip-flop.  Until  then,  all  PAL  devices  had  been  struc- 
tured— or  synchronous — in  architecture;  all  transitions  on 
register  outputs  occurred  simultaneously  in  response  to  the 
rising  edge  of  the  clock  signal.  The  proposed  ability  to  con- 
trol the  clock  independently  for  each  flip-flop  would  enable 
the  engineer  to  create  asynchronous  logic  within  a  PAL 
chip. 

The  idea  was  proposed  by  Harry  Hughes,  who  was  then  a 
field  applications  engineer,  or  FAE,  in  Great  Britain.  FAEs 
are  the  company  representatives  who  have  primary  contact 
with  customers;  they  have  responsibilities  ranging  from 
helping  existing  customers  with  complex  design  problems  to 
introducing  Monolithic  Memories  to  potential  customers. 
As  the  primary  customer  contact,  they  are  quite  familiar 
with  real  user  needs. 

The  concept  of  the  unstructured  PAL  circuit  had  origi- 
nated with  several  European  customers.  They  had  been 
using  the  Medium  20  PAL  Series  in  large  quantities. 
However,  the  application  logic  they  had  been  integrating 
had  not  always  fitted  well  in  PAL  devices.  They  had  often 
had  sections  of  individual  7474  flip-flops  mixed  among  ran- 
dom logic.  A  7474  device  (see  Figure  1 1)  consists  of  two  in- 
dependent D-type  flip-flops,  each  with  its  own  data  input 
and  complementary  outputs,  and  clock,  preset,  and  clear.  A 
standard  PAL16R8  has  eight  of  these  D-type  flip-flops  but 
only  one  clock  and  no  preset  and  clear  functions.  Thus,  in- 
tegrating the  collection  of  7474s  and  the  glue  logic  had  re- 
quired the  customer  to  convert  a  multiclock  design  into  a 
single-clock  one.  This  had  been  difficult  or  impossible  to  do 
with  standard  products. 

The  customer  requested  an  independent  clock,  set 
(preset),  and  reset  (clear)  for  each  flip-flop.  Since  the  FAE 
had  found  other  customers  requesting  similar  architectures, 
he  proposed  the  device.  He  took  the  16R8  logic  diagram 
and  added  the  four  control  terms,  as  shown  in  Figure  12. 
He  described  the  concept  at  Monolithic's  FAE  conference  in 
July  of  1981. 

FAE  conferences  are  held  to  foster  technical  exchange  be- 
tween the  factory  and  the  field.  Problems  are  identified  and 
ideas  are  discussed.  Now  held  every  six  months,  these  con- 
ferences are  critical  for  planning  future  product  directions. 

After  it  had  been  presented  at  the  FAE  conference,  the 
unstructured  PAL  circuit  was  discussed  within  the  factory. 
For  an  idea  to  go  somewhere,  factory  discussion  must  pro- 
duce a  business  plan  for  the  product  that  describes  it  com- 
pletely and  estimates  the  profits  that  can  be  expected  from 
it .  New  product  ideas  must  pass  several  stages  of  analysis 
and  approval.  (See  "New  product  development,"  at  left.) 
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Figure  12.  Harry  Hughes' modification  of  the  PAL16R8  logic  diagram.  To  illustrate  his  proposal  for  a  new  device, 
he  added  four  product  terms  (heavy  lines)  to  control  the  enable,  set,  reset,  and  clock. 


The  major  problem  with  the  unstructured  PAL  circuit 
was  its  inherent  lack  of  testability.  The  power  its  architec- 
ture gives  the  customer  also  demands  that  the  customer  use 
it  carefully.  For  example,  the  minimum  setup  time  required 
before  the  register  can  be  clocked  must  be  carefully  con- 
sidered when  the  clock  signal  is  a  complex  product  term. 
Data  must  be  made  available  to  a  flip-flop  early  enough  so 
that  even  the  minimum  time  to  activation  of  the  clock  satis- 
fies the  setup  time  required.  This  required  delay  can  be  very 
difficult  to  calculate,  especially  if  the  clock  term  is  under  the 
control  of  another  asynchronous  register.  All  of  the  internal 
asynchronous  signals  must  be  handled  with  care  to  prevent 
invalid  combinations. 

These  and  other  testability  issues  caused  great  concern 
and  led  to  several  unique  solutions.  Interna]  test  features 
were  added,  including  the  ability  to  test  both  DC  and  AC 
characteristics  on  unprogrammed  devices  and  even  the  abili- 
ty to  test  programming  characteristics  without  affecting  the 
fuses  in  the  actual  array.  Two  external  pins  were  given  dedi- 
cated functions:  one  is  an  output  enable  that  allows  the 
device  to  be  disabled  during  board-level  testing;  the  other  is 
for  register  preloading,  which  allows  the  user  to  gain  access 
to  the  flip-flops  without  going  through  the  fuse  array. 

The  final,  approved  product  was  called  the  20FF10  at 
first.  It  had  20  array  inputs  and  10  outputs,  and  enough 
new  features  to  earn  it  the  name  "Feature  Freak" — the 
source  of  the  "FF"  in  the  designation.  However,,  a  name 
must  uniquely  identify  a  product  and  must  do  so  in  terms 
the  customer  can  recognize.  The  "FF"  for  "feature  freak" 
was  unique  but  not  very  meaningful  to  the  customer,  so  this 
part  of  the  name  was  changed  to  RA,  for  "registered  asyn- 
chronous," and  the  new  device  became  the  20RA10. 

The  approval  document  for  the  device  described  its  archi- 
tecture, which  is  shown  in  Figure  13.  The  clock,  set,  and 
reset  are  controlled  by  product  terms,  as  shown  in  Hughes' 


original  sketch  of  the  unstructured  PAL  circuit  (see  Figure 
12  again). 

A  compromise  was  made  for  the  output  enable.  An  exter- 
nal output  enable  is  required  for  most  applications,  especial- 
ly for  disabling  the  device  during  board-level  testing.  But  an 
internal  (asynchronous)  output  enable  for  independent 
enabling  of  the  flip-flops  is  needed,  too.  One  solution 
would  have  been  an  external  enable  pin  for  each  output,  but 
that  would  have  been  a  highly  inefficient  use  of  external 
pins.  The  solution  that  was  used  is  an  enable  signal  that  is 
the  product  of  the  external  common  enable  pin  and  an  in- 
ternal independent  product  term.  Thus,  the  enable  can  be 
activated  either  externally  (in  common)  or  internally  (asyn- 


Figure  13.  The  PAL20RA10  output  cell.  Note  the 
separate  product  terms  that  control  the  set,  reset, 
enable,  and  clock.  Note  also  the  output  enable  that 
can  be  controlled  either  externally  (synchronously) 
or  internally  (asynchronously). 


The  PAL20RA1 0  Story— The  Customization  of  a  Standard  Product 


Figure  14.  Logic  diagram 
for  the  complete 
PAL20RA10  cir- 
cuit—AND  array,  OR  ar- 
ray, and  asynchronous 
registered  outputs. 


chronously),  satisfying  both  the  design  engineer,  who  wants 
the  internal  enable,  and  the  test  engineer,  who  wants  the  ex- 
ternal control. 

Another  relatively  new  feature  was  implemented  in  this 
circuit— programmable  polarity.  This  allows  the  output 
polarity  to  be  either  active  low  or  active  high.  The  design 
engineer  can  then  write  logic  equations  either  way,  allowing 
his  software  to  match  the  sense  of  an  equation  to  the  actual 
device.  This  enables  the  designer  to  fit  more  logic  into  a 
single  PAL  circuit. 

The  problem  of  how  many  registered  and  combinatorial 
outputs  to  provide  had  been  solved  in  previous  devices  by 
offering  a  family  of  combinations.  The  solution  to  this 


problem  for  the  PAL20RA10  came  somewhat  by  accident. 
The  situation  of  set  and  reset  going  active  simultaneously 
had  to  be  dealt  with.  We  decided  that  allowing  this  situation 
to  result  in  bypassing  of  the  register  provided  a  beneficial 
new  feature.  If  both  the  set  and  reset  product  terms  are 
activated,  the  register  becomes  transparent  and  the  output 
becomes  combinatorial.  Because  of  this  feature,  the  logic 
designer  can  decide  at  programming  time  how  many 
registers  he  wants  (from  zero  to  ten)  and  on  which  outputs 
he  would  like  them.  This  architectural  flexibility  is  unique 
within  programmable  logic. 

These  features  are  shown  in  the  logic  diagram  in  Figure 
14.  However,  though  they  describe  exactly  what  the  user  en- 
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counters,  they  do  not  actually  exist  within  the  device  itself. 
The  logic  diagram  is  only  a  logical  equivalent  to  the  actual 
layout  of  the  circuit. 

The  first  thing  to  note  about  the  actual  design  is  that  the 
programmable  AND  array  and  following  OR  array  com- 
mon to  all  PAL  circuits  are  actually  implemented  as  an  OR 
array  and  a  following  NAND  array!  This  type  of  logic  is 
required  because  of  the  architecture  of  the  circuitry  that 
senses  the  ones  and  zeroes  in  the  programmable  array.  Opti- 
mized for  bipolar  PROMs,  this  circuitry  senses  current 
flowing  from  one  or  more  of  the  transistors  in  a  column  of 
the  array.  Since  it  senses  current  if  at  least  one  of  the  tran- 
sistors is  on,  the  logic  is  equivalent  to  an  OR  gate.  Thus,  the 
programmable  array  must  be  an  OR  array,  as  it  is  in  a 
PROM.  To  create  the  logic  of  a  PAL  device,  we  placed  a 
logical  NAND  array  after  the  programmable  OR  array, 
creating  an  OR-NAND  structure,  that  is,  an  inverted  prod- 
uct of  sums,  which  by  Boolean  arithmetic  is  equivalent  to  a 
sum  of  products. 

To  maintain  the  flexibility  of  the  20RA10,  we  had  to 
make  various  trade-offs.  One  example  is  the  output  drive. 
The  original  proposal  specified  an  output  drive  of  32  mA. 
We  lowered  this  to  8  mA  to  reduce  the  power  requirements. 

Although  standard  24-pin  products  were  running  as  fast 
as  25  ns,  the  highest  speed  we  could  guarantee  on  the 
PAL20RA10  was  30  ns.  This  slight  decrease  in  speed  was 
due  to  the  additional  features  of  the  device— a  combina- 


torial output  signal  travels  through  the  polarity  control, 
register  bypass  multiplexer,  and  output  buffer  before  it 
reaches  an  output  pin.  Monolithic  Memories  has  been  pro- 
gressively improving  its  process  capabilities,  and  the  part  is 
being  transferred  to  a  faster  process  to  increase  its  speed. 

Beyond  speed,  the  fabrication  process  used  for  the 
PAL20RA10  created  no  specific  limitations.  The  process 
was  the  workhorse  of  the  bipolar  efforts  at  Monolithic 
Memories  at  the  time.  It  is  a  standard  junction-isolated 
bipolar  process,  optimized  for  high  speed  (see  "Process 
technology,"  at  right).  Extensive  experience  with  the  process 
results  in  high  manufacturing  yields  and  high  reliability  and, 
as  a  result  of  these,  lower  costs.  In  fact,  the  greatest  effect 
of  the  process  on  the  product  is  probably  in  the  cost.  The 
PAL20RA10  process  is  well  understood  and  provides  excel- 
lent results  for  what  it  costs. 

A  die  photograph  of  the  20RA10  is  shown  in  Figure  15. 
The  various  blocks  are  outlined.  The  die  had  to  meet  a  cus- 
tomer requirement— that  it  fit  into  a  24-pin,  300-mil-wide 
SKINNYDIP  package.  The  additional  features  had  in- 
creased the  die's  width,  yet  to  fit  into  this  package  it  had  to 
be  less  than  140  mils  wide.  Some  squeezing  of  the  layout 
during  the  final  stages  of  design  allowed  this  requirement  to 
be  met. 

The  PAL20RA10  is  much  more  than  a  die,  however.  It 
also  requires  programming  and  software  support.  Program- 
ming support  is  the  one  critical  service  Monolithic  Memories 
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does  not  provide,  and  thus  it  depends  on  the  major  PLD 
programer  manufacturers  for  it.  But  to  be  able  to  support  a 
product  at  market  release,  the  programmer  manufacturers 
must  be  provided  with  the  programming  specification  for 
the  product  months  before  it  is  released.  Monolithic  there- 
fore keeps  in  close  contact  with  these  manufacturers. 

Software  support  is  provided  by  Monolithic  Memories 
through  its  PAL  assembler  software  package,  PALASM  2. 
PALASM  2  takes  Boolean  logic  equations,  tests  them 
against  expected  outputs,  and  converts  them  to  a  fuse  map 
for  a  PAL  device.  A  description  of  this  fuse  map  can  then 


be  downloaded  to  a  programmer  in  a  standard  JEDEC 
format  for  customization  of  the  chip. 

Software  support  is  a  critical  part  of  the  product  because 
it  is  through  software  that  the  customer  implements  his  de- 
sign. A  designer  wants  the  conversion  from  what  he  knows 
to  be  the  required  logic  to  what  a  PAL  programmer  can 
understand  to  be  as  simple  and  easy  as  possible.  PALASM  2 
allows  the  designer  to  express  his  function  with  Boolean 
equations.  It  then  uses  that  input  to  configure  output 
polarity,  register  bypassing,  and  all  the  other  configuration 
features. 


Process  technology 

The  process  used  for  the  PAL20RA10  is  a  junction- 
isolated  one.  A  cross  section  of  the  20RA10  is  shown  in 
Figure  A.  NPN  transistors  are  separated  from  one  an- 
other by  junctions. 

The  process  is  a  washed-emitter  one.  In  this  process,  a 
layer  of  nitride  (Si3NJ  is  used  as  a  mask.  This  mask  layer 
defines  the  size  of  the  emitter  and  the  emitter  contact  and 
allows  the  emitter  and  the  emitter  contact  to  be  the  same 
size.  In  other  processes,  the  emitter  is  usually  made  larger 
than  the  emitter  contact  to  guarantee  good  contact. 
When  formed  at  the  same  time,  however,  they  can  be 
the  same  size  and  still  maintain  proper  contact.  This 


allows  the  emitter  to  become  the  minimum  feature  size  of 
the  design,  providing  a  smaller  overall  die  size  than  the 
die  size  that  could  be  provided  by  a  standard  process. 

A  programming  fuse  is  located  between  an  emitter  and 
a  product  line  that  in  turn  leads  to  a  collector  (Figure  B)  I 
driving  a  large  current  through  this  fuse  opens  its  narrow 
neck  and  causes  it  to  change  from  a  very  low  resistance 
to  a  very  high  one.  The  process  does  not  cause  the  fuse 
to  explode,  as  some  claim;  it  merely  severs  the  greater 
part  of  the  connection.  This  change  opens  the  link  be- 
tween the  emitter  and  the  product  line. 


Emitter  metallization 

Collector  metallization  ,ai 
TiW 


Buried  subcollector 


Si02 


Figure  A.  Cross  section  of  the 
PAL20RA10— NPN  transistors 
are  separated  from  one  another 
by  junctions. 


Fuse  (cross  section) 


Figure  B.  Location  of  the  pro- 
gramming fuse  in  the 
PAL20RA10. 


The  PAL20RA10  Story— The  Customization  of  a  Standard  Product 


The  new  features  of  the  PAL20RA10  required  that  some 
new  syntactical  conventions  be  added  to  the  PALASM  2 
software  package.  In  addition  to  the  standard  logic  transfer 
functions,  the  designer  has  to  be  able  to  specify  several  nov- 
el architectural  situations.  For  example,  he  has  to  be  able  to 
specify  register  bypassing  in  either  of  two  ways:  implicitly  or 
explicitly.  He  specifies  bypassing  implicitly  by  writing  a 
combinatorial  logic  equation  using  an  "  =  "  sign  instead  of 
the  ":  =  "  sign,  which  is  reserved  for  registered  outputs.  He 
specifies  bypassing  explicitly  by  activating  the  set  and  the 
reset  on  the  flip-flop  simultaneously,  causing  the  register  to 
be  bypassed  and  the  output  to  be  combinatorial. 

The  control  terms  are  described  by  dot  operators  such  as 
"term.set."  We  selected  dot  operators  for  the  same  reason 
they  are  used  in  popular  word  processing  programs— they 


rarely  occur  in  natural  language  and  can  therefore  be  given 
special  significance. 

Third-party  software  is  available  that  allows  other  forms 
of  input  such  as  schematic  entry.  ABEL  from  Data  I/O 
Corporation  and  CUPL  from  Assisted  Technology  both 
provide  high-level  support  for  PLDs. 

As  we  asserted  earlier,  true  customization  of  the 
PAL20RA10  involves  much  more  than  matching  the  func- 
tion to  the  customer's  requirements.  Speed  and  power  con- 
sumption must  be  acceptable,  design  support  for  the  device 
must  be  available,  pricing  must  be  reasonable,  product 
availability  must  be  high,  and  quality  must  be  high.  The 
PAL20RA10  has  been  in  full  production  for  well  over  a 
year,  and  to  the  good  fortune  of  Monolithic  Memories  all  of 
these  requirements  have  been  met.  Moreover,  FAEs  are  ex- 
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cited  about  teaching  customers  about  a  product  that 
originated  with  one  of  their  own  members.  In  fact,  the 
20RA10  was  the  first  product  proposed  by  an  FAE  to  go 
into  production  at  Monolithic  Memories.  Market  accep- 
tance has  been  beyond  expectations,  and  as  a  result  FAEs  are 
being  brought  even  further  into  the  product  planning  process. 

The  PAL20RA10  is  the  first  member  of  a  new  family  of 
devices;  new  products  are  in  various  stages  of  proposal,  de- 
sign, and  production.  A  20-pin  spin-off,  the  PAL16RA8,  is 
now  in  full  production.  A  higher-speed  version  is  in  design. 
A  version  in  a  40-pin  DIP  was  considered,  but  since 
20RA10  outputs  operate  independently,  the  higher  density 
and  resulting  larger  package  were  difficult  to  justify.  Two 
24-pin,  300-mil-wide  parts  can  do  more  in  less  board  space 
than  one  40-pin,  600-mil-wide  device. 
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Figure  16.  A 
PAL20RA10used 
as  a  universal  port 
interface  adapter 
for  a  Motorola 
MC68020  32-bit 
microprocessor.  It 
generates  logic  that 
maintains  both 
byte  and  word  ad- 
dressability within 
the  68020's  32-bit- 
wide  word  format 
and  that  makes 
possible  the  use  of 
both  high-  and 
moderate-speed 
components  and 
peripherals  in  a 
single  system. 


The  PAL20RA10-an 
application  example 

To  demonstrate  the  capabilities  of  the  PAL20RA10  and 
the  PALASM-based  design  process,  we  present  an  applica- 
tion that  uses  the  PAL20RA10  as  a  universal  port  interface 
adapter  for  a  Motorola  MC68020  microprocessor  (Figure 
16).  This  example  should  help  demonstrate  the  features  we 
described  above  and  point  out  how  easily  the  20RA10  can 
be  customized  to  a  particular  user's  application. 

The  design  engineer  began  this  design  by  considering  the 
large  blocks:  the  microprocessor,  the  memory,  and  the  I/O. 
Now,  near  the  end  of  the  design,  he  needs  to  place  some 
glue  logic  between  the  microprocessor  and  the  I/O  ports. 
He  is  open  to  anything  from  standard  to  full-custom  logic, 
and  he  considers  each  alternative. 

Because  he  is  very  unsure  of  the  finality  of  the  design,  he 
does  not  want  to  commit  himself  to  a  major  development 
effort.  Therefore,  he  rules  out  full-custom  logic  and  stan- 
dard cells.  He  may  be  able  to  make  a  gate  array  work,  but 
he  does  not  have  time  to  work  with  a  foundry  in  specifying 
the  design  nor  time  to  wait  for  the  foundry  to  deliver  the 
part.  In  fact,  he  would  like  to  have  the  part  immediately. 
Moreover,  since  his  design  is  not  yet  proven,  he  cannot  jus- 
tify the  high  cost  of  a  gate  array  development  effort. 

He  could  use  off-the-shelf  ICs  easily  enough,  but  he  does 
not  want  to  take  up  that  much  space  on  his  board.  He  has  a 
limited  amount  of  room  with  which  to  work.  He  could  use 
standard  LSI  devices,  but  none  provide  the  functions  he  re- 
quires. Thus,  he  arrives  at  the  programmable  logic  device  as 
the  logical  alternative. 

As  we  said  above,  in  this  design  the  PAL20RA10  func- 
tions as  an  asynchronous  universal  port  interface  adapter 
for  a  32-bit  microprocessor,  the  MC68020.  Among  the 
greatest  challenges  the  engineer  faces  when  designing  with 
the  new  generation  of  32-bit,  high-performance  micropro- 
cessors is  maintaining  both  byte  and  word  addressability 
within  the  microprocessor's  32-bit-wide  word  format  and 
accommodating  both  high-speed  and  moderate-speed  mem- 
ory components  and  peripherals  in  a  single  system. 

The  MC68020  has  a  dynamic  bus  sizing  feature  that  helps 
solve  the  first  of  these  two  problems— it  has  internal  logic 
that  manipulates  and  directs  words  smaller  than  32  bits. 
However,  it  needs  external  logic  to  handle  8-  and  16-bit 
memory  arrays  and  peripherals.  It  even  needs  such  logic  to 
maintain  byte  addressability  of  32-bit  memory  arrays.  This 
external  logic  must  be  able  to  decode  the  transfer  size  con- 
trol signals  from  the  MC68020  so  it  can  generate  memory 
chip  selects  and  handshake  back  to  the  microprocessor  with 
the  port  size  acknowledge  signal.  The  PAL20RA10  is  well 
suited  to  this  task. 

External  logic  also  helps  the  MC68020  solve  the  second  of 
the  two  problems.  The  16.7-MHz  speed  of  the  MC68020 
version  used  here  translates  into  60-ns  machine  cycles  and 
90-ns  memory  cycles,  and  therefore  the  MC68020  can  keep 
up  with  many  of  today's  fastest  memories.  But  it  may  have 
to  work  with  slower  bulk  memories  as  well.  To  do  this  with- 
out slowing  down  the  high-performance  memories,  the 
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Table  2. 

DIP  switch  settings  used  in  the  universal  port 
interface  adapter  to  choose  the  data  bus  port 
size. 


Table  3. 

DIP  switch  settings  used  in  the  universal  port 
interface  adapter  to  choose  the  number  of  wait 
states. 


Position 
PS1  PSO 


L 
L 
H 
H 


L 
H 
L 
H 


Data  bus 
port  size 

No  port  present 
8-bit 
16-bit 
32-bit 


Position 
WS1  WSO 

L  L 

L  H 

H  L 

H  H 


Number  of 
wait  states 

0 
1 
2 
3 


68020  must  be  able  to  insert  wait  states  into  the  memory  cy- 
cles of  the  slower  memories.  The  20RA10  can  help  the 
68020  do  this  by  intercepting  microprocessor  memory  access 
requests  and,  when  needed,  generating  control  signals  to  the 
microprocessor  requesting  wait  states. 

In  this  design  the  PAL20RA10  functions  as  a  universal 
port  interface  adapter.  Both  blocks  of  external  logic  we  al- 
luded to  in  the  previous  two  paragraphs  are  implemented  in 
a  single  PAL20RA10.  The  asynchronous  capabilities  of  this 
PAL  circuit  are  required  by  this  design  because  the 
MC68020  provides  asynchronous  control  signals  that  must 
be  processed  by  the  PAL  device  between  clock  signals.  On 
the  other  hand,  the  synchronous  capabilities  of  the  20RA10 
are  needed  for  generating  memory  control  signals.  An  inter- 
nal PAL  state  machine  must  be  synchronous  with  the  sys- 
tem clock.  Therefore,  neither  a  combinatorial  (i.e.,  asyn- 
chronous only)  nor  registered  (i.e.,  synchronous  only)  PAL 
device  will  fill  the  bill.  The  PAL20RA10's  unique  combina- 
tion of  features  allows  it  lo  provide  a  single-chip  solution. 

The  PAL20RA10  used  here  generates  the  byte  data  select 
signals  (UUD,  UMD,  LMD,  and  LLD)  and  the  current  port 
size  select  signals  (DSACKO  and  DSACK1)  and  performs 
wait  state  generation.  The  designer  selects  the  data  bus  port 
size  and  the  number  of  wait  states  by  setting  DIP  switches 
(Tables  2  and  3). 

The  PAL20RA10  is  a  breakthrough  product.  It  does 
not  merely  extend  existing  capabilities;  it  adds  novel 
and  highly  useful  features.  It  is  also  the  first  product 
produced  by  our  company  that  originated  with  a  customer. 
Its  technical  and  market  performance  has  exceeded  our  ex- 
pectations and  has  proven  to  us  the  wisdom  of  listening  to 
customers.  As  a  result,  we  have  devised  ways  of  listening  to 
them  even  better,  si 

Questions  about  this  article  can  be  directed  to  Marc  A. 
Baker,  Strategic  Marketing,  Programmable  Products  Divi- 
sion, Monolithic  Memories,  Inc.,  2175  Mission  College 
Blvd.,  Santa  Clara,  CA  95054. 


Marc  Baker  is  a  strategic  marketing  engineer  at  Monolithic 
Memories,  Inc.,  where  he  is  responsible  for  the  merchandis- 
ing and  document  literature  for  programmable  products. 
He  holds  a  BSEE  from  Stanford. 

Vince  Coli  is  Monolithic  Memories'  strategic  marketing 
manager  in  the  Programmable  Products  Division.  He  was 
one  of  the  original  authors  of  MMI's  handbook  on  pro- 
grammable devices.  He  holds  a  MSEE  from  Santa  Clara 
University. 
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PLDs  Abound:  RAM-Based  Logic  Joins  In 


CMOS  Logic  Cell  Arrays,  only  recently  available  from  more  than  one  source,  are  almost 
as  useful  as  small  gate  arrays,  but  don 't  cany  the  risk  of  commiting  a  design  to  silicon. 





By  Marc  A.  Baker 

and  Chris  Jay 
Product  Applications 
Monolithic  Memories  Inc. 
Santa  Clara,  Calif. 

The  rise  of  advanced  CMOS  tech- 
nology has  dramatically  affected 
the  PLD  market.  Although  not 
as  fast  as  bipolar  devices,  CMOS  prod- 


the  circuit  by  deselecting  its  chip  se- 
lect input.  Large  EPROM  devices 
may  store  three  or  four  configura- 
tions, and  tying  the  higher-order  ad- 
dresses to  DIP  switches  allows  the 
user  to  select  one  of  a  given  number 
of  logic  functions.  Thus  one  card  may 
carry  a  large  EPROM  and  an  LCA, 
allowing  for  some  degree  of  de- 
ferred design. 

 .  


The  LCA  is  actually  a  PLD,  but  its  dense  CMOS  RAM 
enables  designers  to  configure  logic  cells  within 
the  device  and  to  specify  I/O  blocks. 


ucts  are  gradually  approaching  the  per- 
formance that  was  expected  of  bipolar 
about  two  years  ago. 

The  logic  designer  gains  two  signifi- 
cant benefits  from  the  latest  in 
CMOS:  greater  densities  for  more 
complex  logic,  and  reduced  power  con- 
sumption at  low  clocking  rates.  Since 
cutting  power  consumption  means 
smaller  power  supplies  and  heat 
sinks,  both  factors  reduce  the  size  of 
electronic  equipment. 

Most  CMOS  parts  can  be  backed  up 
by  batteries,  making  them  useful  for 
RAM-based  devices  such  as  the  Logic 
Cell  Array  (LCA). 

Although  the  LCA  is  based  on  a  dense 
RAM  architecture,  the  array  itself  is  not  a 
conventional  RAM-based  device.  Actual- 
ly, it's  a  PLD,  but  the  dense  CMOS  RAM 
enables  the  designer  to  configure  logic 
cells  within  the  device  and  to  specify  VO 
blocks.  The  interconnected  logic  cells, 
called  CLBs  (configurable  logic  blocks) 
and  IOBs  (input/output  blocks)  allow  com- 
plex circuits  to  be  constructed  on-chip. 
And  the  LCA's  battery  backup  powers 
down  to  save  data. 

Its  dense  architecture  enables 
complex  VLSI  functions  to  be  pro- 
grammed in  the  cell  array.  Also,  the 
random  glue  logic  that  would  other- 
wise ride  on  system  cards  is  incorpo- 
rated into  the  array.  Because  the 
LCA  is  RAM  based,  it  must  be  config- 
ured on  power-up  from  a  non-volatile 
source.  Configuration  data  can  .be 
read  from  a  CMOS  EPROM  on  a 
byte-wide  basis.  When  the  reading  is 
done,  the  EPROM  is  switched  out  of 


Dual-layer  metal,  advanced  twin- 
well  CMOS  has  produced  a  program- 
mable logic  device  that  can  extend  the 
PLD  into  the  area  of  low-end  gate 
arrays. 

Any  custom  or  semicustom  device 
must  be  tested,  and  PLDs  are  no  ex- 
ception. But  since  they  are  only  func- 
tionally configured,  functional  testing 
offers  a  high  degree  of  reliability. 
Such  testing  can  be  done  with  the  pro- 
grammer, immediately  after  pro- 
gramming. Production  volumes  can 
be  taken  care  of  simply  by  using  an 
automatic  handler. 

An  alternative  to  user  program- 
ming is  factory  programming:  Manu- 
facturers can  furnish  factory  pro- 


gramming or  can  create  custom  masks 
for  hard-wired  versions  of  the  pro- 
grammed pattern. 

PLDs  also  are  less  expensive  than 
other  alternatives.  Examining  chip 
price  alone,  both  mass-produced  SSI 
devices  and  custom  parts  look  very 
good.  But  other  costs,  such  as  inven- 
tory handling  and  non-recurring  engi- 
neering (NRE)  charges  must  be  taken 
into  account. 

One  way  to  estimate  the  cost  of 
such  items  as  inventory,  testing  and 
board  space  is  to  add  about  $1  per 
chip.  Thus,  if  five  SSI  chips  cost  only 
25  cents  each,  they  actually  cost  the 
user  $1.25  each,  or  $6.25.  Those  five 
devices  can  therefore  be  economical- 
ly replaced  by  a  $5  PLD.  Today, 
many  PLDs  cost  less  than  the  devices 
they'll  replace,  and  prices  will  con- 
tinue to  drop. 

A  gate  array  can  replace  multiple 
PLDs,  but  creating  a  gate  array  de- 
mands a  large  NRE  charge  for,  say, 
producing  masks  and  generating  test 
programs.  The  NRE  charges  for  a 
large  gate  array,  including  design 
costs,  masks,  a  possible  design  fix 
and  test  programs,  can  be  tens  of 
thousands  of  dollars.  A  PLD  has  no 
NRE  charges. 

With  the  LCA,  testability  becomes 
an  interesting  issue — since  the  de- 
vice is  RAM  based,  test  programs  can 
be  loaded  into  it.  Once  configured  for 
testing,  test  vectors  may  be  applied 


IMPLEMENTATION  COST  COMPARISON 


Cost 

PLD 

SSI 

Gate  Array 

Price 

Medium 

Very  low 

Low 

Inventory  & 
Handling 

Medium 

Medium 

Medium 

NRE 

None 

None 

Very  High 

Design  Time/ 
Equipment 

Low 

Low 

Medium-High 

Design 
Changes 

Medium 

Medium 

High 

Test 

Very  Low 

Very  Low 

High 

Quantity  Required 
Per  Application 

Low-Medium 

nign 

Low 

SSI  chips  still  cost  the  least  per  unit.  But  since  more  are  needed,  circuit  size  can  be  limiting. 
Gate  arrays  cost  less  than  PLDs,  but  their  high  NRE  can  make  the  difference  In  the  long  run. 
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to  determine  11  tne  logic  cens  aim 
blocks  are  functioning  properly.  For 
example,  when  configuring  an  LCA 
with  buried  diagnostic  shift  regis- 
ters, serial  diagnostic  data  may  be 
shifted  through  the  system  to  a  seri- 
al output  port,  where  the  data  would 
be  available  for  analysis.  When  a 
complete  system  is  configured  for  di- 
agnostic testing,  internal  nodes  that 
were  previously  inaccessible  become 
available  via  a  serial  diagnostic 
scan  loop. 

When  the  LCA  serves  as  part  of  a 
larger  system,  it  can  be  initialized 
with  buried  shift  registers  perform- 
ing functional  tests  on  the  system 
itself.  Once  testing  is  complete,  the 
LCA  can  be  set  up  for  the  required 
function. 

Suppliers 

In  1978,  only  two  suppliers  were 
producing  PLDs.  Now  there  are  more 
than  a  dozen,  with  several  sources 
for  the  most  popular  parts.  Higher- 
volume  production  and  second  sourc- 
ing  alleviate  concerns  with  factory- 
programmed  devices,  like  gate 
arrays.  Control  of  the  design  is  re- 


encoding/decoding,  terminal  adapter 
interfaces,  CRC  error  correction, 
frame  synchronization  and  other  ap- 
plications. 

Since  the  LCA  is  reconfigurable,  it's 
possible  to  modify  the  logic  after  de- 
sign. In  telecommunications,  that 
means  that  functions  can  be  changed  to 
accommodate  different  communica- 
tions standards  and  protocols,  or  both, 
without  resorting  to  separate  logic  cir- 
cuits for  each. 

Tomorrow's  Market 

Tomorrow's  market  will  be  driven 
more  by  users  rather  than  suppliers. 
Applications  beyond  computers  will 
be  found,  and  engineers  will  push  for 
the  required  technology.  Moreover, 
users  will  drive  product  definitions. 
Software  will  be  an  integral  part  of 
the  design  solution  and  will  require 
more  adaptation  to  a  user's  design 
environment.  An  important  factor  in 
the  increasing  role  of  the  user  in  the 
PLD  market  is  the  fact  that  PLDs  are 
becoming  part  of  up-front  system  de- 
sign, not  just  last-minute  replace- 
ments of  discrete  logic. 

New  settings  for  PLDs  in  high-speed 


Since  the  device  is  RAM  based,  test  programs  can  be 
loaded  into  it,  and  vectors  can  be  applied  to  see  if  the 
logic  cells  and  I/O  blocks  are  functioning  properly. 


turned  to  the  customer. 

In  addition,  the  customer  has 
choices  not  only  in  silicon  but  in  ser- 
vice and  support.  Manufacturers  have 
varying  types  of  quality  control,  test 
procedures,  reliability  verification, 
software  and  technical  support,  price 
ranges  and  delivery  capabilities.  As  a 
recent  survey  in  EE  Times  showed, 
these  issues  are  very  important.  Qual- 
ity control  was  the  primary  thing  want- 
ed in  a  supplier,  and  software  support 
was  the  second  most  important  issue. 

Telecom  Applications 

Because  CMOS  devices  are  not  power 
hungry,  they  are  welcomed  in  telecom- 
munications. 

Low  power  consumption  in  modems, 
fax  machines  and  the  like  means  more 
features  and  more  functions  without  an 
increase  in  size. 

Further,  in  central  switching  sta- 
tions, units  must  be  kept  compact 
because  many  line  interface  cards 
are  likely  to  placed  in  one  rack,  and 
many  racks  might  share  one  cabinet. 

The  LCA  can  put  its  talents  to  good 
use  in  telecommunications,  includ- 
ing time-division  multiplexing,  line 
encoding/decoding  and  corner  bend- 
ing circuits.  Also,  zero-standby-pow- 


processors,  superminis  and  military 
equipment  are  fueling  advances  in  bi- 
polar speeds.  In  the  past,  the  fastest 
digital  systems  could  not  employ  pro- 
grammable logic,  but  today's  PLDs  are 
matching  and  surpassing  their  speed- 
iest rivals. 

CMOS  technology  is  being  forced 
to  meet  the  demands  of  telecom- 
munications, as  well  as  adapt  to  in- 
dustry and  instrumentation.  Such 
equipment  calls  for  low  power  and 
immunity  to  temperature,  power  and 
noise  fluctuations.  CMOS  promises 
to  meet  those  needs,  although  most 
CMOS     PLDs     today  simply 


Users  are  also  demanding  more  ap- 
plication-specific PLDs.  At  first  ap- 
pearing to  be  redundant,  applica- 
tion-specific PLDs  are  devices  whose 
architecture  is  aimed  at  a  particular 
application.  Such  architectures  may 
boast  bus  interface  controllers,  sig- 
nal generators  and  decoders.  True, 
these  devices  are  limited  to  specific 
tasks,  but  will  serve  better  than 
their  generic  kin.  Suppliers  are  tak- 
ing risks  in  pursuing  limited  mar- 
kets, but  the  clear  winner  is  the  user 
who  will  soon  have  devices  tailored 
to  his  needs. 

Recently,  there's  been  an  explosion 
in  the  number  of  PLD  architectures 
and  the  amount  of  support  software. 
The  first  PLDs  were  programmed  by 
manually  encoding  the  individual 
fuses,  which  then  had  to  be  blown. 
Later,  software  was  supplied  that 
automatically  converted  Boolean 
equations  into  programming  pat- 
terns. Now,  software  can  use  a  num- 
ber of  design  descriptions  (equations, 
schematic  capture,  truth  tables,  etc.) 
and  can  simulate  the  design  and  cre- 
ate test  vectors  in  addition  to  the  pro- 
gramming pattern. 

A  number  of  support  packages  are 
on  hand  to  give  the  engineer  the 
CAD  facilities  that  ensure  a  success- 
ful design.  Most  of  these  packages 
play  on  an  IBM-PC  and  start  with 
assemblers  that  convert  Boolean 
equations  into  fuse  plots  suitable  for 
programming  into  the  PLD. 

More  complex  devices  require 
greater  levels  of  software  support,  so 
state  machine  entry  is  made  possible 
by  many  PLD  languages.  PLD  assem- 
blers usually  need  not  only  Boolean 
literacy  from  system  designers  but  also  a 
detailed  understanding  of  device  architec- 
ture. State  machine  entry  does  not  require 
such  a  high  degree  of  understanding,  so 
for  these  applications  the  logic  designer 
can  often  use  a  descriptive  rather  than  a 
mathematical  language. 

User-friendly  software  tends  to  make 
an  architecture  transparent,  and  state 
machine  entry  ensures  rapid  turnaround. 
Small  PLD  designs  can  be  turned  around 


Users  are  also  demanding  more  application-specific 
PLDs.  At  first  appearing  to  be  redundant,  application- 
specific  PLDs  are  devices  whose  architecture  is 
aimed  at  a  particular  application. 


match  low-power  bipolar  specs.  To- 
morrow's CMOS  PLDs  will  offer  the 
zero  standby  power,  wider  operating 
conditions  and  dc  characteristics  of 
standard  CMOS  products. 


in  minutes,  and  more  complex  devices  in 
several  hours. 

The  software  of  tomorrow  will  be 
more  responsive  to  users'  needs,  allow- 
ing for  a  multitude  of  design  descrip- 
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tions  while  not  limiting  the  implemen- 
tation to  a  particular  device.  Along 
with  device  independence  will  come  the 
ability  to  handle  multiple  devices,  with 
workstations  offering  the  highest  level 
of  support. 

Programming  equipment  must  also  do 
more  than  just  generate  waveforms.  Ad- 


en by  manufacturers,  tomorrow's  will 
be  guided  more  by  the  needs  of  the 
logic  designer.  As  new  applications 
arise,  pressure  will  be  put  on  both 
technology  and  architecture.  Commu- 
nication with  the  customer  will  as- 
sume new  importance,  since  it  defines 
the  right  combinations.  In  addition, 


Just  as  today's  market  is  driven  by  manufacturers, 
tomorrow's  will  be  guided  by  designers.  As  needs  arise, 
pressure  will  be  on  technology  and  architecture. 


vanced  programmers  will  provide  many 
test  features,  including  complete  func- 
tional testing.  Or,  programmers  will  be 
linked  to  more  sophisticated  testers. 

PLD  suppliers  are  increasing  the 
quality  of  their  products,  while  adding 
test  features  to  verify  that  quality. 
Power-up,  reset  and  register  preload- 
ing simplify  testing,  but  the  program- 
mer must  be  able  to  take  advantage  of 
them. 

Just  as  today's  PLD  market  is  driv- 


software  from  third-party  vendors 
will  make  programmable  logic  more 
at  home  across  the  board. 

PLDs  will  continue  to  be  used  along- 
side discrete  logic,  fixed  LSI  functions, 
and  gate  arrays.  But  as  the  market 
matures,  the  benefits  of  the  PLD  over 
other  options  will  become  greater.  In 
time,  programmable  logic  will  continue 
to  enjoy  a  higher  growth  rate  than  the 
overall  IC  industry.  ■ 
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A  look  at  the  architectural  differences 
between  PALs  and  other  programmable  logic  devices 


Vincent  J.  Coli 


PROGRAMMABLE  LOGIC  devices  are 
integrated  circuits  that  hardware  designers 
can  program  to  perform  specific  logic 
functions.  Most  PLD  functions  are  avail- 
able from  many  vendors  and  in  several 
technologies  with  different  speed,  power, 
and  cost  options.  As  with  standard  7400 
chips,  PLDs  are  available  off  your  local 
d  stributor's  shelf.  PLDs  offer  one  distinct 
advantage  over  standard  7400  discrete 
logic:  They  are  user-programmable. 

Most  PLDs  consist  of  two  arrays  of 
logic  gates— an  AND  array  followed  by  an 
OR  array.  The  input  signals  to  a  PLD  must 
first  pass  through  an  array  of  AND  gates 
where  combinations  of  the  input  signals 
are  formed.  Each  group  of  AND  com- 
binations is  called  a  minterm  in  Boolean 
algebra  or  a  product  line  in  PLD  nomen- 
clature. Then  the  product  lines  are 
summed  in  an  array  of  OR  gates.  The  in- 
put buffers  generate  both  the  true  and 
complement  of  the  input  signals. 

Three  basic  types  of  AND/OR  array- 
based  PLDs  exist:  programmable  read- 
only memories  (PROMs),  programmable 
logic  arrays  (PLAs),  and  programmable 
array  logic  (PAL)  devices.  The  types  are 
distinguished  by  the  programmability  of 
their  arrays. 

In  a  PROM,  the  AND  array  is  fixed  and 
the  OR  array  is  programmable.  In  a  PLA, 
both  arrays  are  programmable.  PAL 
devices  have  a  programmable  AND  array 
and  a  fixed  OR  array.  I  will  compare  the 
PAL  device  to  the  PLA  and  PROM  and 
then  examine  the  architecture  of  some 
commonly  used  PAL  devices.  For  a  brief 
h  story  of  the  PAL  device,  see  the  text  box 


"Evolution  of  PALs"  by  John  Martin 
Birkner  on  page  208. 

PROMs 

While  most  people  think  of  PROMs  as 
devices  for  storing  fixed  programs  and 
memory,  the  PROM  is  also  ideal  for  logic 
applications  requiring  less  than  10 
inputs— especially  when  many  product 
lines  are  required.  PROMs  designed  as 
logic  devices  are  usually  referred  to  as 
PLEs  (programmable  logic  elements). 

Figure  1  shows  the  PROM's  fixed- 
AND/programmable-OR  arrays.  For  a 
discussion  of  notation  used  to  describe 
PLD  devices,  see  the  text  box  "PLD 
Notation  Panel"  on  page  210.  Every  input 
combination  is  available  in  the  AND  ar- 
ray, whether  that  combination  of  inputs  is 
required  or  not.  Since  the  AND  array  is 
hard-wired,  it  is  not  possible  to  perform 
logic  minimization  between  input  com- 
binations. 

The  OR  array  is  programmed  to  select 
the  AND  gate  combinations  (or  product 
lines).  Since  every  OR  gate  is  connected 
to  each  product  line,  outputs  may  share 
product  lines.  For  those  familiar  with 
memory  design,  the  fixed-AND  array  is 
often  called  the  address  decoder,  while  the 
programmable-OR  array  stores  the  mem- 
ory bits.  Another  way  of  looking  at  this 
is  that  PROMs  store  the  logic  transfer 
function  as  a  lookup  table  in  memory. 

The  advantage  of  PROMs  is  that  every 
input  combination  can  be  decoded.  The 
disadvantage  is  that  the  number  of  input 
pins  available  is  restricted  because  the  ar- 
ray size  must  be  doubled  for  each  addi- 


tional input.  The  arithmetic  works  like 
this:  A  PROM  with  h  inputs  and  m  out- 
puts requires  an  OR  array  of  2"  lines  deep 
by  m  lines  wide.  For  example,  a  PROM 
with  10  inputs  and  8  outputs  requires  an 
OR  array  of  2 10  by  8  or  8192  fuse  loca- 
tions. An  11th  input  would  require  that  the 
array  size  be  doubled  to  16,384.  Cost  and 
performance  constraints  limit  PROMs  to 
13  inputs  and  8  outputs.  PROMs  designed 
specifically  for  logic  applications  feature 
either  5  or  6  inputs  and  16  outputs. 

PLAs 

The  PLA  structure  offers  the  highest  level 
of  flexibility  because  both  arrays  are  pro- 
grammable. Figure  2  shows  the  PLA's 
programmable-AND/programmable-OR 
structure.  Because  their  OR  arrays  are 
programmable,  PLAs,  like  PROMs,  can 
share  product  terms  among  outputs.  For 
example,  one  product  line  would  be  saved 
if  two  outputs  required  the  same  input 
combination  (i.e.,  product  line). 

Programmability  in  the  AND  array 
removes  the  restriction  found  in  PROMs 
that  the  AND  array  must  be  large  enough 
to  provide  all  possible  input  combinations. 
This  works  because,  statistically,  only  a 
continued 
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The  Evolution  of  PALs 


Computers  used  to  be  constructed 
from  SSI.  MSI,  PROM,  and  RAM 
chips  connected  injigsaw-puzzle  fashion 
on  many  printed  circuit  boards  plugged 
into  a  connector  backplane.  The  com- 
puter designer's  task  was  to  build  a  func- 
tional unit  such  as  a  processor,  disk  con- 
troller. I/O  controller,  or  memory  board. 
If  the  design  overflowed  onto  another 
board,  connectors  and  ribbon  cables  had 
to  be  added.  This  made  the  design  more 
expensive  and  sometimes  risky  due  to 
noise  coupling.  The  name  of  the  game 
was  to  get  it  all  on  one  board. 

Mixing  and  Matching  TTL  Chips 

Designers  who  had  studied  switching 
theory,  information  theory.  Boolean 
algebra,  and  Quine-McCluskey  mini- 
mization at  college  soon  found  that  their 
textbooks  would  not  be  of  much  use. 
They  learned  that  the  practical  art  of 
computer  design  did  not  consist  of  op- 
timizing an  architecture  with  an  or- 
thogonal instruction  set.  It  consisted  of 
mixing  and  matching  the  collage  of  exist- 
ing TTL  chips  onto  a  single  board  until 
an  approximation  of  the  design  goal  was 
reached.  They  did  not  design  state- 
control  sequence  logic  from  top-down 
state-graph  theory,  but  rather,  slapped 
down  a  74174  hexadecimal  register  and 
some  7400  NAND  gates.  Control-logic 
design  theories  usually  consisted  of 
following  signal  lines  around  the  logic 
schematic  until,  through  superhuman 
powers  of  concentration,  designers 
achieved  clarity. 

Designers  found  the  information  they 
needed  in  the  catalogs  of  young  semi- 
conductor companies  in  California. 
Arizona,  and  Texas.  A  favorite  was  The 
TTL  Daiabook  by  Texas  Instruments. 
Most  logic  designers  believed  that 
74-series  TTL  parts  found  in  this  book 
would  be  second-sourced  and  could  be 
"designed  in." 

A  processor  design  would  begin  with 
the  block  diagram  consisting  of  an  ALU. 
data  path  and  register  file,  microprogram 
memory  and  sequencer,  and  then  a  small 
and  obscure  block  called  "control  logic." 
It  might  have  been  a  small  block  on  the 
diagram,  but  the  control  logic  usually 
represented  the  majority  of  the  chip 
count. 

The  control  logic  consisted  of  SSI/MSI 
gates  and  flip-flops  connected  together  in 
random  fashion,  and  there  seemed  to  be 
no  way  to  reduce  it.  The  control  logic 
also  represented  the  area  of  highest 
design  errors  and  was  easily  recognizable 


John  Martin  Birkner 

on  the  printed  circuit  board  as  the  area 
with  all  the  "cuts  and  jumpers."  The 
engineering  change  notice  (ECN)  was  the 
standard  remedy  for  such  errors  and  was 
a  constant  source  of  agony  between 
manufacturing  and  engineering.  Manu- 
facturing would  use  yellow  wires  to  stand 
out  on  the  green  PC  board.  Engineering 
would  use  green  jumper  wires  to  camou- 
flage embarrassing  mistakes. 

The  engineering  manager  would  "pilot 
release"  the  current  revision  PC  board 
as  soon  as  the  green  wire  count  was  low 
enough  to  pacify  manufacturing.  The 
design  engineers  would  then  flee  to  the 
next  design,  where  they  were  expected 
to  cram  even  more  functions  onto  the 
single  PC  board  to  beat  the  competition's 
new  threat. 

I  was  convinced  that  there  must  be  a 
better  way  to  build  computers.  So.  in 
1975,  I  packed  my  bags  and  headed  for 
Silicon  Valley.  I  remember  seeing  the  first 
single-chip  microprocessor  systems  on 
the  market.  They  had  one  microcomputer 
chip  surrounded  by  a  sea  of  over  100 
SSI/MSI  chips.  The  new  LSI  chips 
needed  either  some  good  planning  so  that 
they  could  talk  to  each  other  or  some 
good  "glue  chips"  to  hook  them  up. 

The  PROM .  pioneered  by  Harris  and 
Monolithic  Memories,  showed  some 
promise  as  a  universal  and  general- 
purpose  glue  logic  element.  Applications 
like  memory-address  decoding  began 
showing  up  for  the  32-word  by  8-bit 
PROM.  National  Semiconductor  pio- 
neered the  programmable  logic  array 
(PLA)  in  a  14-in.  8-out.  96-product-lerm. 
24-pin  fat  (0.6  inch  wide)  DIP.  bench- 
marked  for  96-character  EBCDIC-to- 
ASCII  conversion. 

Intersil  made  a  field  programmable 
logic  array,  or  FPLA.  in  the  National  pin- 
out,  but  with  about  half  the  product  terms 
at  48.  Signetics  increased  the  package 
pins  to  28.  making  the  16-in.  8-out. 
48-product-term  82S100  FPLA.  These 
first  attempts  at  providing  the  computer 
designer  with  LSI  glue  were  met  with 
mild  enthusiasm.  The  new  glue  chips 
were  too  big  (fat  DIPs)  and  were  slow, 
expensive,  and  hard  to  use. 

Monolithic  Memories  was  the  first 
company  to  take  advantage  of  the  bipolar 
fuse-link  PROM  technology  to  make 
some  fast  little  FPLAs.  as  we  first  called 
them.  We  put  them  in  industry-standard 
20-pin  skinny  (0.3  inch)  DIPs.  for 
minimum  PC  board  area.  We  also  re- 
duced the  two  programmable  arrays  down 
to  one  for  35-ns  high-speed  operation  and 


lower  cost.  We  mimicked  the  TTL  data- 
sheet specs  down  to  the  same  ter- 
minology, graphics,  and  printing  style  to 
make  the  computer  design  engineer 
secure  in  replacing  old  74-series  TTL 
chips.  We  added  programmable  three- 
state  output  enable  for  I/O  pin  allocation. 
We  added  output  registers  with  feedback 
for  direct  implementation  of  state- 
machine  control  logic  from  state  graphs. 

We  designed  the  programming  algo- 
rithm to  be  compatible  with  existing 
PROM  programmers,  making  low-cost 
programming  possible.  The  first  PAL 
programming  module  had  a  PAL  in  it. 
This  presented  a  chicken-and-egg  prob- 
lem that  we  solved  by  emulating  with 
some  PROM  and  SSI  chips.  The  first 
PAL  to  be  programmed  was,  of  course, 
the  pattern  for  the  PAL  programmer 
module.  I  headed  the  project,  specified 
the  design,  and  sold  the  customers.  H.  T. 
Chua  provided  a  clever  and  ingenious  cir- 
cuit design. 

New  Design  Methodology 

The  new  chips  required  a  new  design 
methodology.  Actually,  it  was  the  same 
method  that  we  learned  in  school,  so  we 
had  to  drag  out  our  old  textbooks  and 
releam  Boolean  logic  and  top-down  state- 
machine  design.  We  showed  the  designer 
how  to  "design  your  own  chip"  using 
Boolean  logic  equations.  We  wrote  the 
first  silicon  compiler.  PALASM  (PAL 
Assembler),  and  published  the  FOR- 
TRAN source  in  the  PAL  Handbook 
(available  from  McGraw-Hill),  along 
with  numerous  design  examples. 

The  PAL  chips  replaced  SSI/MSI  chips 
at  a  chip-count  reduction  of  5  to  1.  Data 
General  gambled  on  the  new  single- 
sourced  chips  by  designing  them  into  the 
MV8000  computer  (see  The  Soul  of  a 
New  Machine  by  Tracy  Kidder). 

Apple  put  six  PAL  chips  in  the  Macin- 
tosh. Soon  the  PAL  chips  were  no  longer 
single-sourced,  as  National  Semiconduc- 
tor, Texas  Instruments,  Advanced  Micro 
Devices,  and  others  joined  in  licensing 
the  now-patented  PAL  chips  from  MMI. 
Now  you  can  find  these  chips  every- 
where. Look  at  the  PC  expansion  boards 
in  this  magazine;  you  can  recognize  PAL 
chips  by  their  easy-to-read  part-number 
system  (e.g..  PAL16L8  and  PALI6R8). 


John  Martin  Birkner  is  president  of  Struc- 
tured Design  Inc.  and  coinventor  of  the 
programmable  array  logic  (PAL)  device. 
He  can  be  contacted  at  988  Bryant  Wa\. 
Sunnyvale.  CA  94087. 
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Figure  A:  Differences  in  ihe  logic       Figure  B:  The  partially  programmed 
notation  for  a  TTL  AND  gate  and  a     product  line  to  implement  A*B. 
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Figure  C:  Portion  of  the  PLD  array  programmed  to  implement  A*B. 
Having  all  fuses  intact  in  the  second  product  line  causes  a  logic  zero  to  be 
input  to  the  second  AND  gate,  which  does  not  contribute  to  the  sum  at  the 
OR  gate. 
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Figure  D:  The  PAL  device  implementation  of  the  function  /Output  =  A*B 
+  /A*B.  Tlte  standard  combinatorial  logic  diagram  of  the  function  /Output 
=  A*/B  +  /A*B  is  shown  in  the  inset  to  figure  D  above. 


Because  PLD  structures  are  much 
different  from  ordinary  TTL  gates, 
new  logic  notations  were  developed  for 
them.  Figure  A  shows  the  logic 
convention  adopted  for  a  three-input 
AND  gate.  The  PLD  representation  for 
an  AND  gate  is  called  a  "product  line." 
Note  that  the  three  vertical  lines  are  the 
inputs  (A,  B,  and  C),  which  are  con- 
nected to  the  AND  gate  inputs  through 
fuse  links.  An  unprogrammed  (or  closed) 
fuse  link  is  represented  by  an  X  at  the 
intersection  of  an  input  line  with  a 
product  line.  If  you  wanted  to  disconnect 
one  of  those  inputs  from  AND  gate  C, 
for  example,  you  would  remove  the 
appropriate  X  from  the  point  of  inter- 
section for  the  C  input  line  with  the 
product  line  to  signify  a  programmed  (or 
open)  fuse  link.  This  product  line,  which 
now  implements  the  A*B  function,  is 
shown  in  figure  B. 

Since  every  input  is  available  to  every 
product  line  in  a  PLD,  it  is  convenient 
to  show  the  input  lines  as  long  lines 
running  vertically  through  the  array. 
Also,  two  input  lines  are  associated  with 
each  input  pin  because  both  input 
polarities  are  available  in  a  PLD.  There- 
fore, the  input  buffer  is  shown  with  both 
a  noninverted  (true)  and  inverted  (com- 
plement) output  path;  each  path  is  hard- 
wire connected  (shown  as  a  dot)  to  an 
input  line. 

Figure  C  shows  a  portion  of  a  PLD 
array  illustrating  the  input  lines  and 
buffers.  Notice  tha'  an  OR  gate  is  added 
to  the  structure.  Ail  the  fuse  links  in  the 
lower  product  line  are  left  intact,  leaving 
the  product  line  in  a  logic  low  (since  true 
inputs  are  ANDed  with  complements), 
while  appropriate  fuse  links  in  the  upper 
product  line  are  programmed  to  imple- 
ment the  A*B  function  from  figure  B. 

It  is  common  to  implement  two  or 
more  levels  of  logic  gates  such  as  an 
AND/OR/invert  circuit  in  a  PLD.  For 
example,  consider  the  following  function 
implemented  in  a  PAL  dev  ice: 

/Output  =  A*/B  +  /A*B. 

Shown  in  figure  D  are  the  standard  com- 
binatorial logic  diagram  (see  inset)  and 
the  PAL  logic  equivalent  for  this 
function. 

Notice  the  details  added  to  figure  D 
that  magnify  the  programmed  fuse  link 
for  B  in  the  upper  product  line.  This 
magnification  details  each  fuse  link  and 
its  associated  diode  for  a  bipolar  PAL 
device.  A  CMOS  PAL  device  is  similar, 
except  an  ultraviolet  cell  would  substitute 
for  the  fuse  link. 
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limited  number  of  product  terms  is  re- 
quired in  any  equation.  Eliminating 
redundant  combinations  with  logic 
minimization  techniques,  such  as  Kar- 
naugh maps,  can  reduce  the  required 
number  of  product  terms  even  more. 
Therefore,  almost  any  combination  of  in- 
puts can  be  decoded  in  a  PLA. 

PLAs  were  the  first  products  offered 
specifically  for  logic  applications.  Due  to 
programming  limitations,  early  PLAs 
were  available  only  in  mask-programmed 
versions.  Just  like  on  a  ROM.  a  logic 
designer  would  indicate  on  the  vendor's 
PLA  AND/OR  logic  map  where  the  de- 
sired connections  were  to  be  made.  The 
vendor  would  then  tool  up  a  custom  metal 
mask  for  the  PLA  to  implement  the 
customer's  logic.  Today,  most  PLAs  are 
user-programmable.  However,  mask-pro- 
grammed PLA  structures  are  used  often 
in  the  control  section  of  LSI/VLSI  stan- 
dard logic  chips,  such  as  microprocessors, 
and  offered  in  standard-cell  libraries. 

In  the  world  of  engineering,  there  are 
always  compromises.  The  facts  reveal  that 
a  performance  and  silicon-die  size  penalty 
must  be  realized  to  provide  the  flexibility 
of  programming  both  arrays.  PAL  devices 
are  generally  5  to  10  nanoseconds  faster 
than  PLAs  at  the  same  power  level  and 
save  the  silicon  area  required  to  program 
and  verity  the  second  array.  It  turns  out 
that  the  flexibility  of  a  programmable-OR 
array  is  not  required  for  most  PLD  ap- 
plications, but  it  can  be  useful  for  com- 
plex state-machine  and  sequencer  applica- 
tions. 

Because  of  the  long  history  of  PLAs, 
their  nomenclature  can  be  a  little  confus- 
ing. Early  vendors  of  user-programmable 
PLAs  called  their  products  FPLAs  to 
highlight  their  "field  programmability" 
and  to  distinguish  FPLAs  from  factory 
mask-programmed  PLAs.  Just  as  ROMs 
and  PROMs  could  be  easily  distinguished, 
so  could  PLAs  and  FPLAs.  However, 
since  most  of  the  PLAs  offered  today  are 
programmed  by  the  customer,  many  ven- 
dors have  dropped  the  F  prefix  and  sim- 
ply call  them  PLAs.  Furthermore,  PLAs 
designed  for  sequencer  applications  are 
called  PLSs  (programmable  logic  se- 
quencers). 

PAL  Devices 

Figure  3  shows  the  programmable- 
AND/fixed-OR  array  structure  of  a  PAL. 
As  with  the  PLA,  having  the  AND  array 
programmable  lets  the  user  program  only 
the  desired  input  combinations.  But  fix- 
ing the  OR  array  requires  that  certain 
product  lines  be  tied  to  specific  outputs— 
typically,  eight  product  lines  per  output. 

Many  people  use  PAL  and  PLD  synony- 
mously. Several  PLD  vendors  add  an  E 
prefix  to  PLD.  to  come  up  with  EPLD, 


which  signifies  ultraviolet  erasable  PLDs. 
Just  as  there  are  PROMs  and  EPROMs. 
now  there  are  PLDs  and  EPLDs. 

The  name  HAL  (hard  array  logic), 
refers  to  mask-programmed,  or  ROM. 
versions  of  PAL  devices.  If  the  volume  of 
devices  needed  were  large,  converting  a 
design  to  a  HAL  might  be  appropriate 
once  the  design  is  thoroughly  debugged 
with  the  PAL. 

While  all  PAL  devices  are  characterized 
by  a  programmable-AND/fixed-OR  array 
structure,  there  is  a  whole  line  of  PAL 
devices  with  different  options.  They  come 
with  varying  numbers  of  inputs  and  out- 
puts. They  might  have  feedback  paths 
from  the  output  back  to  the  array.  Some 
of  these  pins  can  be  programmable  I/O 


pins.  They  can  have  active-high  or  active- 
low  outputs,  or  the  output  polarity  might 
be  programmable  via  an  XOR  gate  and  a 
fuse.  Some  come  with  registers  at  their 
outputs  and  are  good  for  making  sequen- 
tial circuits.  Let's  look  at  two  commonly 
used  PAL  devices:  the  16L8  and  the  16R8. 

The  PALI6L8 

One  popular  combinatorial  PAL  is  the 
PAL16L8  (figure  4).  Notice  how  the  pins 
on  the  left  side  and  bottom  of  the  logic 
diagram  (pins  1  to  9  and  pin  11)  are  used 
for  inputs  and  the  pins  on  the  right  (pins 
12  to  19)  are  available  as  outputs.  Pins  12 
and  19  can  be  used  only  as  outputs,  but 
six  of  the  outputs  (pins  13  to  18)  are  also 
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Figure  1:  A  simplified  diagram  of  the  fixed-AND/programmable-OR  array 
structure  of  the  PROM.  Figures  in  this  article  are  reprinted  from  Monolithic 
Memories'  PAL  Handbook  (3rd  ed.)  with  permission  from  Monolithic  Memories. 
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Figure  2:  A  simplified  diagram  of  a  PLA  showing  that  both  arrays  are 
programmable. 


available  as  inputs  via  the  feedback  line 
connection  after  the  inverting  output  buf- 
fer. This  feature,  called  programmable 
I/O.  lets  the  user  program  each  of  these 
six  pins  to  be  either  an  input  or  output. 
I'll  discuss  programmable  I/O  in  more 
detail  later  on.  Now  the  PAL16L8  part- 
numbering  scheme  should  be  a  little  more 
obvious;  16  signifies  the  maximum 
number  of  potential  inputs  (10  dedicated 
inputs  and  6  programmable  I/O),  while  8 
signifies  the  number  of  outputs  and  L 
signifies  the  output  type,  which  is  active 
low  for  this  PAL  part  type. 

Refer  to  the  logic  diagram  in  figure  4 
and  you'll  see  that  the  vertical  lines  run- 
ning through  the  array,  numbered  0 
through  31,  are  the  input  lines.  Notice  that 
each  input  or  I/O  pin  is  associated  with 


two  input  lines;  one  input  line  is  connected 
to  the  true  (or  noninverted)  sense  of  the 
input  buffer,  while  the  other  input  line  is 
connected  to  the  complement  (or  inverted) 
sense.  This  allows  availability  of  both 
input-signal  polarities  to  the  array. 

The  horizontal  lines  running  through 
the  array,  numbered  0  through  63,  are  the 
product  lines.  You  can  think  of  each  of 
these  product  lines  as  an  AND  gate  with 
32  inputs,  which  corresponds  to  the  total 
number  of  input  lines.  Actually,  both  the 
true  and  complement  of  every  input  signal 
are  connected  via  fuses  to  each  product 
line  before  the  device  is  programmed. 
This  is  the  programmable-AND  array  in 
the  PAL  structure.  To  program  the  array, 
the  user  selects  different  combinations  of 
continued 
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Figure  3:  A  simplified  diagram  showing  the  programmable-AND/fixed-OR  array 
structure  of  a  PAL  device. 


input  signals  by  disconnecting,  via  the 
blown  fuse,  the  unwanted  input  signals  in 
a  product  line.  In  total,  2048  fuses  are 
available  in  this  PAL  device  (64  product 
lines  by  32  input  lines). 

Notice  that  each  output  pin  has  eight 
product  lines  associated  with  it.  The  lower 
seven  product  lines  of  each  group  are 
summed  at  the  OR  gate,  while  the  upper 
product  line  is  connected  to  the  inverting 
output  buffer.  The  lower  seven  product 
lines  and  the  OR  gate  provide  the  sum- 
of-products  logic  power  for  the  PAL 
device.  The  OR  gate  determines  whether 
any  of  the  product  lines  are  active,  or  true, 
and  then  the  output  buffer  inverts  the 
signal  from  the  OR  gate  for  output.  Note 
that  a  product  line  with  all  fuses  left  in- 
tact will  not  affect  the  sum  at  the  OR  gate. 


since  the  logical  result  of  each  input 
ANDed  with  its  complement  is  false. 

Programmable  I/O 

This  upper  product  line  associated  with 
each  output  controls  the  three-state  logic 
in  the  output  buffer.  When  this  product 
line  is  active,  or  true,  the  output  is  enabled 
and  the  sum-of-products  logic  determines 
the  output  state.  However,  when  this  prod- 
uct line  is  inactive,  or  false,  the  output  is 
disabled  with  the  three-state  buffer  in  the 
high-impedance  state.  This  lets  the  out- 
put pin  drive  a  three-state  bus  just  like  a 
74S240  octal  buffer.  Furthermore,  since 
most  PAL  devices  feature  an  output  drive 
capability  of  24  milliamperes.  they  are 
quite  handy  for  bus  interfacing. 

continued 
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Figure  4:  Vie  actual  logic  diagram  of  the  combinatorial  PAU6L8.  The  fuse  links  are  not  shown  so  that  a  logic  designer 
may  place  an  X  at  those  points  where  a  fuse  should  he  left  intact. 


The  three-state  product  line,  along  with 
the  feedback  path  on  six  of  the  outputs, 
makes  the  programmable  I/O  feature 
work.  The  pin  is  an  input  to  the  AND  ar- 
ray when  all  the  fuses  in  the  three-state 


enable  product  line  are  left  intact,  while 
the  pin  is  an  output  when  all  the  fuses  are 
programmed.  Note  that  a  product  line  will 
always  be  true,  regardless  of  input  com- 
binations,  when  all   fuses  are  pro- 


grammed. The  programmable  I/O  feature 
lets  the  user  allocate  pins  for  input  or  out- 
put as  required  by  the  application. 

An  even  higher  level  of  flexibility  is 
possible  if  you  let  the  logic  in  the  product 


NOTE   FUSE  LINKS 
NOT  SHOWN 


Figure  5:  The  actual  logic  diagram  of  the  PAU6R8.  Notice  the  D-type  flip-flops  at  the  outputs. 
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line  determine  the  pin's  direction.  This  is 
done  by  programming  a  condition  in  the 
product  line  for  which  the  pin  will  be  an 
output.  You  can  use  this  feature  to  allocate 
available  pins  for  I/O  functions  or  to  pro- 
vide bidirectional  transfer  for  operations 
such  as  shifting  and  rotating  data. 

The  PAL16L8  is  used  in  applications 
such  as  complex  decoders,  encoders, 
multiplexers,  comparators,  and  replace- 
ment of  SSI/MSI  random  logic.  Another 
way  of  viewing  this  is  that  the  PAL16L8 
programmable  AND  array  contains  2048 
fuses.  You  can  program  these  fuses  to 
create  almost  any  configuration  of  up  to 
250  AND.  OR.  and  inverter  gates,  which 
is  roughly  250  equivalent  gates. 

PALs  with  Registered  Outputs 
and  Feedback 

The  structure  of  registered  PALs  is  similar 
to  that  of  the  PAL16L8  except  for  the  ad- 
dition of  the  registered  outputs.  In  the 
PAL16R8.  each  of  the  eight  registers  is  ac- 
tually a  D  (data)  flip-flop  that  is  clocked 
on  the  rising  edge  (see  figure  5).  The 
clock  signal  (pin  1)  is  shared  by  all  eight 
flip-flops.  Each  OR  gate  sums  eight  prod- 
uct lines  and  is  the  D  input  to  the  flip-flop. 
The  Q  output  from  the  flip-flop  is  avail- 
able both  for  feedback  into  the  PAL  array 
and  for  output  from  the  device.  Either 
polarity  of  the  feedback  signal  is  available. 


This  feedback  lets  the  PAL  device 
"remember"  the  previous  state,  and  it  can 
alter  its  function  based  upon  that  state. 
Thus,  registered  PAL  devices  are  ideal  for 
implementing  single-chip  state  sequencers 
and  state  machines. 

Conclusion 

PROMs  are  limited  in  the  number  of  in- 
puts they  can  handle,  since  every  input 
combination  is  made  available.  They  are 
useful  for  applications  that  require  a  large 
number  of  product  terms  but  few  inputs. 

PLAs  are  the  most  flexible  of  the 
AND/OR  array  PLDs  with  both  arrays 
programmable.  This  flexibility  makes 
them  slower,  since  the  signal  has  to  prop- 
agate through  two  programmable  arrays. 
PAL  devices  with  their  programmable- 
AND/fixed-OR  array  structure  can  ac- 
commodate more  inputs  than  PROMs 
because,  statistically,  not  every  input  com- 
bination is  required.  With  only  one  array 
programmable,  they  are  faster  than  PLAs. 

PAL  devices  with  registered  outputs  are 
particularly  useful  for  building  sequential 
circuits.  PAL  devices  also  can  provide 
feedback,  altering  the  function  of  a  given 
state  based  on  the  condition  of  the  im- 
mediately prior  state.  The  overriding  ad- 
vantage of  all  PLDs,  however,  is  the  in- 
tegration of  multiple  functions  onto  a 
single  programmed  circuit  to  save  board 
space  and  reduce  chip  count  and  cost.  ■ 
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Becoming  the  technology  of  choice  in  superminicomputer 
design,  programmable  array  logic  mixes  with  alternative 
devices  to  meet  design  constraints. 


by  Bradford  S.  Kitson  and 
B.  Joshua  Rosen 


Programmable  array  logic  devices  have  been  a 
driving  force  behind  the  latest  generation  of  32-bit 
superminicomputer  designs.  These  supermini- 
computers range  from  redesigns  of  existing  archi- 
tectures that  reduce  cost  or  overcome  packaging 
limitations,  to  designs  requiring  ultrafast  turn- 
around, to  high  performance  architectures  speci- 
fied to  take  full  advantage  of  such  devices. 

Superminicomputers  designed  with  program- 
mable array  logic  (PAL®)  include  the  VAX® 1 1/730 


Bradford  S.  Kitson  is  section  manager  in  product 
planning  and  applications  for  programmable  logic 
devices  at  Advanced  Micro  Devices,  901  Thompson 
PI,  Sunnyvale,  CA  94088.  He  holds  a  bs  in  electrical 
engineering  and  computer  science  from  the  University 
of  California  at  Berkeley. 

B.  Joshua  Rosen  is  manager  of  processor  design  at 
Dataflow  Systems  Corp,  42  Nagog  Park,  Acton,  MA 
01 720.  He  was  manager  of  processor  development  at 
Computervision  Corp,  Bedford,  MA,  when  this  article 
was  written.  He  holds  a  ba  in  physics  from  Lawrence 
University  and  an  MSEE  from  Northwestern 
University. 


from  Digital  Equipment  Corp,  the  MV/8000  and 
MV/ioooo  from  Data  General,  and  Computervision's 
apu  "(analytic  processing  unit).  A  ^implementa- 
tion of  the  original  VAX-1 1/780,  the  VAX-l  1/730  sup- 
plies 25%  of  the  performance  in  10<Vo  of  the  board 
space.  Programmable  array  logic  was  chosen  by 
the  MV/8000  designers  to  allow  the  shortest  possible 
design  cycle  and  to  catch  up  with  their  competitors. 
The  MV/10000  upgrades  performance  of  the 
MV/8000.  The  Computervision  APU  was  designed 
for  high  performance  with  PAL  devices  in  mind, 
and  provides  excellent  examples  of  how  to  use  such 
devices  to  their  fullest. 

Logic  Design  Alternatives 

Although  by  no  means  the  only  option,  pro- 
grammable array  logic  has  become  the  technology 
of  choice  in  superminicomputer  design.  Logic 
design  alternatives  (Fig  1)  include  standard  prod- 
ucts (fixed-function  devices),  semi-custom 
(programmable  logic,  gate  arrays,  and  standard 
cells),  and  fully-custom  logic  devices.  In  supermini- 
computer design,  the  primary  alternatives  are  stan- 
dard products,  gate  arrays,  and  PAL  devices.  The 
best  choice  for  any  given  function  depends  upon 
the  design  alternative  capabilities,  the  design  con- 
straints, and  the  function  actually  being 
implemented. 

Reprinted  with  permission  of  COMPUTER  DESIGN 
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Fig  1  Basic  categories  of  digital  logic  present  designers  with 
trade-off  opportunities.  Standard  products  fit  where  cost  is  a 
major  concern;  custom  and  semi-custom  products  can  be  used 
where  high  diversity  is  desirable. 

Via  fuse  programming,  a  PAL  device  allows  the 
designer  to  construct  a  custom  device  or  group  of 
devices  that  precisely  implement  a  desired  func- 
tion. In  contrast,  fixed-function  transistor- 
transistor  logic  (ttl)  small  scale  integration/ 
medium  scale  integration  (SSI/MSI)  alternatives 
seldom  seem  to  fit  any  application  in  the  desired 
way.  Thus,  the  SSI/MSI  designer  usually  pays 
penalties  via  extra  logic  levels  in  the  critical  path 
and  an  increased  package  count.  Gate  arrays  allow 
custom  devices  to  be  created  via  mask  program- 
ming. However,  designers  using  a  gate  array  must 
finalize  architecture  early  in  the  design  cycle.  Any 
errors  will  require  a  mask  change,  which  can  take 
months. 

In  most  cases,  the  best  choice  is  a  combination  of 
the  alternatives  and  probably  includes  some 
memory  as  well.  Typically,  the  six  basic  design  con- 
straints are  performance  (speed),  cost,  density 
(packaging),  power  dissipation,  reliability,  and 
design  turnaround.  Assigning  a  priority  to  these 
constraints  will  usually  define  the  logic  alternative 
that  a  machine  is  based  on.  Actual  implementation 
trade-offs  are  made  at  the  function  level.  The  three 
basic  functional  portions  of  a  design  are  data  path, 
control  path,  and  interface. 

Standard  Products  Have  Their  Place 

Standard  products  are  defined  as  devices  created 
for  a  wide  market.  Examples  of  standard  products 
are  TTL  ssi/msi,  fixed  instruction  set  metal  oxide 
semiconductor  (MOS)  microprocessors,  and  micro- 
programmable  large  scale  integration  (LSI)  building 
blocks.  These  devices  are  usually  multiple  sourced 
and  produced  in  high  volume,  resulting  in  lower 
individual  device  costs.  In  a  design  where  cost  is  the 
main  concern,  and  performance,  power  dissipation, 
density,  and  design  turnaround  are  of  little  or  no 
importance,  standard  products  are  probably  the 
best  choice.  In  a  design  such  as  a  superminicom- 


puter, where  these  other  considerations  have  a  high 
priority,  inherent  disadvantages  limit  standard 
product  use. 

While  SSl/MSl  devices  offer  fast  individual  gates, 
on  a  system  level  their  density,  power  dissipation, 
and  reliability  characteristics  are  not  as  good  as 
those  of  the  alternatives.  In  addition,  design  turn- 
around characteristics  are  inadequate  because 
changes  usually  require  printed  circuit  (PC)  boards 
to  be  laid  out  again.  In  most  superminicomputers, 
therefore,  SSI/MSI  gates  are  used  only  in  selected 
critical  path  functions  that  require  one  or  two  gate 
levels  (at  the  expense  of  density  and  power  dissipa- 
tion), and  in  interface  applications,  such  as  bus 
buffers,  latches,  registers,  and  transceivers. 

Standard  LSI  products  (mostly  bipolar)  offer 
exceptional  performance,  power  dissipation,  density, 
and  reliability  characteristics,  but  their  inflexible 
architectures  limit  their  application  range.  Super- 
minicomputer designers  rely  on  proprietary,  highly 
complex  architectures  to  differentiate  their  designs 
from  those  of  their  competitors,  and  LSI  imposes 
an  architecture.  Therefore,  applications  are  limited 
to  general  purpose,  well-defined  functions  in  the 
data  path,  such  as  parallel  multipliers  and  arith- 
metic logic  units  (ALUs)  that  can  benefit  from  their 
high  performance  characteristics. 

Gate  Arrays  Are  "Cast  in  Concrete" 

Semi-custom  gate  arrays  are  defined  by  inte- 
grated circuit  (IC)  manufacturers  as  large  arrays  of 
unconnected  gates.  End  users  specify  how  gates  are 
interconnected  with  actual  interconnection  occurring 
at  the  metal-mask  layer  of  the  ic  process.  The  main 
advantages  of  gate  arrays  are  high  density  and  the 
ability  to  customize  a  design,  while  primary  dis- 
advantages are  cost  and  design  turnaround.  Each 
custom  device  is  single  sourced  and  low  volume; 
therefore  they  are  not  cost-effective  unless  the 
application  is  density  limited  or  the  volume  is  very 
high.  Gate  arrays  can  adversely  affect  design  turn- 
around because  the  system  designer  must  design 
both  the  IC  and  the  system  in  which  the  ic  is  used. 
In  addition,  any  change  in  the  gate  array  requires 
new  masks  and  a  delay  for  each  mask  iteration. 

By  using  gate  arrays  only  where  the  design  can  be 
defined  early,  designers  minimize  these  turnaround 
disadvantages.  They  then  hedge  their  bets  by  sur- 
rounding the  gate  arrays  with  logic  that  can  correct 
any  design  bug(s)  discovered  later  on.  As  in  LSI, 
what  is  "cast  in  concrete"  is  usually  the  data  path. 
However,  gate  arrays  allow  more  of  the  data  path 
to  be  integrated  because  the  device  can  be  optimized 
for  specific  design  requirements.  A  proprietary 
16-bit  ALU  slice  might  be  a  typical  gate  array. 

Gate  arrays  are  also  used  to  interface  multiple 
onboard  buses  together  as  data  path  "glue." 
Control-path  and  interface  applications,  however, 
tend  to  be  too  likely  to  change.  Therefore,  control 
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path  functions  are  based  on  implementation  tech- 
niques such  as  writable-control-store  (wcs).  Inter- 
face applications  frequently  require  changes 
because  of  the  need  to  interface  one  designer's 
board  to  another's.  Consider  the  critical  timing 
between  a  cache,  an  instruction  fetching  unit,  and 
multiple  register-files.  The  exception  handling 
capability  required  for  typical  operations,  such  as  a 
cache  miss,  can  be  an  indeterminant  problem 
affecting  all  of  the  above-mentioned  functional 
units  in  the  system.  Should  a  bug  occur  in  the  cache 
unit,  the  interface  section  of  all  other  units  will  have 
to  be  changed  to  accommodate  the  correction. 

PAL  Structure  Paves  the  Data  Path 

Combining  simplicity  and  flexibility,  the  basic 
PAL  structure  is  a  fuse-programmable  AND  gate 
array  that  drives  fixed  connection  OR  gates,  allow- 
ing logic  to  be  implemented  in  sum-of-products 
(AND-OR)  Boolean  form. 

By  selectively  blowing  the  appropriate  fuses,  a 
PAL  device  can  implement  any  logic  function  as 
long  as  the  number  of  inputs  or  AND  gates  required 
does  not  exceed  the  number  provided  in  the  device 


chosen.  In  the  AmPALi6R4,  for  example,  the  true 
and  complement  version  of  each  of  the  16  inputs  is 
connected  via  fuses  to  each  of  the  64  AND  gates  in 
the  device  (Fig  2).  pal  devices  provide  additional 
features,  such  as  programmable  input/output  (I/O) 
pins  and  registered  outputs  with  internal  feedback 
that  further  enhance  their  ability  to  implement 
logic  functions  efficiently.  Programmable  I/Os  are 
especially  useful  for  trading  off  the  number  of 
device  outputs  for  inputs  to  fit  the  exact  number 
required  by  the  logic  functions  being  implemented. 
Internal  registered  feedback  is  desirable  when 
implementing  complex  state  machine  designs. 

Programmable  logic  devices,  like  gate  array,  are 
semi-custom  devices.  Created  by  the  IC  manufac- 
turer, they  are  alterable  by  fuse  programming  for  a 
specific  application.  Designed  specifically  for  logic- 
oriented  applications,  PAL  devices  enable  designers 
to  create  custom  devices  with  fast  turnaround  time. 
PAL  devices  compare  favorably  to  alternative 
devices  in  terms  of  performance,  cost,  density, 
power  dissipation,  and  reliability.  They  fall  behind 
gate  arrays  and  LSI  in  density  and  power  dissipa- 
tion, and  behind  SSI/MSI  in  individual  device  cost. 
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(Note  however,  that  PAL  devices  are  cheaper  on  a 
system  basis.) 

Performance  and  density  characteristics  of  PAL 
devices  have  led  to  significant  applications  in  the 
data  paths  of  superminis  where  they  are  used  along 
with  LSI  and/or  gate  arrays.  These  devices  serve  to 
"glue"  the  LSI  and  the  "cast  in  concrete"  gate 
arrays  into  the  system.  If  necessary,  the  design  turn 
around  capability  that  they  display  can  be  used  to 
optimize  the  data  path  architecture  and,  in  some 
cases,  fix  a  bug  in  a  gate  array  by  reprogramming  the 
PAL  devices  around  it.  Typical  data  path  functions 
for  the  devices  include  barrel  shifters,  masking, 
code  conversion,  and  multiple  bus  interface. 

Design  turnaround  becomes  especially  beneficial 
in  the  control  path  and  interface  portions  of  a 
design.  Most  control  path  functions  are  highly  ran- 
dom and  are  prone  to  change  and/or  error  While 
wcs  allows  design  changes  to  be  made  by  rewriting 
microcode,  PAL  devices  permit  changes  that  cannot 
be  made  in  microcode,  or  would  adversely  affect 
system  performance.  For  instance,  one  supermini- 
computer manufacturer  has  established  the  rule 
that  a  gate  array  can  be  used  in  the  control  path 
only  if  eight  or  more  PAL  devices  are  necessary  for 
the  same  function.  In  interface  design,  from  a  den- 
sity standpoint,  PAL  devices  allow  the  interface  to 
merge  with  the  data  path  "glue"  function.  Since 
the  interface  is  just  as  likely  to  change  as  the 
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Fig  3    Multiplier  calculates  56-bil  x  56-bit  product.  Partial 
product  generation  logic  uses  seven  8x8  slices  to  form  8  x 
56-bit  multiplication  array.  Carry-save  adders  implement 
3-to-2  counting  techniques  to  form  two  operands  thai  are 
summed  in  a  lookahead  ALU. 


control  path,  but  without  the  benefit  of  wcs,  design 
turnaround  becomes  a  factor.  If  one  board's 
designer  needs  to  change  the  interface,  the  designs 
of  many  other  boards  are  impacted.  Updates  on  all 
boards  can  be  easily  made  by  reprogramming  one 
or  more  PAL  devices.  These  devices  can  also  pro- 
vide the  drive  capability  required  by  this  appli- 
cation area. 

Designing  with  PAL  Devices  in  Mind 

Trade-offs  among  PAL  devices  and  other  digital 
logic  alternatives  were  key  factors  in  Computervi- 
sion's  APU  design.  In  conjunction  with  standard 
products,  such  as  ALUs,  LSI  multipliers,  and 
memory  devices,  PAL  devices  were  used  to  create  a 
patented  architecture  not  feasible  in  standard  TTL 
SSI/MSI. 

Designed  as  a  very  high  speed  32-bit  supermini- 
computer for  engineering  applications,  the  Com- 
putervision  APU  is  twice  as  fast  as  competitive 
designs,  yet  occupies  the  same  board  space.  Its 
designers,  instead  of  merely  replacing  TTL  SS1/MS1 
with  PAL  devices,  used  PAL  as  customizable  logic 
building  blocks.  This  allowed  them  to  implement 
powerful  logic  functions  in  a  minimum  of  space. 

The  APU  processor  board  set  is  divided  into  four 
modules.  The  parser/sequencer  contains  an  instruc- 
tion processor  that  fetches  and  decodes  instruc- 
tions in  parallel  with  the  execution  unit.  The 
control  processor  performs  address  and  integer 
computations  and  the  floating  point  pipe  (FPP)  per- 
forms both  scalar  and  vector  floating  point  opera- 
tions. The  cache/address  translation  unit  contains 
a  256-slot  area  page  table  entry  cache  and  a 
16K-byte  memory  cache. 

Approximately  25%  of  the  chips  in  the  APU  board 
set  are  in  PAL.  Since  the  APU  is  the  first  implemen- 
tation of  the  new  CPU  architecture,  many  design 
aspects  are  subject  to  change  as  the  architecture 
evolves.  PAL  devices  permit  designers  to  rapidly 
modify  hardware  to  fit  the  architecture  needs,  and 
to  implement  feature  and  performance  enhance- 
ments with  minimal  impact  on  the  development 
schedule. 

Ability  to  generate  a  very  large  number  of  custom 
ICS  (over  200  different  PAL  codes  are  used  in  the 
APU),  significantly  reduces  the  processor's  size 
while  increasing  overall  performance.  This  means 
that,  although  the  APU  and  Digital  Equipment 
Corp's  VAX-l  1/750,  a  gate  array-based  machine, 
consume  exactly  the  same  amount  of  board  space, 
the  APU  is  more  than  twice  as  fast.  In  fact,  the 
APU's  Fortran  performance  is  substantially  faster 
than  that  of  the  VAX- 11/780,  a  machine  that  con- 
sumes 5.2  times  as  much  board  space  as  the  APU. 

Designed  as  a  high  speed  arithmetic  extension  to 
the  APU  execution  engine,  the  APU  FPP,  unlike 
comparable  machines,  is  an  integral  part  of  the 
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internal  architecture  and  not  an  optional  add-on. 
As  a  result,  the  FPP  not  only  accelerates  scalar  and 
vector  floating  point  arithmetic,  but  also  performs 
byte,  word,  double-word,  and  quad-word  string 
operations.  In  addition,  it  serves  to  enhance  the 
performance  of  important  nonfloating  point 
instructions  such  as  Procedure  Call  and  Return.  A 
total  of  79  PAL  devices  are  used  for  both  control- 
and  data-path  applications  on  the  FPP  board. 

Counter  Solves  Multiplication 
KroDiem 

The  heart  of  the  FPP  is  the  multiplier  section 
(Fig  3).  Double-precision  floating  point  multiplica- 
tion requires  the  calculation  of  a  56-bit  x  56-bit 
product.  Unfortunately,  56  x  56  parallel  multipliers 
do  not  exist  on  silicon.  From  a  cost/performance 
standpoint,  the  best  solution  is  to  use  a  number  of 
small  multipliers  to  build  an  intermediate  sized  par- 
allel multiplier  and  then  produce  a  large  product 
(56  x  56)  in  multiple  cycles. 

Partial  product  generator  FPP  logic  uses  seven 
Am25S558  8  x  8  multiplier  slices  to  implement  an  8  x 
56-bit  multiplication  array.  Each  multiplier  chip 
produces  a  16-bit  product.  In  general,  the  8  most 
significant  bits  (MSBs)  of  each  partial  product  gen- 
erator must  be  added  to  the  8  least  significant  bits 


(I.SBs)  of  the  next  higher  slice  to  generate  the  full 
64-bit  partial  product.  Exceptions  are  the  8  MSBs 
and  LSBs. 

This  technique  also  requires  the  ability  to  accu- 
mulate partial  products  with  the  partial  products 
from  previous  cycles.  Thus,  each  cycle  must  be 
accompanied  by  two  additions:  the  partial  product 
summation  and  the  intermediate  product  accumu- 
lation. While  this  can  be  done  by  following  the 
multipliers  with  two  levels  of  lookahead  adders, 
usually  74S18IS,  the  resulting  nanocycle  time  is 
approximately  three  times  longer  than  the  partial 
product  generation  time  of  the  8  x  8  multipliers. 
Modifying  this  scheme,  however,  by  adding  reg- 
isters between  each  level  of  logic,  the  pipeline 
multiplier  reduces  the  nanocycle  time  to  near  the 
propagation  delay  time  of  the  multiplier  chips  plus 
the  clock  to  output  time  of  the  multiplier  register 
plus  the  setup  time  of  the  intermediate  result  reg- 
ister. This  scheme  has  two  disadvantages:  increased 
pipe  latency,  caused  by  the  two  extra  levels  of 
pipelining,  and  a  high  parts  count. 

Still  another  technique  involves  replacing  one 
level  of  the  pipe  and  one  level  of  lookahead  adders 
with  carry-save  adders  between  the  partial  product 
generators  and  the  pipeline  registers.  Carry-save 
adders  are  used  to  implement  a  technique  that  is 
called  3-to-2  counting.  As  seen  in  Fig  4(a),  any 
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Fig  4    Carry-save  adders  use  3-to-2 
counting  techniques  to  recode  any 
three  equally  weighted  bits  into  a 
2-bit  field  (a).  Sixteen  pal  units 
form  triple  3-to-2  counters  <b)  to 
reduce  the  three  multiplication 
operands  to  two  intermediale 
results. 
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Without  adding  pipe  latency,  the  APU 
is  able  to  accumulate  partial  products 
at  a  rate  of  8  x  56  bits  every  112  ns. 


combination  of  3  equally  weighted  bits  can  be 
recoded  into  a  2-bit  field. 

This  makes  it  possible  to  reduce  the  three  oper- 
ands generated  by  the  multiplication  process  (high 
and  low  partial  products  and  64-bit  intermediate 
product)  into  only  two  operands  which  may  then 
be  summed  together  in  a  single  lookahead  ALU. 
The  3-to-2  recoding  requires  no-carry  propagate 
logic  and  is  therefore  very  fast.  Only  one  level  of 
pipelining  is  required  because  of  the  3-to-2 
counter's  speed,  resulting  in  both  a  reduced  parts 
count  and  a  reduced  pipe  latency. 

In  the  APU  floating  point  engine,  16  AmPALl6R6s, 
programmed  as  triple  3-to-2  counters,  are  used  to 
reduce  the  three  multiplication  operands  to  two 
intermediate  results  [Fig  4(b)].  The  registered  PAL 
outputs  are  connected  to  the  input  buses  of  the 
Mantissa  ALU  that  is  also  used  for  floating  point 
addition  and  subtraction.  The  Mantissa  ALU  then 
calculates  the  next  intermediate  product  in  parallel 
with  the  partial  products  calculations  occurring  in 
the  8  x  8  multipliers.  This  intermediate  product  and 
the  new  partial  products  are  recoded  by  the  3-to-2 
counter  PAL  devices  to  form  the  next  pair  of  inter- 
mediate results.  This  process  continues  until  the 
complete  56  x  56  product  is  generated.  Thus,  with- 
out adding  pipe  latency,  the  APU  is  able  to  accumu- 
late partial  products  at  a  rate  of  8  x  56  bits  every 
112  ns,  which  coincides  with  the  basic  nanocycle 
machine  time. 

Barrel  Shift,  a  3-Level 
Implementation 

The  APU's  barrel  shifter  performs  left  shift,  right 
shift,  and  rotate  operations  of  0  to  63  bits  in  a 
single  microcycle.  Used  mainly  for  floating  point 
prescale  and  normalize  operations,  the  barrel 
shifter  (Fig  5)  is  implemented  in  three  stages:  the 
word  rotater,  nibble  shifter,  and  bit  shift  and  mask 
logic.  It  is  controlled  by  associated  prescale, 
leading  zero  detect,  and  mask  control  logic. 

Prescale  logic  converts  the  signed  difference  pro- 
duced by  the  exponent  arithmetic  units  based  on 
the  comparisons  of  the  two  operand  exponents, 
into  an  absolute  shift  distance.  This  shift  distance 
is  then  used  to  right  shift  (prescale)  the  smaller 
operand  Mantissa  of  a  floating  point  add  or  sub- 
tract operation.  The  leading  zero  detect  logic  deter- 
mines the  left  shift  distance  required  to  produce  a 
left-justified  (normalized)  result.  Mask  control 
logic  converts  rotated  data  to  shifted  data  by  mask- 
ing off  the  appropriate  leading  or  trailing  bits  to 


implement  right  or  left  shifts.  These  three  sections 
are  implemented  in  PAL. 

Implementing  the  3-level,  64-bit  barrel  shifter  in 
MSI  requires  the  use  of  Am25SlO  4-bit  shifters.  The 
first  level  is  the  word  rotater  which  performs  a  cir- 
cular rotate  of  0,  16,  32  or  48  bits.  Although  imple- 
mentation is  simple,  the  MSI  solution  requires  16 
packages.  The  second  level  is  the  nibble  shifter, 
essentially  identical  to  the  word  rotater  but  wired 
to  rotate  0,  4,  8,  or  12  bits.  The  final  barrel  shifter 
stage  requires  not  only  bit  rotate  but  also  leading 
and  trailing  bit  masking  and  sticky  bit  computation 
(ie,  the  logical  OR  of  the  masked-out  bits).  An  MSI 
solution  requires  not  only  the  16  packages  of 
AM25S10S,  used  in  each  preceding  level,  but  also  16 
AND  gate  packages  for  masking,  with  another  16 
AND  packages  for  the  sticky  bit  computation.  Con- 
trol logic  for  the  mask  operation  requires  as  much 
logic  as  the  entire  shift  path.  A  more  practical  solu- 
tion consists  of  building  separate  left  and  right 
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Fig  5  Implementation  of  64-bil  "Nearest  Neighbor  Shifler" 
is  in  three  stages:  word  rotaler,  nibble  shifler,  and  bit  shifl  and 
mask  logic.  Control  derives  from  associaied  prescale,  leading 
zero  delect,  and  mask  control  logic. 
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Fiji  6  Word  rotation,  firsl  level  of  3-level  shifter,  consists 
of  eight  pai  units,  each  programmed  as  two  4-bil  rotators. 
It  performs  a  circular  rotate  of  0,  16,  32,  or  48  bits. 
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Fig  7    Nibble  shifter  makes  up  second  level  of  barrel 
shirter.  It  is  wired  to  rotate  0,  4,  8,  or  12  bits. 

shifters,  48  packages  apiece,  and  not  implementing 
a  slick y  bit  at  all. 

Implemented  in  PAL,  a  64-bit  rotater  and  shifter 
with  sticky  bil  computation  requires  considerably 
fewer  packages  than  a  unidirectional  MSI  shifter. 
The  word  rotaler  consists  of  eight  identical  PAI. 
devices  programmed  as  two  4-bit  rotaters/ package 
(Fig  6).  The  nibble  shifter  requires  four  \rrt2JSI0S 
and  eight  PAI  devices  programmed  as  6-bit  wide, 
4-place  shifters  (Fig  7).  The  bit  shift  and  mask  logic 
requires  16  I'M  devices  in  the  data  path  and  two 
PAI  devices  in  the  control  path.  (Fig  8). 

To  implement  the  masking  function  required  for 
shifting,  a  technique  called  "Nearest  Neighbor 
Shifting"  (U.S.  patent  pending,  Computervision 
Corp)  is  used.  Each  shift  and  mask  PAI  device  has 
an  enable  input  from  one  mask  control  PAL  unit. 
In  addition,  each  shift  and  mask  PAL  device  is  also 
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Fig  8    Bit  shift  and  mask  logic  tor  barrel  shifter  requires  16 
PAL  devices  in  the  data  path  and  two  pai  devices  in  (he 
control  path.  Each  shift  and  mask  pai  device  has  an  enable 
input  from  one  mask  control  pai  device  and  is  connected  to 
enable  inputs  of  its  left  and  right  neighbors.  The  64-bit 
masking  operation  requires  only  16  control  lines. 
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Kig  9  "Nearest  Neighbor"  interconnection  is  used  to 
implement  masking  function  for  shifting.  A  mask  control  pai 
device  determines  if  all  4  bits  are  to  be  masked;  enables  from 
the  adjacent  slice  determine  if  the  pm  device  is  at  shift 
houndarv . 

connected  to  the  enable  inputs  from  its  left  and 
right-hand  neighbors  (Fig  9).  The  mask  control 
PAL  input  determines  if  all  4  bits  from  the  slice 
should  be  masked  off.  Enables  from  the  adjacent 
slice  determine  if  a  PAL  unit  is  at  a  shift  boundary. 
If  only  one  of  the  neighboring  slices  is  disabled,  the 
shift  and  mask  PAL  unit  masks  off  from  0  to  3  of 
the  bits  adjacent  to  the  disabled  slice,  depending  on 
the  bit-rotation  distance.  In  this  way,  the  64-bit 
masking  operation  can  be  implemented  with  only 
16  control  lines  as  opposed  to  at  least  64  for  an 
SSI/MSI  solution. 

In  addition  to  performing  the  final  shift  and 
mask  operation,  the  bit-shifter  PAL  unit  also  com- 
putes the  logical  OR  of  the  masked-out  bits  at  each 
slice  position.  These  outputs  are  then  logically 
ORed  to  generate  a  sticky  bit.  The  extra  hardware 
required  is  less  than  two  SSI  packages.  The  entire 
PAL  barrel  shifter  requires  38  devices  to  implement 
64-bit  rotation,  left  shifting,  right  shifting,  and 
sticky  bit  accumulation.  An  MSl-based  left/right 
shifter,  without  sticky  bit  computation,  requires  a 
minimum  of  96  parts.  In  addition,  the  logic  re- 
quired for  implementing  the  prescale  and  normal- 
ize operations  is  significantly  reduced  through  the 
use  of  PAL  devices. 


New  PAL®  Device  Architecture 
Extends  Design  Flexibility 
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The  introduction  of  Programmable  Array  Logic  (PAL®)  devices 
by  Monolithic  Memories  in  the  late  1970s  marked  the  beginning 
of  rapid  growth  for  the  programmable  logic  device  market.  By 
offering  fast  turn  development  of  user-defined  circuit  functions 
and  significant  package  count  reduction,  programmable  logic 
devices  (PLDs)  solved  such  system  design  problems  as  time  to 
market,  system  size  and  cost,  and  the  cost  of  making  design 
changes. 

First  generation  PLDs,  such  as  the  PAL1 6L8,  PAL16R4,  PAL1 6R6, 
and  PAL16R8,  offer  relatively  simple,  single-function  architec- 
tures. Designers  choose  either  devices  with  combinatorial  out- 
puts, registered  outputs,  or  a  combination  of  both,  to  solve  their 
specific  design  tasks. 

While  simple  architectures  still  offer  the  fastest  operating 
speeds,  more  universal  architectures  have  emerged  among 
second  generation  PAL  devices,  which  allow  more  user  defini- 
tion of  the  device  configuration.  The  universal  "macroceH" 
approach  allows  users  to  reduce  the  number  of  different  parts  in 
a  system  and  enjoy  considerable  flexibility  in  defining  the  final 
PLD  architecture. 

All  of  these  devices  enable  logic  designers  to  combine  the  tasks 
of  several  SSI  devices  into  a  single  PLD.  System  functions  in  the 
form  of  small  state  machines  or  simple  data  flow  machines  are 
possible  in  a  single  PLD.  But  the  ability  of  designers  to  build 
significant  "systems-on-a-chip"  has  been  somewhat  hampered 
either  by  a  lack  of  device  flexibility  or  the  inability  to  make  full 
use  of  the  logic  power  of  the  PLD.  This  has  kept  the  benefits  of 
PLDs  inaccessible  to  some  designers.  The  purpose  of  this  paper 
is  to  describe  some  of  the  key  features  of  a  new  PAL  device 
which  opens  up  a  whole  range  of  possibilities  by  removing  many 
of  these  limitations. 

The  PAL32VX1 0 

The  new  PAL32VX1 0  takes  the  synchronous  logic  approach 
used  by  the  PAL22V10.  Judicious  improvements  to  the  macro- 
cell  provide  logic  power  far  greater  than  anything  currently 
available  in  24  pins.  This  is  accomplished  while  maintaining  a  25- 
ns  propagation  delay  (tpo)  with  only  180-mA  current  drain. 

The  PAL32VX10  shares  some  of  the  features  of  the  PAL22V10, 
such  as  varied  product  term  distribution,  and  can  emulate  any 
PAL22V10  function.  It  is  therefore  appropriate  to  describe  the 
individual  features  of  the  PAL32VX10  by  contrasting  them  with 
the  corresponding  more  limited  options  in  the  PAL22V10.  The 
key  features  to  be  discussed  are: 

1)  Dual  Feedback 

2)  Programmable  Flip-Flops 

3)  Register  Bypass  Scheme 

After  discussing  these  features  we  can  look  at  the  overall  power 
provided  by  the  integration  of  these  features  into  a  single  PLD. 
Adapted  from  a  paper  presented  at  Northcon/86 


The  feedback  scheme  used  in  the  PAL22V10  (Figure  1a)  pro- 
vides two  sources  for  the  feedback  signal,  only  one  of  which  is 
used  at  any  given  time.  When  the  flip-flop  is  used,  the  feedback 
signal  is  taken  from  the  output  of  the  flip-flop  itself;  when  the 
flip-flop  is  bypassed,  the  feedback  signal  comes  from  the  output 
pin,  allowing  for  bidirectional  signal  flow. 

There  are  two  basic  limitations  in  this  approach.  The  first  is  the 
fact  that  registered  outputs  cannot  be  made  bidirectional.  One 
could  add  a  separate  feature  allowing  such  an  option  (Figure 
1  b),  but  such  an  improvement  by  itself  provides  an  incremental 
benefit.  The  second  limitation,  which  is  substantial  from  the 
standpoint  of  device  utilization,  is  the  fact  that  if  an  output  is 
needed  as  an  input,  the  output  macrocell  for  that  pin  is  com- 
pletely Wi 
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(a)  As  It  Currently  Exists 
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(b)  A  Possible,  But  Marginal  Improvement 

Figure  1.  PAL22V10  Feedback  Selection  Scheme 

The  approach  taken  in  the  PAL32VX10  is  to  provide  two  com- 
pletely independent  feedback  lines:  one  from  the  flip-flop,  and 
one  from  the  output  pin  (Figure  2).  That  this  solves  the  first 
limitation  is  obvious;  the  output  can  always  be  disabled  and  used 
as  an  input  without  interfering  with  the  signal  being  fed  back 
from  the  flip-flop. 


PAL®  is  a  registered  trademark  of  Monolithic  Memories. 
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Figure  2.  PAL32VX10  Dual  Feedback  Scheme 


But  the  benefits  go  far  beyond  that.  Using  the  output  pin  per- 
manently as  an  input  does  not  require  that  the  macrocell  be 
wasted:  the  flip-flop  can  be  "buried",  allowing  internal  state 
machines  to  be  built.  This  removes  the  second  limitation  of  the 
PAL22V10  scheme.  Alternatively,  the  macrocell  can  be  used  as 
an  input  register  by  feeding  the  input  into  the  flip-flop,  and  then 
using  the  flip-flop  feedback  to  drive  other  logic  in  the  device. 
This  same  approach  can  also  be  used  when  the  pin  is  used  as  an 
output,  allowing  the  flip-flop  to  be  loaded  directly  from  its  output 
pin. 

Thus  the  thirty-two  inputs  to  the  array  consist  of  twelve  dedi- 
cated inputs,  ten  output  pin  feedback  inputs,  and  ten  flip-flop 
feedback  inputs. 

Programmable  Flip-Flops 

PAL  devices  have  traditionally  been  equipped  with  D-type  flip- 
flops  for  data  synchronization  and  state  machines.  D-type  flip- 
flops  work  quite  well  for  many  state  machines,  but  in  many  other 
cases,  J-K  or  T-type  flip-flops  would  allow  for  implementation  of 
more  complex  state  machines.  Yet  offering  alternative  products 
with  other  dedicated  types  of  flip-flops  would  unnecessarily 
overpopulate  the  product  family,  complicating  the  designer's 
device  selection  and  inventory  tasks. 

The  ideal  solution  is  to  have  a  single  device  which  can  have  each 
flip-flop  individually  programmed  to  define  its  type.  And,  as  will 
be  seen,  it  is  sometimes  beneficial  to  be  able  to  change  types  "on 
the  fly".  These  capabilities  have  been  provided  in  the  PAL32VX1 0. 

T-type  (lip-flops 

As  an  example,  the  classic  application  involving  T-type  flip-flops 
is  in  the  design  of  binary  counters.  When  a  D-type  flip-flop  is 
used,  the  Boolean  equations  have  to  account  for  each  time  the 
device  output  is  to  be  a  logic  1 .  This  means  that  the  nth  bit  of  the 
counter  requires  n  product  terms: 

Qn+  :=  /Qn*Qn-1*...'Q1 
+  Qn7Q1 
+  Qn*/Q2 


+  Qn7Qn-1 

By  using  a  T-type  flip-flop,  one  only  has  to  determine  when  the 
flip-flop  output  should  toggle.  The  nth  bit  toggles  only  when  all 
lower  order  bits  are  "1".  This  can  be  expressed  as 

"Qn  should  HOLD  UNLESS  all  lower  order  bits  are  HI." 

The  UNLESS  operator  can  be  implemented  by  an  Exclusive-OR 
(XOR)  gate: 

Qn+  :=  Qn  :+:  (Qn-1"Qn-2*...'Q1) 


This  requires  two  product  terms  (one  on  each  input  of  the  the 
XOR  gate),  regardless  of  which  bit  of  the  counter  is  being 
defined. 

So  by  providing  an  XOR  gate  with  Q  fed  back  to  one  input,  the 
combination  XOR  gate/D-type  flip-flop  acts  as  a  T-type  flip-flop. 
A  sum  of  products  on  the  other  XOR  input  provides  the  Toggle 
signal. 

J-K  flip-flops 

The  classic  formula  for  a  J-K  flip-flop  is 

Q+  :=  J7Q  +  /K*Q. 

This  expression  can  be  directly  implemented  in  a  D-type  flip- 
flop,  providing  IK  fits  within  the  available  product  terms.  If  this  is 
not  the  case,  or  if  finding  /K  is  inconvenient,  another  formulation 
is  needed.  We  know  that  by  the  definition  of  a  J-K  flip-flop, 

Q  should  HOLD  UNLESS: 

the  output  is  0  and  we  SET  to  1 ; 
the  output  is  1  and  we  RESET  to  0; 
we  are  toggling  (JK  =11). 

The  above  conditions  for  "not  holding"  can  be  plugged  into  a 
Karnaugh  map  and  minimized  to  give  the  following  expression: 

Q+  :=  Q  :+:  (J7Q  +  K'Q). 

Here,  with  the  help  of  the  XOR  gate,  a  J-K  flip-flop  is  imple- 
mented directly  in  terms  of  J  and  K,  instead  of  /K.  Again,  this  is 
done  by  feeding  Q  to  one  input  of  the  XOR  gate,  and  by  ORing 
several  product  terms  into  the  other  XOR  input.  These  terms  are 
divided  between  J  terms  and  K  terms;  J  terms  are  ANDed  with 
10,  while  K  terms  are  ANDed  with  Q. 

So  we  can  implement  J-K,  T,  and  D-type  flip-flops  by  using  the 
XOR  gate  in  combination  with  a  D-type  flip-flop.  By  setting  one 
XOR  input  to  Q,/Q,  1,or0,wecan  implement  J-K, /J-/K.J-/K,  or 
/J-K  flip-flops.  The  other  XOR  input  provides  a  sum  of  products 
for  J  and  K.  Of  course,  since  S-R  flip-flops  are  a  subset  of  J-K 
flip-flops,  they  can  be  implemented  in  the  same  fashion. 

Reconfigurable  flip-flops 

This  is  a  step  in  the  right  direction,  but  one  can  do  more. 
Consider  the  design  of  a  loadable  counter.  This  is  a  situation 
where  a  T-type  flip-flop  is  preferred  when  counting,  but  a  D-type 
is  better  when  loading.  A  T-type  configuration  can  be  converted 
to  D-type,  however,  if  the  LOAD  control  input  is  used  to  disable 
the  XOR  input  containing  Q. 

Thus  by  making  the  XOR  input  a  product  term,  instead  of  a 
dedicated  single  input,  the  flip-flop  type  can  be  changed  on  the 
fly  to  allow  each  particular  function  to  be  implemented  as  effi- 
ciently as  possible.  Note  that  PLDs  which  provide  programma- 
ble flip-flops  without  the  XOR  control  product  term  cannot  have 
their  flip-flops  altered  once  configured.  So,  for  instance,  adding 
a  LOAD  feature  to  a  counter  then  requires  twice  as  many  extra 
product  terms  as  required  with  a  reconfigurable  flip-flop. 

All  of  these  considerations  have  been  accommodated  in  the 
PAL32VX10  (see  Figure  3).  The  single  XOR  product  term  pro- 
vides control  of  the  flip-flop  type,  for  either  permanent  or  recon- 
figurable flip-flops.  From  eight  to  sixteen  product  terms  are 
available  as  D  or  T  inputs,  or  to  be  divided  between  J  and  K  in- 
puts. Some  of  the  possible  configurations  are  shown  in  Figure  4. 


New  PAL®  Device  Architecture  Extends  Design  Flexibility 


{>o  


D  Register  with  I/O 


J-K  Register  with  I/O 


T  Register  with  I/O 


E> 


-{>o  


3J- 


Buried  Register  with  Dedicated  Input 


Figure  4.  A  Few  Selected  Flip-Flop  Configurations 
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Register  Bypass 

The  two  primary  second-generation  PLDs,  the  PAL20RA1 0  and 
the  PAL22V10,  have  different  ways  of  bypassing  their  flip-flops. 
The  PAL22V10  has  a  single  fuse  which  configures  the  output 
permanently  (see  Figure  5a).  Once  the  configuration  is  chosen,  it 
cannot  be  changed.  On  the  other  hand,  the  PAL20RA1 0  bypasses 
its  flip-flop  by  activating  both  SET  and  RESET  simultaneously 
(see  Figure  5b).  This  can  be  done  dynamically,  but  since  the  SET 
and  RESET  affect  the  contents  of  the  flip-flop,  one  cannot,  for 
instance,  clock  data  into  the  flip-flop,  change  to  combinatorial, 
and  then  change  back  to  look  at  the  flip-flop;  the  data  in  the 
flip-flop  will  have  been  lost. 

The  PAL32VX1 0  eliminates  this  restriction  by  using  a  product 
term  to  control  the  bypass  multiplexer  (see  Figure  5c).  While  the 
output  is  combinatorial,  the  flip-flop  is  unaffected.  This  makes  it 
poss  ble  to  use  the  flip-flop  in  macrocells  that  are  sometimes 
combinatorial. 
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(a)  The  PAL22V10  Scheme 


(b)  The  PAL20RA10  Scheme 
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(c)  The  PAL32VX10  Scheme 
Figure  5.  Register  Bypass  Schemes 


The  Complete  Macrocell 

This  has  not  been  an  exhaustive  description  of  all  of  the  features 
of  the  PAL32VX10  macrocell,  a  summary  of  which  is  shown  in 
Figure  6.  This  paper  has  .merely  highlighted  the  more  revolu- 
tionary capabilities.  The  complete  macrocell  is  shown  in  Figure 
7.  All  of  the  elements  work  together  to  offer  unparalleled  flexibil- 
ity, without  compromising  the  performance  of  the  device. 

An  application  will  serve  to  illustrate  the  power  afforded  by  all  of 
these  features  operating  in  concert.  It  is  beyond  the  scope  of  this 
paperto  describe  the  details;  a  brief  description  will  suffice  here. 


Programmable  registered  or  combinatorial  outputs 

Dual  independent  feedback  paths  allow  I/O  with  combinatorial 

or  feedback  register  outputs 

Programmable  flip-flops  allow  J-K,  S-R,  T  or  D  types 
Programmable  output  polarity 

Register  set  and  register  reset  can  be  asynchronous  or 
synchronous 

Automatic  register  preset  on  power  up 
Varied  product  term  distribution 
Up  to  16  product  terms  per  output 


Figure  6.  Features  of  the  PAL32VX10 


Figure  7.  The  Complete  PAL32VX10  Macrocell 


A  PDP-1 1  Floating-Point 
Coprocessor  Controller 

This  device  monitors  a  PDP-1 1  bus  for  instructions  that  should 
be  sent  to  a  coprocessor.  As  shown  in  Figure  9,  it  provides  the 
PDP  instruction  decode,  an  instruction  tracking  state  machine 
which  provides  various  wait  states  for  operand  fetching,  and 
coprocessor  instruction  encoding.  The  3-bit  state  machine  is 
not  needed  externally;  thus  it  is  buried,  and  the  output  pins 
provide  additional  inputs.  Reset  capability  and  a  test  feature  for 
examining  the  buried  flip-flops  are  included. 

This  application  takes  advantage  of  the  varied  product  term 
distribution,  the  buried  flip-flops,  and  J-K,  T,  and  D-type  flip- 
flops.  To  implement  the  same  application  without  a  PAL32VX10 
would  require: 

1  PAL22V10 
1  PAL16R4 
1  PAL14H4 

Of  course,  since  we  are  talking  about  replacing  several  PAL 
devices  with  another  PAL  device,  the  number  of  discrete  TTL 
chips  that  are  being  replaced  is  very  large  indeed. 
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Summary 

The  PAL32VX10  provides  significantly  increased  flexibility  as  a 
result  of  a  few  carefully  placed  improvements.  The  addition  of 
another  feedback  line,  an  XOR  gate,  and  a  few  other  enhance- 
ments allows  full  utilization  of  the  device.  The  utilization  can  be 
further  optimized  through  the  ability  to  represent  the  desired 
functions  in  the  most  compact  form  possible.  These  capabilities 
combine  to  form  the  most  versatile  24-pin  PLD  available  to  date. 


PAL32VX10 


Figure  8.  Coprocessor  Controller 
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Introduction 

With  increasing  density  and  functionality,  programmable  logic 
devices  are  being  used  for  such  specialized  applications  as 
state  machines.  The  PROSE  (PROgrammable  SEquencer) 
PMS14R21  device  is  the  first  device  developed  for  state  ma- 
chine design  which  integrates  the  hardware  and  software  as- 
pects into  a  new  design  methodology.  On  a  single  device,  the 
PROSE  architecture  combines  the  advantages  of  both  PROM 
and  PAL  elements  for  implementing  internal  state  sequencing 
functions,  and  provides  a  powerful  device  for  designing  state 
machines.  These  architectural  features  also  lend  themselves 
easily  to  the  requirements  of  software  design  tools  (such  as 
PALASM®2  software)  for  providing  an  easy-to-use,  high-level 
state  machine  design  syntax. 

The  PROSE  design  methodology  offers  high  functionality  and 
ease  of  use  along  with  the  traditional  advantages  of  program- 
mable logic  devices,  such  as  instant  customizability,  fast  turn- 
around time,  low  development  cost  and  reduced  real  estate 
requirements. 

State  Machine 

A  state  machine  is  defined  as  a  digital  logic  system  which  tra- 
verses through  a  sequence  of  states  in  an  orderly  fashion.  The 
state  is  a  set  of  values  measured  at  various  points  in  the  digital 
system.  This  state  value  is  typically  stored  in  a  set  of  storage 
registers.  Asimple model ofa  state  machine  is  shown  in  Figure  1. 
It  consists  of  three  basic  elements:  storage  registers,  next 
state  decoder,  and  output  decoder. 


OUTPUT 
DECODER 


NEXT 
STATE 
DECODER 


STORAGE 
REGISTERS 


STATE 
OF  THE 
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Figure  1 .  General  Model  of  a  State  Machine 

The  storage  registers  perform  the  functions  of  storing  present 
state  of  the  state  machine  as  well  as  the  present  output  infor- 
mation (in  which  case  it  is  also  referred  to  as  the  pipeline  regis- 
ter). The  next  state  decoder  decodes  the  present  state  of  the 
machine  along  with  the  present  input  conditions  and  generates 
the  next  state  of  the  machine.  Typically  it  is  combinatorial  logic 
in  PAL/PLA-based  state  machines  and  is  a  PROM  in  microcoda- 
ble  sequential  state  machines. 

The  output  decoder  determines  the  outputs  of  the  machine  from 
the  present  state  of  the  machine  and  the  input  conditions.  It  is 
not  mandatory  to  have  an  output  decoder  as  the  outputs  can 
also  be  generated  directly  by  the  registers. 


Sequential  state  machines  are  usually  classified  into  two 
types  depending  upon  their  output  generation.  In  Mealy  type 
state  machines,  outputs  are  a  function  of  the  present  state 
and  input  conditions.  The  output  decoder  takes  the  present 
state  and  input  conditions,  decodes  them  and  generates  the 
outputs.  In  Moore  type  state  machines,  outputs  are  strictly  a 
function  of  the  state  of  the  machine.  Thus,  the  Moore  machine 
usually  requires  a  simpler  output  decoder  to  decode  the 
present  state  only,  or  none  at  all  in  a  case  where  the  storage 
registers  can  be  used  to  provide  these  outputs  directly. 

As  transitions  to  the  next  state  are  dependent  upon  the 
present  state  and  input  conditions,  the  operation  of  a  Mealy 
machine  can  be  emulated  by  a  Moore  machine,  provided  that 
one  cycle  delay  is  allowed  between  input  conditions  and  output 
generation. 


State  Machine  Representation 

The  first  step  in  the  design  methodology  requires  converting 
the  design  problem  into  its  state  diagram  representation.  The 
state  diagram  shown  in  Figure  2  consists  of  state  identifiers, 
input  conditions  and  the  outputs  generated.  The  states  are 
represented  by  bubbles,  with  arrows  indicating  transitions  to 
other  states.  The  input  conditions  as  well  as  outputs  gene- 
rated are  associated  with  these  transitions. 


INPUTS 
OUTPUTS 


Figure  2.  A  Typical  State  Diagram 

There  are  two  major  operational  requirements  for  state  ma- 
chines as  shown  in  Figure  3. 

•  Control  sequencing 

•  Output  generation 

Control  sequencing  is  changing  from  one  state  to  another  in  a 
state  machine.  If  the  transition  from  the  present  state  to  the 
next  state  is  dependent  upon  the  present  state,  it  is  direct 
control  sequencing.  If  the  transition  from  the  present  state  to 
the  next  state  is  dependent  upon  the  present  state  and  the 
input  conditions,  it  is  called  conditional  control  sequencing. 
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As  mentioned  earlier,  the  output  generation  can  be  based  upon 
the  present  state  for  a  Moore  type  of  state  machine,  or  the 
present  state  and  inputs  for  a  Mealy  type  of  state  machine. 
The  state  diagram  representation  includes  both  the  control 
sequencing  and  output  generation  representations. 


INSTRUCTION 
MICROCODE 


INSTRUCTION 
DECODE 


AVAILABLE: 
INPUTS 

PRESENT  STATE 


GENERATE: 
OUTPUTS 
NEXT  STATE 


INSTRUCTION 
EXECUTION 


CONTROL 
SEQUENCING 


DIRECT 
CONTROL 
SEQUENCING 
(BASED  UPON 
PRESENT 
STATE) 


OUTPUT 
GENERATION 


CONDITIONAL 

CONTROL 
SEQUENCING 
(BASED  UPON 
PRESENT 
STATE  AND 
INPUTS) 


MOORE 
MACHINE 
(BASED  UPON 
PRESENT 
STATE) 


MEALY 
MACHINE 
(BASED  UPON 
PRESENT 
STATE  AND 
INPUTS) 


Figure  3.  Instruction  Execution  in  State  Machines 

The  design  process  involves  converting  the  hardware  problem 
to  this  state  machine  representation.  The  system  design  is 
analyzed  for  its  signal  logic  and  timing  requirements.  The  out- 
puts generated  are  based  upon  the  system  signal  logic  re- 
quirements, and  the  state  transitions  depend  upon  the  system 
timing  requirements  of  these  signals. 


Traditional  Microprogrammable 
Sequencers 

Traditional  high-end  microprogrammable  sequencers  fit  this 
general  model  of  state  machines.  They  use  simple  clocked 
D-type  registers  as  pipeline  registers  as  shown  in  Figure  4. 
Typically  they  do  not  have  an  output  decoder  and  allow  only 
Moore  machine  implementations. 
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For  the  next  state  decoder,  microprogrammable  sequencers 
use  a  PROM  array.  Each  location  of  the  PROM  functions  as  a 
state.  PROMs  have  been  traditionally  used  for  this  purpose 
because  of  their  familiarity  and  relative  ease  of  use.  This 
PROM  is  addressed  by  the  pipeline  register  field  storing  the 
present  state.  The  PROM  data  bits  are  inputs  to  the  pipeline 
register  which  are  loaded  at  every  clock  cycle.  The  PROM  data 
consists  of  the  next  state  (branch),  output  logic  values,  and 
an  instruction  op-code. 

Output  generation  in  microprogrammable  sequencers  is  the  di- 
rect transfer  of  PROM  data  to  the  output  field  of  the  pipeline 
register  on  every  clock  cycle. 

For  direct  control  sequencing,  next  state  values  from  the  PROM 
are  loaded  into  the  pipeline  registers  at  every  clock  cycle,  rep- 
resenting a  state  transition.  In  most  microprogrammable  se- 
quencers, a  binary  counting  mechanism  is  also  added  to  the 
state  registers.  This  allows  the  PROM  to  be  addressed  with  a 
state  counter.  This  is  effectively  control  sequencing  with  serial 
binary  state  numbers.  The  PROM  next  state  value  is  used  only 
when  a  non-serial  branch  is  required.  A  branch  select  multiplex- 
er is  used  to  select  between  this  serial  count  and  the  branch  for 
the  next  state  of  the  machine. 

For  conditional  control  sequencing  an  input  condition  testing 
multiplexer  is  used.  Usually  it  tests  for  one  of  the  inputs  at  logic 
high  or  low.  Based  on  this  input  condition,  the  branch  select 
multiplexer  is  controlled,  providing  conditional  branching. 

The  instruction  op-code  stored  in  the  PROM  for  every  state 
controls  the  sequencer  operation  for  that  state.  It  decides  be- 
tween direct  and  conditional  control  sequencing.  It  also  decides 
the  input  condition  to  be  tested.  These  instruction  op-codes  are 
stored  in  the  data  form  and  are  decoded  just  before  execution. 
Extra  hardware  is  required  to  decode  these  instruction  op- 
codes and  generate  internal  multiplexer  control  signals.  This 
slows  down  the  device  operation  by  the  time  required  to  decode 
the  instructions. 

The  combination  of  instruction  op-codes,  output,  and  next  state 
values  is  called  the  instruction  microcode.  The  design  process 
requires  writing  instruction  microcode  based  upon  the  state 
diagram.  This  approach  to  a  microprogrammable  sequencer, 
though  popular,  imposes  several  restrictions  on  the  state  ma- 
chine operation  as  the  instruction  op-codes  are  predefined  and 
fixed.  These  fixea  op-codes  can  only  perform  certain  opera- 
tions and  the  state  machine  design  is  usually  modified  to  fit 
those  restrictions.  This  means  that  the  sequencer  operation  is 
usually  not  optimal  for  the  system  design  problem. 

For  example,  condition  testing,  which  is  restricted  to  one  input 
testing  only,  may  not  fit  a  design  requiring  simultaneous  testing 
of  two  signals.  The  branching  is  also  restricted  to  a  two-way 
branch.  For  multi-way  branching,  a  microprogrammable 
sequencer  requires  multiple  cycles  which  could  disrupt  the 
system  signal  timing  requirements.  In  some  devices  further 
dedicated  hardware  is  added  to  improve  the  functionality  of  the 
sequencer.  Such  hardware  is  usually  under  the  control  of  in- 
struction op-code  and  provides  features  such  as  testing  more 
than  one  input  or  branching  to  more  than  one  next  state. 


Figure  4.  Traditional  Microprogrammable  Sequencer 
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Figure  5.  A  Block  Diagram  for  the  PROSE  Device 

PROSE  Device  Architecture 

The  PROSE  device  architecture  is  very  similar  to  this  model  of  a 
state  machine.  Its  architecture  (Figure  5)  is  optimized  to  per- 
form the  functions  of  a  sequential  state  machine  as  represent- 
ed in  the  model.  For  the  storage  registers,  the  PROSE  device 
provides  a  set  of  twenty-one  pipeline  registers.  These  registers 
are  used  to  store  the  present  state  and  the  outputs  generated 
by  the  machine.  This  is  similar  to  the  microprogrammable 
sequencers.  Beyond  this  however,  the  similarity  ends.  The 
architecture  of  the  PROSE  device  has  been  defined  specially  to 


optimize  its  functionality. 

The  next  state  decoder  in  the  PROSE  device  consists  of  a 
PROM.  This  PROM  stores  the  next  state,  outputs  and  branch 
control  information.  The  output  generation  is  similar  to  micro- 
programmable  sequencers,  and  is  a  direct  transfer  of  PROM 
data  to  the  output  field  of  the  pipeline  register.  There  are  eight 
outputs  in  the  PROSE  PMS14R21  device. 

Control  Sequencing 

Direct  control  sequencing  in  the  PROSE  device  is  implemented 
by  the  PROM.  The  depth  of  the  PROM  is  128,  allowing  up  to  128 
states.  Seven  bits  (A0-A6)  of  the  pipeline  register's  next  state 
field  are  used  to  address  the  PROM  to  provide  the  state  transi- 
tion capability.  For  direct  control  sequencing  the  next  state  val- 
ues from  the  PROM  are  loaded  into  the  pipeline  register's  next 
state  field,  for  a  state  transition. 

Conditional  Control  Sequencing 

For  conditional  control  sequencing,  a  combinatorial  PAL  array  is 
provided.  All  of  the  eight  conditional  inputs  I0-I7  are  direct  in- 
puts to  this  PAL  array  This  PAL  array  provides  two  major  ad- 
vantages over  microprogrammable  sequencers: 

•  It  allows  selection  of  a  combination  of  input  signals  for  condi- 
tion testing.  This  is  very  useful  for  many  system  designs  which 
require  testing  of  a  combination  of  two  or  more  signals  at  partic- 
ular logic  levels.  This  capability  is  not  available  in  most 
microprogrammable  sequencers. 

•  It  allows  multiple  condition  testing  for  multiple  branching. 
Microprogrammable  sequencers  require  more  than  one  cycle 
to  implement  multiple  branching  on  multiple  conditions.  A  four- 


way  branch  can  be  implemented  in  the  PROSE  device  in  a  sin- 
gle cycle  without  disrupting  the  system  timing  requirements. 

Conditional  control  sequencing  also  requires  present  state 
information  for  allowing  state-dependent  conditional 
branching.  The  state  bits  A0-A6  are  not  used  as  inputs  to  the 
PAL  array  to  provide  the  state  information.  An  easier  way  to 
provide  state  information  to  this  PAL  array  is  to  use  extra  bits 
from  the  pipeline  registers  and  PROM  as  its  inputs.  These  six 
bits,  CS0-CS5,  form  the  condition  select  field,  which  allows 
user-definable  bit  combinations  to  be  used  for  any  state.  This 
optimizes  the  use  of  the  PAL  array  product  terms,  since  the 
same  bit  combinations  can  be  used  for  testing  the  same  input 
conditions  for  different  states. 

Branching 

To  implement  branching,  the  two  outputs  of  this  PAL  array,  XO 
and  X1,  are  XORed  with  the  two  most  significant  bits  of  the 
state  address  (XF1  and  XFO)  from  the  pipeline  register.  This 
gives  a  capability  of  inverting  state  bits  A5  and  A6  depending 
upon  the  condition  testing  in  the  PAL  array.  This  conditional 
change  of  next  state  address  provides  a  four-way  branch  cap- 
ability. 

For  direct  control  sequencing,  no  product  term  in  the  PAL  array 
is  selected.  The  programmable  array  outputs  X1  and  XO  are  at 
logic  low  level,  allowing  next  state  bits  A6-A0  to  decide  the  next 
state  directly.  The  conditional  select  bits,  CS5-CS0,  should  not 
select  any  product  terms  from  the  programmable  array  for 
states  with  direct  control  sequencing. 

For  a  two-way  branch,  either  X1  or  XO  is  asserted  HIGH  on  con- 
ditional inputs.  This  causes  a  branch  to  a  location  where  the  bit 
value  of  A6  or  A5  respectively  is  inverted. 

For  three-way  and  four-way  branches  both  X1  and  XO  are 
asserted  high  depending  on  multiple  condition  inputs.  The  se- 
quencer branches  to  locations  where  the  bit  values  of  A6  and 
A5  are  00,  01,  10,  and  11.  The  PROSE  device  allows  up  to  a 
four-way  branch,  which  is  sufficient  for  most  designs. 

In  the  PROSE  device,  the  next  state  decoder  has  been  effec- 
tively partitioned  into  a  PROM  and  a  PAL  device,  to  utilize  the 
efficiencies  of  the  two  for  control  sequencing  and  conditional 
testing  respectively. 
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PROSE  Device  Programming 

Figure  7a  shows  the  state  diagram  representation  of  direct  con- 
trol sequencing;  Figure  7b  shows  conditional  control  sequenc- 
ing. Similarly,  output  generation  can  be  represented  as  shown 
in  Figures  7c  and  7d  for  both  Mealy  and  Moore  machines.  The 
next  step  in  the  PROSE  device  design  process  is  to  convert 
these  state  diagram  representations  into  a  textual  entry  form. 


A.  Direct  Control 
Sequencing 
STATEA  :=VCC->  STATE  B 


B.  Conditional  Control 

STATEC  :=  11  ->  STATED 
->  STATEE 


C.  Moore  Machine 
STATEA.OUTF  :=  01 


D.  Mealy  Machine 
STATEA.OUTF  :=  11  ->01 


Figure  7.  State  Machine  Operations  and  PALASM  2  Software 


The  instruction/operation  mechanism  of  the  PROSE  device, 
however,  is  slightly  different  from  the  microprogrammable  se- 
quencers. The  PROSE  device  uses  PALASM  2  software  for  pro- 
gramming, which  allows  easy  textual  constructs  that  have  a 
direct  relationship  to  the  hardware  operation.  The  instruction  is 
divided  into  two  distinct  segments:  one  for  control  sequencing 
and  one  for  output  generation. 


Control  Sequencing  Instruction 

The  format  for  control  sequencing  instruction  is  as  follows: 
(present_state) 

(condition_1 )  ->  next_state_name_1 
(condition_2)  ->  next_state_name_2 
(condition_3)  ->  next_state_name_3 
default_state_name_4 


Based  upon  the  state  diagram,  for  each  present  state,  transi- 
tions/branches to  other  states  can  be  entered.  These  transi- 
tions take  place  in  the  PROM  of  the  device.  The  conditional  ■ 
inputs  can  also  be  described  in  the  condition  field  of  the  instruc- 


tion syntax.  This  is  the  condition  testing  done  in  the  PAL  array 
of  the  PROSE  device.  For  each  present  state  a  maximum  of 
four  branch  states  can  be  provided,  which  is  the  device  archi- 
tecture limit.  The  logic  values  of  next  states  differ  from  each 
other  by  the  most  significant  two  bits  only.  PALASM  2  software 
automatically  assigns  these  logic  values  to  the  state  names 
while  minimizing  state  locations  in  the  PROM.  PALASM  2  soft- 
ware also  assigns  the  values  to  the  conditional  select  field  while 
ensuring  the  optimization  of  PAL  array  product  terms. 


Output  Generation  Instruction 

The  output  generation  instruction  syntax  is  as  follows: 

(present_state).OUTF 

:=         (condition_1 )  ->  (output_value_1 ) 

+->  (default_output_value) 

The  present  state  is  the  state  where  the  outputs  need  to  be 
generated.  The  default  output  value  is  the  logic  level  of  the 
outputs  at  the  present  state  which  is  also  the  syntax  for  a 
Moore  machine  implementation.  PALASM  2  software  also  al- 
lows a  Mealy  machine  implementation  using  the  constructs  for 
generation  of  conditional  outputs.  As  the  architecture  of  the 
PROSE  device  allows  only  Moore  machine  implementation, 
PALASM  2  software  automatically  converts  the  Mealy  machine 
description  to  the  Moore  machine  equivalent. 

After  the  conversion  of  state  diagram  to  textual  input  is  com- 
pleted, PALASM  2  software  combines  information  about  the 
control  sequencing  and  output  generation  for  each  present 
state.  These  constructs  have  a  direct  relationship  with  the 
state  diagram  representations  as  well  as  the  PROSE  device 
architecture.  This  allows  an  integrated  design  where  system 
design  requirements  represented  by  the  state  diagrams  have  a 
direct  relationship  with  the  device  hardware  operation. 

In  the  PROSE  device,  this  combination  of  control  sequencing 
and  output  generation  is  equivalent  to  the  instruction  op-code 
of  a  microprogrammable  sequencer.  In  microprogrammable  se- 
quencers, the  instruction  op-codes  are  stored  as  data,  and  are 
decoded  before  execution.  PROSE  device  instructions  do  not 
require  decoding  as  they  are  stored  in  the  device  in  logic  form. 
This  speeds  up  the  operation  of  the  PROSE  device.  The  cycle 
time  of  the  PROSE  device  is  25  MHz,  which  is  better  than  most 
other  single-pipelined  sequencers. 

Figures  7a,  b,  c  and  d  also  show  the  direct  PALASM  2  software 
text  representations  of  the  state  diagrams.  These  text  files  are 
compiled  to  generate  the  JEDEC  fuse  maps  for  programming 
the  PROSE  device.  The  procedure  is  identical  to  that  used  to 
program  any  other  PLDs.  PALASM  2  software  also  supports 
software  simulation  of  device  operation,  which  helps  in  design 
debugging  before  actual  programming  of  the  parts. 

The  PROSE  device  incorporates  the  Diagnostics-On-Chip™ 
(DOC™)  testing  capability  with  a  serial  shadow  register.  This 
serial  shadow  register  can  be  used  to  conduct  diagnostics 
such  as  walking-1  or  walking-0  tests.  It  can  also  be  used  for 
testing  the  PROM  array  by  shifting  in  a  test  vector  and  loading 
it  to  the  pipeline  register.  The  pipeline  register  data  for  the  next 
clock  cycle  is  then  checked  for  verification  Part  of  the  DOC 
circuitry  is  also  used  for  programming,  but  this  is  transparent 
to  the  users. 
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With  continuing  advances  in  technolo 
being  introduced  which  are  tailored  to  s 
requirements.  The  PROSE  device  is  one  such  device,  with  an 
architecture  optimized  for  state  machine  design.  It  combines  a 
PROM  array  and  a  PAL  array  on  a  single  device,  and  utilizes 
the  functional  advantages  of  both  for  control  sequencing  and 
conditional  testing  respectively.  This  combination  offers  higher 
functionality  over  the  traditional  approach  of  microprogramma- 
ble  sequencers  for  state  machines.  It  also  offers  a  newer 
design  methodology  where  software  instructions  are  split  into 
control  sequencing  and  output  generation  instructions.  These 
instructions  are  directly  related  to  hardware  requirements, 
unlike  the  instruction  op-codes  of  traditional  microprogramma- 
ble  sequencers,  which  sometimes  require  design  compromis- 
es. The  PROSE  device  also  offers  extensive  simulation,  thus 
offering  a  complete  solution. 


Figure  8.  DIP  Pin  Configuration 


Blazing  Fast  PAL®  Devices  Enable 
New  Application  Areas 
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The  PAL16R8-1 0  Series  from  Monolithic  Memories  is  the  fastest 
TTL  programmable  logic  family,  and  the  ECL  PAL  devices  from 
Mono  ithic  Memories  are  the  fastest  programmable  logic  devices 
of  any  technology. 

Do  these  devices  simply  drop  into  the  same  applications  as 
previous  versions?  No!  These  high-speed  PAL  devices  offer 
several  advantages,  including  speed,  that  open  up  whole  new 
applications  for  programmable  logic  devices. 


The  World's  Fastest  PAL  Devices 
PAL16R8- 10  Series 

Let's  look  at  the  products  under  discussion.  The  PAL16R8-10 
Series  is  comprised  of  the  four  most  popular  PLDs  in  the 
industry  today;  the  PAL16L8,  PAL1 6R8,  PAL16R6,  and  PAL16R4. 
The  architectures  (Figure  1 )  consist  of  sixteen  inputs  and  eight 
outputs,  with  zero  to  eight  flip-flops  on  the  outputs.  Although 
very  simple  compared  to  some  of  the  latest  high-density  devices 
from  Monolithic  Memories,  these  architectures  are  very  power- 
ful, and  handle  the  majority  of  today's  PLD  applications. 


PAL16L8-10 


PAL16R8-10 


PAL16R6-10 


PAL16R4-10 


Figure  1.  The  Four  Devices  Comprising  the  PAL16R8-10  Series.  Each  Has  Eight  Product  Terms  per  Output. 


The  speed  of  the  PAL16R8-10  Series  is  10  nanoseconds  (ns) 
maximum  from  input  to  output  on  any  combinatorial  path.  The 
registered  paths  offer  a  10-ns  setup  time  and  only  8  ns  from 
clock  input  to  data  output.  This  translates  into  a  maximum  clock 
frequency  (fMAX)  of  1/(10  ns  +  8  ns),  or  55.5  MegaHertz  (MHz),  if 
external  feedback  is  used  from  the  flip-flop. 


CLK 


Adapted 


It  is  important  to  differentiate  this  number  from  another  common 
fMAX  calculation,  the  maximum  flip-flop  toggle  rate.  The  toggle 
rate  is  limited  by  the  greater  of  the  data  setup  time  or  the  clock 
widths  high  and  low.  The  toggle  rate  specification  for  the 
PAL16R8-10  Series  is  62.5  MHz.  However,  this  specification  is 
unrealistic  since  it  does  provide  for  feedback.  It  is  a  more 
representative  path  that  provides  the  55.5-MHz  specification 
(Figure  2).  clk 


(SECOND  CHIP) 


fMAX  no  feedback;  1/(tsu  +  th)  or  1/(twh  +  twl) 


fMAX  external;  1/(tCLK  +  tsu) 


Figure 


2.  Alternative  Methods  of  Measuring  (MAX.  A  More  Realistic  Path  Is  the  Second  One,  Including  External 

from  a  paper  presented  at  Electro/87. 
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ECL  PAL  Devices 

The  PAL10H20P8  from  Monolithic  Memories  was  the  first  ECL 
PAL  device.  This  device  is  combinatorial,  with  twenty  inputs  and 
eight  outputs.  A  sequential  device  followed,  the  PAL10H20G8, 


with  the  same  number  of  inputs  and  outputs,  but  having  the 
added  feature  of  output  latches  (Figure  3) 
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Figure  3.  The  ECL  PAL  Devices.  On  the  Left  Is  the  PAL10H20P8  Combinatorial  Device, 
and  on  the  Right  Is  the  PAL10H20G8  Latched  Device. 


The  PAL10H20P8  and  PAL10H20G8  have  combinatorial  propa- 
gation delays  of  only  6  ns,  faster  than  any  other  PLDs.  The 
latched  version  has  a  setup  time  of  4.5  ns  and  a  latch-to-output 
delay  of  only  2.5  ns.  This  provides  a  maximum  frequency  of  over 
142  MHz. 

History  of  Speed  Developments 

A  short  history  of  speed  improvements  in  bipolar  PAL  devices 
shows  how  these  devices  compare  to  earlier  versions  (Figure  4). 


In  this  graph  of  speed  vs.  power,  performance  characteristics 
closer  to  the  lower  left  are  desired.  The  first  PAL  devices, 
introduced  by  Monolithic  Memories  in  the  late  70s,  were  in  the 
35-ns  range,  and  consumed  1 80  mA.  Technology  improvements 
allowed  the  introduction  of  the  first  25-ns  devices  in  1983.  In 
early  1985  the  first  15-ns  devices  were  released,  with  half-power 
versions  at  25  ns  and  quarter-power  versions  at  35  ns.  Last  year 
Monolithic  Memories  introduced  its  industry-leading  10-ns 
PAL16R8-10  Series,  along  with  the  ECL  devices. 


PAL16R8A-4  SERIES 
S5/S0 


150  200 
SUPPLY  CURRENT  -  mA 


Figure  4.  Speed  vs.  Power  for  Monolithic  Memories  Bipolar  PAL  Families.  The  Year  of  Introduction  Is  Noted. 
The  PAL16R8-10  Series  and  the  ECL  PAL  Devices  Offer  the  Highest  Performance  in  the  Industry. 
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New  High-Performance  Applications 

The  raw  speed  of  these  new  devices  allows  their  use  in  new 
high-performance  applications.  As  microprocessors  reach  20 
MHz  and  beyond.  50  MHz  logic  is  required  for  peripheral  and 
memory  interface.  Additionally,  high-speed  static  RAMs 
require  high  speeds  in  the  select  logic. 

One  application  area  for  these  high-speed  PAL  devices  is  high- 
resolution  graphics,  including  counters  for  screen  refresh, 
generation  of  line  frame  and  video  information,  and  micropro- 
cessor interface.  High-speed  digital  processing  requires  10-ns 
PAL  devices  for  data  pipeline  applications  and  on-the-fly 
instruction  decoding.  High-speed  minicomputers  require  very- 
high-speed  state  machines,  and  55.5-MHz  PAL  devices  are  an 
efficient  solution. 

Speeds  Up  Fastest  Discrete  Designs 

PAL  devices  have  always  been  used  to  replace  discrete  logic, 
thereby  reducing  chip  count,  inventory,  and  board  area; 


simplifying  board  design  and  layout:  and  adding  extensive 
design  flexibility.  Now  PAL  device  speed  has  broken  an 
important  barrier;  the  PAL16R8-10  Series  allows  PAL  devices 
to  speed  up  a  discrete  logic  design,  even  one  built  with  the 
fastest  available  devices. 

Comparison  to  FAST  Logic 

The  following  example  compares  the  PAL16R8-10  Series  to 
FAST  logic  in  a  combinatorial  discrete  logic  design.  The 
design  is  an  interface  between  a  MC6800  microprocessor  and 
memory  elements.  Debouncing  for  the  reset  switch  is  also 
required. 

The  discrete  logic  implementation  is  shown  in  Figure  5.  Eight 
discrete  chips  are  required.  However,  since  FAST  logic  does 
not  include  8-input  AND  gates,  three  additional  chips  will  be 
required,  making  a  total  of  eleven  chips  using  FAST  logic.  The 
worst-case  path  delay  is  27.6  ns. 


Figure  5.  Memory  Interface  Built  with  Discrete  Logic.  Eleven 
FAST  Logic  Chips  Are  Required.  The  Worst-Case 
Path  Delay  Is  27.6  ns. 


Blazing  Fast  PAL  Devices  Enable  New  Application  Areas 


Figure  6  shows  the  same  design  implemented  in  a  PAL16L8-10. 
One  device  replaces  eleven  chips,  and  the  new  worst-case  path 
delay  is  now  only  20  ns. 
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Figure  6.  Same  Memory  Interface  Implemented  in  a  PAL 

Device.  Only  One  Chip  Is  Required,  and  the  Delay 
Is  Only  20  ns  Maximum. 


An  example  of  a  sequential  circuit  (Figure  7)  is  a  simple  dice 
game,  providing  two  die  readouts  on  LEDs.  The  discrete  logic 


design  requires  seven  chips.  The  worst-case  path  for  FAST  logic 
has  a  maximum  frequency  of  27.5  MHz. 


Figure  7.  Simple  Dice  Game  Constructed  from  Discrete 
Gates.  Seven  Chips  Are  Required,  with  a 
Maximum  Frequency  of  27.5  MHz. 
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The  entire  design  can  be  implemented  in  one  PAL16R8-10 
(Figure  8).  The  worst-case  path  has  a  maximum  frequency  of 
55.5  MHz,  over  twice  as  fast  as  the  discrete  logic  version.  In 
addition,  power  requirements  remain  almost  the  same:  almost 
1 70  milliAmps  (mA)  for  the  FAST  version,  and  180  mA  maximum 


for  the  PAL  device.  A  halt-power  version  ot  tne  fAL  aevice  win 
require  47%  less  power  than  FAST  logic  while  still  providing 
higher  speed.  Of  course,  the  greatest  benefit  is  a  7:1  chip  count 
reduction,  reducing  size  and  cost  while  increasing  reliability. 


Figure  8.  The  Same  Dice  Game  Implemented  in  a 

PAL16R8-10.  Only  One  Chip  Is  Required,  and  the 
Speed  Is  Doubled,  with  Little  Increase  in  Power 
Requirements. 


The  PAL  16R8-10  Series  devices  can  also  be  compared  to  FAST 
MSI  functions  on  a  one-to-one  basis.  For  example,  the  clock-to- 
output  time  of  a  standard  74F374  octal  register  is  10  ns,  vs.  only 
8  ns  for  the  PAL16R8-10. 

A  straightforward  combinatorial  example  is  a  simple  3:8  decoder 
with  three-state  outputs,  the  74F538.  This  device  has  a  maximum 
propagation  delay  of  1 7  ns,  while  the  PAL1 6L8-1 0  can  implement 
the  same  function  at  only  10  ns,  41%  faster. 

Of  course,  PAL  devices  add  the  additional  capability  of  doing 
custom  variations  of  these  functions,  if  needed.  For  example, 
with  the  substitution  of  a  registered  PAL16R8-10  in  the  74F538 
example,  control  storage  functions  such  as  load  and  clear  can 
be  added. 
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Even  faster  discrete  logic  families  have  been  announced  that 
improve  upon  the  speeds  of  FAST  logic.  Many  of  these  families 
even  use  CMOS  technology.  But  the  fastest  devices  announced 
are  at  100  MHz,  equivalent  to  the  combinatorial  speed  of  the 
PAL16R8-10  Series.  And  the  PAL16R8-10  Series  is  available 
now,  providing  thousands  of  functions  through  its  programma- 
bility.  We  saw  in  the  memory  interface  example  that  the  high- 
performance  logic  families  are  not  as  complete  as  earlier 
families.  PAL  devices  can  replace  this  logic,  or  complement  it 
with  additional  functions. 

We  have  shown  that  the  PAL16R8-10  Series  creates  a  new 
application  area  for  programmable  logic:  the  design  of  higher 
performance  solutions  than  any  discrete  logic  implementations. 
Speed-critical  discrete  logic  designs  are  now  open  to  replace- 
ment by  programmable  logic. 


Blazing  Fast  PAL  Devices  Enable  New  Application  Areas 


Elimination  of  PC  Board  Noise 

This  capability  may  be  arriving  just  in  the  nick  of  time.  Discrete 
logic  is  today  achieving  speeds  that  challenge  printed  circuit 
board  layout.  The  fast  switching  speeds  can  create  noise  and 
crosstalk  that  can  bring  the  high-performance  board  crashing 
down.  Note  the  new  high-speed  CMOS  logic  families  that 
require  new  pinouts  and  extra  grounds  because  of  switching 
noise. 

Integrating  this  logic  into  programmable  logic  eliminates  not 
just  discrete  chips  but  all  those  noisy  and  troublesome  connec- 
tions between  them.  Putting  the  PC  traces  into  silicon  eliminates 
crosstalk  and  signal  distortions  caused  by  transmission  line 
reflections,  and  improves  the  reliability  of  the  board.  In  high- 
speed systems,  the  length  of  the  interconnect  can  approach  the 
signal  wavelength,  causing  ringing  in  the  connection  line.  This 
ringing  can  cause  a  false  signal  value  to  appear  on  an  input  tied 
to  the  ringing  line,  or  can  even  be  fed  back  into  the  source  device 
if  feedback  is  used  from  the  output. 


Improved  Speed 

The  reduction  in  interconnect  afforded  by  PAL  devices,  with  the 
accompanying  reduction  in  inductance  and  time  delays,  even 
increases  system  speed.  In  high-speed  logic  systems,  each 
12-inch  length  of  interconnect  wiring  introduces  a  time  delay  of 
approximately  2  ns,  about  an  entire  gate  delay.  These  delays 
slow  down  the  system  and  can  cause  glitches  and  signal  errors. 
Thus,  integration  of  high-speed  logic  into  PALdevices  improves 
not  just  reliability,  but  also  speed. 


Technology  Advantages 

Two  other  advantages  of  the  PAL16R8-1 0  Series,  lower  cost  and 
reduced  signal  noise,  are  a  result  of  the  technology  used  to 
achieve  its  high  speed.  Speed  improvements  in  semiconductor 
devices  come  about  primarily  through  size  reductions  and 
reduced  capacitance.  Capacitance,  since  it  stores  charge,  slows 
down  devices  as  they  have  to  charge  and  discharge  capacitive 
elements. 

The  25-ns  devices  were  achieved  through  the  reduction  of 
lateral  geometries  in  a  junction-isolated  process.  This  produced 
lower  collector-base  capacitance  and  therefore  provided  higher  speed. 
The  15-ns  devices  used  the  same  junction-isolated  process,  but 
a  reduced  epitaxial  layer  produced  the  higher  speeds. 


Oxide-Isolated  Process 

For  the  10-ns  devices,  Monolithic  Memories  used  a  completely 
new  oxide-isolated  process  to  increase  the  speed  to  record 
levels  Oxide-isolation  allows  the  transistors  to  be  placed  closer 
together  (Figure  9).  This  greatly  reduces  lateral  geometries 
while  reducing  the  parasitic  capacitance  between  circuit  ele- 
ments, providing  much  higher  speed. 


Figure  9.  Basic  Transistor  Isolation  Structure  in  Oxide- 
Isolated  Process.  A  Fully  Recessed  Oxide  Region 
Separates  Transistors,  Allowing  Smaller 
Geometries  and  Higher  Speed. 

The  process  utilizes  a  walled-base  technology.  It  features  1.5- 
micron  design  rules  and  dual-layer  metallization.  The  isolation 
technology  uses  a  combination  of  high-pressure  oxidation  to 
form  a  fully-recessed  oxide  through  the  epitaxial  layer,  and  a 
diffused  guard  band.  This  technology  avoids  any  contact 
between  the  channel  stop  and  the  buried  layer. 

Unique  Needs  of  Programmable  Logic 

Programmable  logic  devices  have  unique  process  needs  because 
of  the  high  voltages  incurred  during  programming.  The  oxide- 
isolation  technique  is  especially  useful  because  of  its  excellent 
insensitivity  to  substrate  injection,  which  occurs  when  small 
transistors  are  driven  into  hard  saturation  during  programming. 

Lower  Manufacturing  Costs 

Another  requirement  for  programmable  products  is  a  good 
high-current  beta  so  that  emitter  size  can  be  minimized.  The 
NPN  transistors  in  this  process  have  been  measured  to  have  a 
beta  of  higher  than  20  at  an  emitter  current  of  30  mA.  In  addition, 
a  reduction  in  the  required  fusing  current  allowed  smaller  transis- 
tors in  the  programming  array.  Separation  of  the  current  sources 
for  programming  and  normal  operation  improves  the  speed- 
power  product.  These  features  add  reliability  to  the  product 
while  reducing  die  size. 

High  Yield 

The  walled  diffusions  improve  yield  by  reducing  manufacturing 
tolerance  requirements.  In  addition,  the  process  is  insensitive  to 
threshold  problems.  The  result  of  these  techniques  is  a  high- 
yielding,  reliable  process  that  produces  both  the  1 0-ns  TTL  PAL 
devices  and  the  6-ns  ECL  PAL  devices. 

Smaller  die  size  and  high  yields  result  in  a  much  lower  cost  of 
manufacturing.  Thus,  once  the  process  is  mature,  it  can  produce 
faster  products  at  the  same  or  less  cost  than  earlier  technologies. 
This  can  be  seen  with  the  15-ns  devices,  which  now  are  cost- 
competitive  with  any  other  technology,  including  CMOS. 

Many  users  of  ICs  claim  that  it  is  impossible  to  find  products  that 
combine  three  elusive  qualities:  good,  fast,  and  cheap.  The 
advanced  technology  used  in  the  high-speed  PAL  products 
allows  them  to  fit'all  three  requirements  well. 
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Blazing  Fast  PAL  Devices  Enable  New  Application  Areas 


Reduced  Noise 

The  process  also  provides  other  important  advantages,  lower 
capacitance  and  reduced  noise-  An  enhanced  base  doping 
allows  lower  resistance  without  higher  capacitance.  Oxide- 
isolated  transistors  provide  much  lower  input  and  output 
capacitance  levels  than  that  provided  by  earlier  technologies. 
The  input  capacitance  is  typically  only  2  picoFarads  (pF),  while 
the  output  capacitance  is  only  4  pF. 

Elimination  of  False  Logic  Switching 

Lower  capacitance  allows  your  system  to  operate  with  cleaner 
switching  signals  between  devices,  providing  a  much  more 
reliable  system.  This  is  critical  in  high-speed  systems  where  fast 
rise  and  fall  times  can  cause  signal  fluctuations. 

Guard  rings  limit  the  susceptibility  of  the  inputs  and  outputs  to 
voltage  undershoots  and  overshoots.  Bipolar  Schottky  circuitry 


is  designed  with  nonlinear  input  and  output  impedances.  In 
high-speed  systems,  undershoots  in  particular  can  play  havoc 
with  false  logic  states,  if  the  guard  rings  do  not  prevent  them. 

New  Synchronization  Applications 

Another  key  advantage  of  the  PAL1 6R8-1 0  Series,  only  indirectly 
related  to  speed,  is  its  excellent  resistance  to  metastability. 
Metastability  is  a  failure  that  occurs  when  required  minimum 
setup  times  are  violated.  The  PAL16R8-10  Series  is  less  suscep- 
tible to  metastability,  and  recovers  faster,  than  any  other  PLD 
and  most  other  forms  of  logic.  This  subject  is  discussed  in  detail 
in  Session  3  of  Electro/87  on  Metastability. 

Metastability  characteristics  are  explicitly  specified  in  the 
PAL16R8-10  Series  datasheet  (Figure  10).  For  example,  at  21- 
MHz  clock  frequency  and  7-MHz  data  input  frequency,  a 
metastability  failure  will  occur  only  once  in  ten  years. 


Metastability  Characteristics  over  operating  conditions 


SYMBOL 

PARAMETER 

TEST  CONDITIONS 

COMMERCIAL 
MIN    TYP  MAX 

UNIT 

I 

P 

Poisson  process  rate 

0.85    1 .05 

ns-1 

k 

MTBF  constant 

0.8        1 .0 

^1 

•met 

Minimum  recovery  time  in  asynchronous  mode 

MTBF  =  10  years  fd  =  (1/3)f   d  =  3 

20  30 

ns 

fMET 

Maximum  frequency  in  asynchronous  mode 

MTBF  =  10  years  fd  =  (1/3)f    d  =  3 

21  26 

MHz 

Figure  10.  Guaranteed  Metastability  Characteristics  for  the 
PAL16R8-10  Series.  The  High  Resistance  to 
Metastability  Allows  New  Use  in  Synchronization 
Applications. 


Thus,  a  PALI  6R8-10  Series  device  can  be  used  as  a  synchronizer 
circuit,  a  new  application  that  doesn't  even  require  its  program- 
mability.  Almost  every  digital  system  must  at  some  point  handle 
incoming  signals  that  are  not  synchronized  to  the  system  clock 
rate,  and  any  such  system  will  encounter  metastability.  Using 
the  PAL16R8-10  Series  devices  at  the  interface  provides  the 
lowest  possible  failure  rate. 

Lower  Metastability  Means  Higher 


For  applications  using  the  PAL16R8-10  Series  as  a  synchronizer, 
the  lower  metastability  rate  means  higher  clock  speed.  For  a 
given  desired  MTBF,  the  maximum  clock  rate  is  limited  by  the 
susceptibility  to  metastability  failure.  The  reduced  metastability 
rate  for  the  PAL1 6R8-1 0  Series  allows  a  higher  clock  speed,  with 
no  reduction  in  MTBF. 

ECL  Provides  Highest  Speed 

The  two  ECL  PAL  devices,  the  PAL10H20P8  and  PAL10H20G8, 
provide  the  fastest  PLDs.  10KH  ECL  technology  also  eliminates 
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many  of  the  noise  problems  encountered  with  high-speed  TTL 
boards  since  the  internal  switching  voltage  range  is  less  than 
one  volt,  and  it  is  often  a  good  alternative  to  high-speed  TTL. 
ECL  design  techniques  achieve  high  speed  by  not  allowing  the 
bipolar  transistors  to  saturate  when  they  are  on.  Thus,  there  is  no 
storage  time  delay  caused  by  switching  in  and  out  of  saturation. 

The  elimination  of  saturated  transistors  means  that  current  is 
always  flowing  in  ECL  devices.  This  is  the  cause  of  high  power 
consumption  in  ECL  circuits.  However,  the  ECL  PAL  devices 
consume  only  210  and  225  mA  for  the  PAL10H20P8  and 
PAL10H20G8,  respectively.  This  is  only  minimally  higher  than 
the  standard  power  consumption  of  TTL  programmable  logic 
circuits. 


New  PLD  Applications  in  ECL  Systems 

However,  these  devices  do  not  just  speed  up  programmable 
logic.  These  devices  are  fully  10KH  compatible  and  meant  to  be 
used  in  10KH  systems.  They  open  up  new  applications  for 
programmable  logic  in  the  highest  performance  ECL  systems. 
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ECL  MSI/LSI  Functions 

Since  the  ECL  market  is  smaller  than  the  TTL  market,  fewer 
functions  are  available  in  ECL  than  TTL.  This  is  true  on  all  levels, 
from  SSI  and  MSI  through  VLSI.  ECL  PAL  devices  provide  a 
flexible  solution  that  can  implement  functions  not  found  in  fixed 
ECL  logic.  Future  ECL  PAL  devices  will  provide  higher  density 
and  be  capable  of  implementing  the  most  complex  LSI  functions. 

Systems  are  built  with  ECL  technology  primarily  for  high 
performance,  and  ECL  PAL  devices  must  also  be  high  perfor- 
mance. Future  ECL  PAL  devices  will  improve  speeds  down  to 
the  3  to  4-ns  range  and  be  able  to  compete  effectively  with  the 
speed  of  fixed  ECL  logic. 


The  Speed  Challenge  of  CMOS 
Technology 

CMOS  technology  is  quickly  encroaching  on  bipolar  speeds.  In 
fact,  Monolithic  Memories  has  CMOS  mask-programmable 
devices  (ZHAL'"  devices)  that  are  even  faster  than  their  bipolar 
equivalents.  However,  CMOS  has  not  yet  reached  the  best 
bipolar  speeds,  and  is  not  likely  to  in  the  near  future.  Even  if  we 
exclude  ECL  from  the  comparisons,  bipolar  has  a  two-generation 
speed  advantage  over  CMOS  technology. 

The  fastest  CMOS  PLDs  available  today  are  at  25  ns,  while  25-ns 
bipolar  devices  have  been  available  for  years  and  are  currently 
being  shipped  in  huge  volumes.  1 5-ns  CMOS  is  in  development, 
but  bipolar  is  already  at  10  ns  and  moving  further  down  the  scale. 

Although  recent  CMOS  developments  have  been  rapid,  CMOS 
is  not  expected  to  pass  bipolar  speeds,  only  to  get  closer.  As 
geometries  are  reduced,  CMOS  has  to  contend  with  short- 
channel  effects  and  hot-electron  trapping  as  scaling  increases 
the  electric  field.  Bipolar  has  fewer  problems  during  reduced 
scaling. 


Speed  vs.  Density 

Many  vendors  of  high-density  CMOS  PLDs  claim  that  they  have 
surpassed  bipolar  speeds  if  examined  at  a  per-gate  basis.  That 
claim  is  relevant  only  if  the  high  densities  are  required.  Even  gate 
array  applications  are  relatively  low  in  density,  with  over  half 
below  the  2000-gate  level.  Additionally,  the  largest  application 
area  for  semicustom  arrays  is  random  logic,  efficiently  met  by 
the  PAL  architecture.  And  even  in  those  cases  where  greater 
density  than  a  single  PAL  device  is  required,  two  PAL16R8-10 
Series  devices  back-to-back  have  a  combined  delay  of  only 
20  ns,  still  faster  than  any  CMOS  PLD  of  any  density. 

Some  industry  observers  have  raised  an  issue  of  speed  vs. 
density,  asking  which  one  will  win  out  in  the  long  run.  But  these 
two  attributes  do  not  compete;  they  are  usually  required  at 
different  times  in  different  applications,  and  thus  both  areas  will 
develop  in  their  own  market  areas. 


Gate  Array  Performance 

Gate  arrays  and  other  forms  of  semicustom  logic  are  demon- 
strating very  high  performance,  even  in  CMOS.  The  sub- 
nanosecond  gate  delays  of  these  devices,  even  when  input  and 
output  delays  are  taken  into  account,  cannot  be  matched  by 
programmable  logic.  For  the  ultimate  in  speed,  a  more  custom- 
ized design  is  better,  with  full  custom  offering  the  highest  speed. 

Programmable  logic  competes  with  other  high-performance 
semicustom  solutions  in  applications  where  the  additional 
advantages  of  zero  NRE,  instant  customization,  ease  of  design 
changes,  inexpensive  and  easy-to-use  software,  standardization, 
low  cost,  second  sourcing,  and  other  issues  weigh  in  favor  of 
programmable  logic. 

Studies  of  gate  arrays  show  that  clock  frequencies  for  systems 
using  gate  arrays  are  about  equal  to  microprocessor  frequen- 
cies; around  8-12  MHz,  moving  to  16  MHz.  But  further  studies 
show  that  76%  of  users  require  at  least  one  very-high-speed 
(twice  the  clock  frequency)  combinatorial  path  through  the  chip. 

Since  programmable  logic  offers  only  one  speed  through  the 
chip,  the  device  speed  must  match  the  speed  of  the  fastest 
required  path.  This  is  one  reason  why  high-speed  PAL  devices 
are  required  even  if  system  clock  rates  are  low. 

CMOS  PLDs  can  be  erasable,  a  feature  bipolar  cannot  match. 
Although  not  erased  very  often  in  customer  applications, 
erasability  adds  to  the  testability  of  the  device  during  manufac- 
turing. Also,  CMOS  programmable  cells  are  smaller  than  bipolar 
fuses,  allowing  a  smaller  die  size.  These  advantages  of  CMOS 
will  allow  it  to  dominate,  in  the  long  term,  the  PLD  market  areas 
that  do  not  require  high  speed. 

Usable  Speed 

Just  as  semicustom  arrays  offer  a  greater  percentage  of  "usable 
gates"  than  PLDs,  or  gates  used  vs.  gates  provided,  PLDs  can 
offer  more  "usable  speed"  than  semicustom  arrays.  Although 
typical  gate  speeds  in  custom  devices  can  be  below  1  ns,  that 
speed  is  not  relevant  if  the  design  and  simulation  tools  do  not 
provide  the  accuracy  required  to  guarantee  proper  timing. 

Since  most  of  a  PAL  device  is  fixed  logic,  guaranteed  maximum 
delays  through  the  device  can  be  specified  for  any  custom 
design.  In  addition,  minimum  delays  for  the  PAL16R8-10  Series 
are  specified  in  the  datasheet  for  the  first  time,  allowing  accurate 
system  timing.  Tools  such  as  PLDMaster  from  Daisy  Systems 
provide  PAL  device  simulation  in  a  board-level  environment. 
PLDMaster  can  even  help  partition  designs  to  take  full  advan- 
tage of  device  speeds. 

The  PAL  device  has  another  advantage  over  other  custom  logic 
that  provides  the  ultimate  in  timing  simulation— a  device  can  be 
programmed  and  immediately  dropped  into  a  circuit,  to  check 
timing.  All  of  these  advantages  mean  that  designers  can  use 
more  of  the  PAL  device's  speed  than  that  of  other  custom  logic, 
giving  a  PAL  device  an  edge  even  at  equivalent  specs. 


^MonoUtMo^Monrories^ 


2-737 
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CMOS  Advantages 

CMOS  has  several  advantages  over  bipolar,  depending  on  the 
technology  used.  In  fact.  CMOS  technology  has  opened  up 
more  new  applications  for  programmable  logic  than  any  other 
development,  because  of  several  unique  characteristics.  It  can 
be  lower  m  power,  can  be  erasable,  and  can  be  srnaller.  Some  of 
these  advantages  are  important  in  some  cases  but  not  important 
in  others 

Low  power  is  an  important  advantage,  but  Monolithic  Memories 
has  been  producing  half-  and  quarter-power  versions  of  its 
products  in  bipolar,  simply  by  increasing  the  resistor  values.  Our 
PAL1 6R8B-4  Series  devices  at  35  ns  and  only  55  mA  are  lower  in 
power  than  many  competing  CMOS  devices  at  similar  speeds. 
The  upcoming  PAL16R8H-15  Series  at  15  ns  and  90  mA  will 
achieve  even  greater  efficiencies.  As  top  bipolar  speeds  improve, 
the  half-power  and  quarter-power  versions  will  continue  at  the 
highest-performance  CMOS  speed  levels. 

CMOS  power  specifications  are  often  only  stated  at  standby, 
with  the  device  not  operating.  Operating  power  for  CMOS  rises 
dramatically  with  increasing  input  frequency  and  can  surpass 
even  medium-power  bipolar  devices  (Figure  11).  For  high-speed 
applications.  CMOS  has  not  shown  a  power  advantage  over 
bipo  ar. 
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Figure  11.  Power  Consumption  vs.  Operating  Frequency  tor 
CMOS  and  Low-Power  Bipolar  PAL  Devices. 
CMOS  Devices  Can  Consume  as  Much  Power  as 
Bipolar  Circuits  at  High  Speeds.  Thus,  for  High- 
Speed  Applications,  CMOS  Loses  One  of  Its  Key 
Advantages  Over  Bipolar. 


Bipolar's  Place  in  Market 

Bipolar  TTL  and  ECL  PAL  devices  will  continue  to  dominate  the 
large  part  of  the  PLD  market  that  requires  high  speed.  Bipolar 
will  even  be  dominant  in  the  low-cost  part  of  the  market  for  years 
to  come. 

Record  Speeds  and  New  Applications 

The  10-ns  PAL16R8-10  Series  and  the  6-ns  ECL  PAL  devices 
provide  record  speeds  for  programmable  logic,  using  advanced 
process  technology.  Future  plans  call  for  7.5-ns  TTL  and  3  to 
4-ns  ECL  PAL  devices.  The  speed  and  technology  open  new 
applications  for  PAL  devices,  including: 

•  Highest-performance  computers  and  digital  processing  ■ 

•  High-speed  microprocessor  and  memory  support 

•  Speeding  up  the  fastest  discrete  logic  designs 

•  Component  integration,  eliminating  noise  and  delays  from 
device  interconnect  lines 

•  Process  technology  advantages  of  lower  cost  and  reduced 
noise 

•  Wide  range  of  LSI  functions  to  complement  high-speed  TTL 
and  ECL  discrete  logic  families 

•  Synchronization  circuits  for  low  metastability  rates 

All  of  these  new  applications  show  that  these  new  PAL  devices 
are  not  just  faster  devices  for  the  same  sockets,  but  rather  open 
up  whole  new  applications  for  programmable  logic  devices. 
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